Agent skill

nanobanana-skill

Generate or edit images using Google Gemini API via nanobanana. Use when the user asks to create, generate, edit images with nanobanana, or mentions image generation/editing tasks.

Stars 152
Forks 20

Install this agent skill to your Project

npx add-skill https://github.com/Microck/ordinary-claude-skills/tree/main/skills_all/nanobanana-skill

SKILL.md

Nanobanana Image Generation Skill

Generate or edit images using Google Gemini API through the nanobanana tool.

Requirements

  1. GEMINI_API_KEY: Must be configured in ~/.nanobanana.env or export GEMINI_API_KEY=<your-api-key>
  2. Python3 with depedent packages installed: google-genai, Pillow, python-dotenv. They could be installed via python3 -m pip install -r ${CLAUDE_PLUGIN_ROOT}/skills/nanobanana-skill/requirements.txt if not installed yet.
  3. Executable: ${CLAUDE_PLUGIN_ROOT}/skills/nanobanana-skill/nanobanana.py

Instructions

For image generation

  1. Ask the user for:

    • What they want to create (the prompt)
    • Desired aspect ratio/size (optional, defaults to 9:16 portrait)
    • Output filename (optional, auto-generates UUID if not specified)
    • Model preference (optional, defaults to gemini-3-pro-image-preview)
    • Resolution (optional, defaults to 1K)
  2. Run the nanobanana script with appropriate parameters:

    bash
    python3 ${CLAUDE_PLUGIN_ROOT}/skills/nanobanana-skill/nanobanana.py --prompt "description of image" --output "filename.png"
    
  3. Show the user the saved image path when complete

For image editing

  1. Ask the user for:

    • Input image file(s) to edit
    • What changes they want (the prompt)
    • Output filename (optional)
  2. Run with input images:

    bash
    python3 ${CLAUDE_PLUGIN_ROOT}/skills/nanobanana-skill/nanobanana.py --prompt "editing instructions" --input image1.png image2.png --output "edited.png"
    

Available Options

Aspect Ratios (--size)

  • 1024x1024 (1:1) - Square
  • 832x1248 (2:3) - Portrait
  • 1248x832 (3:2) - Landscape
  • 864x1184 (3:4) - Portrait
  • 1184x864 (4:3) - Landscape
  • 896x1152 (4:5) - Portrait
  • 1152x896 (5:4) - Landscape
  • 768x1344 (9:16) - Portrait (default)
  • 1344x768 (16:9) - Landscape
  • 1536x672 (21:9) - Ultra-wide

Models (--model)

  • gemini-3-pro-image-preview (default) - Higher quality
  • gemini-2.5-flash-image - Faster generation

Resolution (--resolution)

  • 1K (default)
  • 2K
  • 4K

Examples

Generate a simple image

bash
python3 ${CLAUDE_PLUGIN_ROOT}/skills/nanobanana-skill/nanobanana.py --prompt "A serene mountain landscape at sunset with a lake"

Generate with specific size and output

bash
python3 ${CLAUDE_PLUGIN_ROOT}/skills/nanobanana-skill/nanobanana.py \
  --prompt "Modern minimalist logo for a tech startup" \
  --size 1024x1024 \
  --output "logo.png"

Generate landscape image with high resolution

bash
python3 ${CLAUDE_PLUGIN_ROOT}/skills/nanobanana-skill/nanobanana.py \
  --prompt "Futuristic cityscape with flying cars" \
  --size 1344x768 \
  --resolution 2K \
  --output "cityscape.png"

Edit existing images

bash
python3 ${CLAUDE_PLUGIN_ROOT}/skills/nanobanana-skill/nanobanana.py \
  --prompt "Add a rainbow in the sky" \
  --input photo.png \
  --output "photo-with-rainbow.png"

Use faster model

bash
python3 ${CLAUDE_PLUGIN_ROOT}/skills/nanobanana-skill/nanobanana.py \
  --prompt "Quick sketch of a cat" \
  --model gemini-2.5-flash-image \
  --output "cat-sketch.png"

Error Handling

If the script fails:

  • Check that GEMINI_API_KEY is exported or set in ~/.nanobanana.env
  • Verify input image files exist and are readable
  • Ensure the output directory is writable
  • If no image is generated, try making the prompt more specific about wanting an image

Best Practices

  1. Be descriptive in prompts - include style, mood, colors, composition
  2. For logos/graphics, use square aspect ratio (1024x1024)
  3. For social media posts, use 9:16 for stories or 1:1 for posts
  4. For wallpapers, use 16:9 or 21:9
  5. Start with 1K resolution for testing, upgrade to 2K/4K for final output
  6. Use gemini-3-pro-image-preview for best quality, gemini-2.5-flash-image for speed

Expand your agent's capabilities with these related and highly-rated skills.

Microck/ordinary-claude-skills

nondominium-holochain-dna-dev

Specialized skill for nondominium Holochain DNA development, focusing on zome creation, entry patterns, integrity/coordinator architecture, ValueFlows compliance, and WASM optimization. Use when creating new zomes, implementing entry types, or modifying Holochain DNA code.

152 20
Explore
Microck/ordinary-claude-skills

fluidsim

Framework for computational fluid dynamics simulations using Python. Use when running fluid dynamics simulations including Navier-Stokes equations (2D/3D), shallow water equations, stratified flows, or when analyzing turbulence, vortex dynamics, or geophysical flows. Provides pseudospectral methods with FFT, HPC support, and comprehensive output analysis.

152 20
Explore
Microck/ordinary-claude-skills

metabolomics-workbench-database

Access NIH Metabolomics Workbench via REST API (4,200+ studies). Query metabolites, RefMet nomenclature, MS/NMR data, m/z searches, study metadata, for metabolomics and biomarker discovery.

152 20
Explore
Microck/ordinary-claude-skills

run-tests

Validate code changes by intelligently selecting and running the appropriate test suites. Use this when editing code to verify changes work correctly, run tests, validate functionality, or check for regressions. Automatically discovers affected test suites, selects the minimal set of venvs needed for validation, and handles test execution with Docker services as needed.

152 20
Explore
Microck/ordinary-claude-skills

skill-navigator

The 100th skill! Your intelligent guide to all 99 other skills. Recommends the perfect skill for any task, creates skill combinations, and helps you discover capabilities you didn't know you had.

152 20
Explore
Microck/ordinary-claude-skills

AgentDB Advanced Features

Master advanced AgentDB features including QUIC synchronization, multi-database management, custom distance metrics, hybrid search, and distributed systems integration. Use when building distributed AI systems, multi-agent coordination, or advanced vector search applications.

152 20
Explore

Didn't find tool you were looking for?

Be as detailed as possible for better results