Agent skill

nanobanana-skill

Generate or edit images using Google Gemini API via nanobanana. Triggers: "nanobanana", "generate image", "create image", "edit image", "AI drawing", "图片生成", "AI绘图", "图片编辑", "生成图片".

View SKILL.md on GitHub Repository

Stars 1,393

Forks 207

Install this agent skill to your Project

npx add-skill https://github.com/feiskyer/claude-code-settings/tree/main/plugins/nanobanana-skill/skills/nanobanana-skill

SKILL.md

Nanobanana Image Generation Skill

Generate or edit images using Google Gemini API through the nanobanana tool.

Requirements

GEMINI_API_KEY: Must be configured in ~/.nanobanana.env or export GEMINI_API_KEY=<your-api-key>
Python3 with dependent packages installed: google-genai, Pillow, python-dotenv. They could be installed via python3 -m pip install -r ${CLAUDE_PLUGIN_ROOT}/skills/nanobanana-skill/requirements.txt if not installed yet.
Executable: ${CLAUDE_PLUGIN_ROOT}/skills/nanobanana-skill/nanobanana.py

Instructions

For image generation

Ask the user for:
- What they want to create (the prompt)
- Desired aspect ratio/size (optional, defaults to 9:16 portrait)
- Output filename (optional, auto-generates UUID if not specified)
- Model preference (optional, defaults to gemini-3.1-flash-image-preview)
- Resolution (optional, defaults to 1K)

Run the nanobanana script with appropriate parameters:

bash

python3 ${CLAUDE_PLUGIN_ROOT}/skills/nanobanana-skill/nanobanana.py --prompt "description of image" --output "filename.png"

Show the user the saved image path when complete

For image editing

Ask the user for:
- Input image file(s) to edit
- What changes they want (the prompt)
- Output filename (optional)

Run with input images:

bash

python3 ${CLAUDE_PLUGIN_ROOT}/skills/nanobanana-skill/nanobanana.py --prompt "editing instructions" --input image1.png image2.png --output "edited.png"

Available Options

Aspect Ratios (--size)

1024x1024 (1:1) - Square
832x1248 (2:3) - Portrait
1248x832 (3:2) - Landscape
864x1184 (3:4) - Portrait
1184x864 (4:3) - Landscape
896x1152 (4:5) - Portrait
1152x896 (5:4) - Landscape
768x1344 (9:16) - Portrait (default)
1344x768 (16:9) - Landscape
1536x672 (21:9) - Ultra-wide

Models (--model)

gemini-3.1-flash-image-preview (default) - Latest, fast generation
gemini-3-pro-image-preview - Higher quality, supports thinking/reasoning

Resolution (--resolution)

1K (default)
2K
4K

Other Options

--no-search - Disable Google Search grounding (enabled by default)
--no-think - Disable thinking/reasoning mode

Examples

Generate a simple image

bash

python3 ${CLAUDE_PLUGIN_ROOT}/skills/nanobanana-skill/nanobanana.py --prompt "A serene mountain landscape at sunset with a lake"

Generate with specific size and output

bash

python3 ${CLAUDE_PLUGIN_ROOT}/skills/nanobanana-skill/nanobanana.py \
  --prompt "Modern minimalist logo for a tech startup" \
  --size 1024x1024 \
  --output "logo.png"

Generate landscape image with high resolution

bash

python3 ${CLAUDE_PLUGIN_ROOT}/skills/nanobanana-skill/nanobanana.py \
  --prompt "Futuristic cityscape with flying cars" \
  --size 1344x768 \
  --resolution 2K \
  --output "cityscape.png"

Edit existing images

bash

python3 ${CLAUDE_PLUGIN_ROOT}/skills/nanobanana-skill/nanobanana.py \
  --prompt "Add a rainbow in the sky" \
  --input photo.png \
  --output "photo-with-rainbow.png"

Use pro model for higher quality

bash

python3 ${CLAUDE_PLUGIN_ROOT}/skills/nanobanana-skill/nanobanana.py \
  --prompt "Detailed portrait of a cat in watercolor style" \
  --model gemini-3-pro-image-preview \
  --output "cat-portrait.png"

Error Handling

If the script fails:

Check that GEMINI_API_KEY is exported or set in ~/.nanobanana.env
Verify input image files exist and are readable
Ensure the output directory is writable
If no image is generated, try making the prompt more specific about wanting an image

Best Practices

Be descriptive in prompts - include style, mood, colors, composition
For logos/graphics, use square aspect ratio (1024x1024)
For social media posts, use 9:16 for stories or 1:1 for posts
For wallpapers, use 16:9 or 21:9
Start with 1K resolution for testing, upgrade to 2K/4K for final output
Use gemini-3-pro-image-preview for best quality, gemini-3.1-flash-image-preview (default) for speed

Maintainer

feiskyer Core maintainer

Source details

Full Name: feiskyer/claude-code-settings
Branch: main
Path in repo: plugins/nanobanana-skill/skills/nanobanana-skill
License: MIT License
Topics: ai claude-code kiro agents codex claude-skills vibe-coding agentic-ai copilot litellm spec-driven-development

Featured Tools

Join Our Newsletter

Stay updated with the latest AI tools, news, and offers by subscribing to our weekly newsletter.

Recommended Agent Skills

Expand your agent's capabilities with these related and highly-rated skills.

feiskyer/claude-code-settings

kiro-skill

Interactive feature development workflow from idea to implementation. Creates requirements (EARS format), design documents, and task lists. Triggers: "kiro", ".kiro/specs/", "feature spec", "需求文档", "设计文档", "实现计划".

1,393 207

Explore

feiskyer/claude-code-settings

github-review-pr

Review GitHub pull requests with detailed, multi-perspective code analysis using parallel subagents. Use this skill whenever the user wants to review a PR, asks for code review on a pull request, mentions "review PR", "check this PR", "look at pull request", or references a PR number or GitHub PR URL. Also trigger when the user wants feedback on code changes, wants to approve or request changes on a PR, or asks to review someone's contribution.

1,393 207

Explore

feiskyer/claude-code-settings

autonomous-skill

Execute long-running, multi-session tasks autonomously using Claude Code headless mode or in-session hook-based loops. Supports structured task decomposition (for complex projects) and lightweight Ralph-style iteration (for TDD, bug fixing, refactoring). Use this skill whenever the user says "autonomous", "long-running task", "multi-session", "run this in the background", "keep working on this", "batch process", "iterate until done", "ralph loop", or wants any task that requires sustained, unattended execution.

1,393 207

Explore

feiskyer/claude-code-settings

deep-research

深度调研的多Agent编排工作流：把一个调研目标拆成可并行子目标，用 Claude Code 非交互模式（`claude -p`）运行子进程；联网与采集优先使用已安装的 skills，其次使用 MCP 工具；用脚本聚合子结果并分章精修，最终交付"成品报告文件路径 + 关键结论/建议摘要"。用于：系统性网页/资料调研、竞品/行业分析、批量链接/数据集分片检索、长文写作与证据整合，或用户提及"深度调研/Deep Research/Wide Research/多 Agent 并行调研/多进程调研"等场景。

1,393 207

Explore

feiskyer/claude-code-settings

translate

Translate English or Japanese tech articles and texts into natural, fluent Chinese. Use this skill when the user wants to translate text to Chinese, asks for Chinese translation, mentions "translate to Chinese", "翻译", provides English/Japanese tech content for translation, or wants any text converted into Chinese. Also trigger when the user pastes text and asks to translate it, or references a file to translate into Chinese.

1,393 207

Explore

feiskyer/claude-code-settings

spec-kit-skill

GitHub Spec-Kit integration for constitution-based spec-driven development. 7-phase workflow. Triggers: "spec-kit", "speckit", "constitution", "specify", ".specify/", "规格驱动开发", "需求规格".

1,393 207

Explore

Didn't find tool you were looking for?

Search AI Tools

Install this agent skill to your Project

SKILL.md

Nanobanana Image Generation Skill

Requirements

Instructions

For image generation

For image editing

Available Options

Aspect Ratios (--size)

Models (--model)

Resolution (--resolution)

Other Options

Examples

Generate a simple image

Generate with specific size and output

Generate landscape image with high resolution

Edit existing images

Use pro model for higher quality

Error Handling

Best Practices

Recommended Agent Skills

kiro-skill

github-review-pr

autonomous-skill

deep-research

translate

spec-kit-skill