Agent skill

nano-banana-2

Generate and edit images using Google's Nano Banana 2 (Gemini 3.1 Flash Image Preview) API. This skill should be used when the user asks to create or modify images, especially when they need fast iteration, explicit aspect-ratio control, or resolution control from 512px to 4K.

View SKILL.md on GitHub Repository

Stars 232

Forks 19

Install this agent skill to your Project

npx add-skill https://github.com/intellectronica/agent-skills/tree/main/skills/nano-banana-2

SKILL.md

Nano Banana 2 Image Generation & Editing

Generate new images or edit existing images with Nano Banana 2 (gemini-3.1-flash-image-preview).

Usage

Run the script using absolute or workspace-relative path (do NOT cd into the skill directory first):

Generate new image:

bash

uv run ./.agents/skills/nano-banana-2/scripts/generate_image.py --prompt "your image description" --filename "output-name.png" [--resolution 512px|1K|2K|4K] [--aspect-ratio RATIO] [--api-key KEY]

Edit image(s) with references:

bash

uv run ./.agents/skills/nano-banana-2/scripts/generate_image.py --prompt "editing instructions" --filename "output-name.png" --input-image "path/to/input1.png" [--input-image "path/to/input2.png" ...] [--resolution 512px|1K|2K|4K] [--aspect-ratio RATIO] [--api-key KEY]

Always run from the user's current working directory so outputs are saved in the active project context.

Model

Use model code:

gemini-3.1-flash-image-preview (Nano Banana 2)

Do not use gemini-3-pro-image-preview in this skill.

Resolution Options

Nano Banana 2 supports:

512px (fastest, low-latency iteration)
1K (default)
2K
4K

Map common user language:

"quick draft", "thumbnail", "tiny", "512", "0.5K" → 512px
no resolution mention → 1K
"2K", "medium", "normal" → 2K
"4K", "high-res", "ultra", "print quality" → 4K

Use uppercase K for 1K, 2K, 4K.

Aspect Ratios

Supported aspect ratios:

1:1, 1:4, 1:8, 2:3, 3:2, 3:4, 4:1, 4:3, 4:5, 5:4, 8:1, 9:16, 16:9, 21:9

Map common requests:

square post / icon → 1:1
phone wallpaper / vertical reel → 9:16
widescreen / slide / hero image → 16:9
cinematic wide → 21:9
portrait print → 2:3 or 3:4
panorama banner → 4:1 or 8:1

If unspecified, let the model default behavior apply.

Reference Images (New in Gemini 3 Image workflows)

Provide up to 14 reference images when needed by repeating --input-image:

preserve character consistency
preserve object fidelity
combine multiple visual references into one output

Use one image for simple edits; use multiple images for composition or consistency-sensitive tasks.

API Key

Resolve API key in this order:

--api-key argument
GEMINI_API_KEY environment variable

If no key is available, stop and report a clear error.

Filename Generation

Generate filenames as: yyyy-mm-dd-hh-mm-ss-name.png

Examples:

2026-02-26-17-31-04-japanese-garden.png
2026-02-26-17-31-59-social-banner.png

Prompt Handling

For generation, pass the user's request as-is unless critically underspecified. For editing, include explicit transformation instructions in the prompt and preserve the original intent.

Output

Save PNG to the current directory (or to a provided path in --filename)
Return the full saved path(s)
Do not read the output image back unless explicitly requested

Examples

Create a 4K widescreen image:

bash

uv run ./.agents/skills/nano-banana-2/scripts/generate_image.py --prompt "Futuristic city skyline at blue hour with neon reflections" --filename "2026-02-26-17-45-00-futuristic-skyline.png" --resolution 4K --aspect-ratio 16:9

Edit with multiple references:

bash

uv run ./.agents/skills/nano-banana-2/scripts/generate_image.py --prompt "Create a polished campaign image that keeps the exact logo details and character identity" --filename "2026-02-26-17-50-10-campaign-image.png" --input-image "logo.png" --input-image "character.png" --resolution 2K --aspect-ratio 4:5

Maintainer

intellectronica Core maintainer

Source details

Full Name: intellectronica/agent-skills
Branch: main
Path in repo: skills/nano-banana-2
License: Creative Commons Zero v1.0 Universal
Topics: agent skills

Featured Tools

Join Our Newsletter

Stay updated with the latest AI tools, news, and offers by subscribing to our weekly newsletter.

Recommended Agent Skills

Expand your agent's capabilities with these related and highly-rated skills.

intellectronica/agent-skills

raindrop-api

This skill provides comprehensive instructions for interacting with the Raindrop.io bookmarks service via its REST API using curl and jq. It covers authentication, CRUD operations for collections, raindrops (bookmarks), tags, highlights, filters, import/export, and backups. Use this skill whenever the user asks to work with their bookmarks from Raindrop.io, including reading, creating, updating, deleting, searching, or organising bookmarks and collections.

232 19

Explore

intellectronica/agent-skills

ultrathink

Display colorful ANSI art of the word "ultrathink". Use when the user says "ultrathink" or invokes /ultrathink.

232 19

Explore

intellectronica/agent-skills

nano-banana-pro

Generate and edit images using Google's Nano Banana Pro (Gemini 3 Pro Image) API. Use when the user asks to generate, create, edit, modify, change, alter, or update images. Also use when user references an existing image file and asks to modify it in any way (e.g., "modify this image", "change the background", "replace X with Y"). Supports both text-to-image generation and image-to-image editing with configurable resolution (1K default, 2K, or 4K for high resolution). DO NOT read the image file first - use this skill directly with the --input-image parameter.

232 19

Explore

intellectronica/agent-skills

upstash-redis-kv

Read and write to Upstash Redis-compatible key-value store via REST API. Use when there is a need to save or retrieve key-value data, use Redis features (caching, counters, lists, sets, hashes, sorted sets, etc.) for the current interaction, or when the user explicitly asks to use Upstash or Redis.

232 19

Explore

intellectronica/agent-skills

notion-api

This skill provides comprehensive instructions for interacting with the Notion API via REST calls. This skill should be used whenever the user asks to interact with Notion, including reading, creating, updating, or deleting pages, databases, blocks, comments, or any other Notion content. The skill covers authentication, all available endpoints, pagination, error handling, and best practices.

232 19

Explore

intellectronica/agent-skills

gpt-image-1-5

Generate and edit images using OpenAI's GPT Image 1.5 model. Use when the user asks to generate, create, edit, modify, change, alter, or update images. Also use when user references an existing image file and asks to modify it in any way (e.g., "modify this image", "change the background", "replace X with Y"). Supports text-to-image generation and image editing with optional mask. DO NOT read the image file first - use this skill directly with the --input-image parameter.

232 19

Explore

Didn't find tool you were looking for?

Search AI Tools

Install this agent skill to your Project

SKILL.md

Nano Banana 2 Image Generation & Editing

Usage

Model

Resolution Options

Aspect Ratios

Reference Images (New in Gemini 3 Image workflows)

API Key

Filename Generation

Prompt Handling

Output

Examples

Recommended Agent Skills

raindrop-api

ultrathink

nano-banana-pro

upstash-redis-kv

notion-api

gpt-image-1-5