Agent skill

gemini-imagen

Generate images using Google Gemini's image generation APIs via a Python CLI. Use for "generate image", "create image", "gemini image", "AI image", or whenever AI image generation is needed in Claude Code.

Install this agent skill to your Project

npx add-skill https://github.com/notque/gemini-imagen/tree/main/skills/gemini-imagen

SKILL.md

Gemini Imagen

Generate images from text prompts using Google's Gemini APIs. This plugin gives Claude Code the ability to generate images directly.

Quick Start

bash

# Generate an image
python3 ~/.claude/plugins/gemini-imagen/skills/gemini-imagen/scripts/generate_image.py \
  --prompt "A cute cartoon cat" \
  --output cat.png

CRITICAL: Exact Model Names

Use ONLY these exact model strings:

Model String	Speed	Best For
`gemini-2.5-flash-image`	Fast (2-5s)	Drafts, iterations
`gemini-3-pro-image-preview`	Slower (5-15s)	Quality, text rendering, 2K

Common mistakes:

gemini-2.5-flash-preview-05-20 - WRONG (date suffixes are for text models)
gemini-2.5-pro-image - WRONG (doesn't exist)
gemini-3-flash-image - WRONG (doesn't exist)
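
A wrapper can catch the mistakes listed above before any request is made. The following is a minimal sketch, not part of the plugin: `ALLOWED_MODELS` and `require_known_model` are hypothetical names, and the set holds only the two strings from the table.

```python
# The two model strings listed in the table above (assumed to be the full set).
ALLOWED_MODELS = frozenset({
    "gemini-2.5-flash-image",
    "gemini-3-pro-image-preview",
})


def require_known_model(model: str) -> str:
    """Return `model` unchanged, or raise ValueError if it is not in the table."""
    if model not in ALLOWED_MODELS:
        raise ValueError(
            f"Unknown model {model!r}; expected one of {sorted(ALLOWED_MODELS)}"
        )
    return model
```

Failing early with the list of accepted strings makes a typo such as `gemini-3-flash-image` visible before any request is sent.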

Instructions

Step 1: Check API Key

bash

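# Prints an explicit message in both cases (the key itself is never echoed)
[ -n "${GEMINI_API_KEY:-}" ] && echo "GEMINI_API_KEY is set" || echo "GEMINI_API_KEY is not set"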

If not set, tell the user to run /imagen:setup.
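
The same check can be made from Python, for example before launching the script from another tool. This is a sketch under the assumption that a non-empty `GEMINI_API_KEY` is all that is required; `gemini_key_is_set` is a hypothetical helper, not something the plugin provides.

```python
import os


def gemini_key_is_set(env=None) -> bool:
    """Return True when GEMINI_API_KEY is present and non-empty.

    `env` defaults to the real process environment; passing a dict
    makes the function easy to exercise without touching os.environ.
    """
    env = os.environ if env is None else env
    return bool(env.get("GEMINI_API_KEY"))
```

The function only reports presence and never returns or prints the key value.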

Step 2: Install Dependencies

bash

pip install google-genai Pillow

Step 3: Generate Image

bash

python3 ~/.claude/plugins/gemini-imagen/skills/gemini-imagen/scripts/generate_image.py \
  --prompt "YOUR PROMPT HERE" \
  --output /path/to/output.png
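
When the script is driven from Python rather than a shell, the command above might be assembled as follows. This is only a sketch: `build_command` and `generate` are hypothetical helpers, and the script path is the one shown in this document.

```python
import os
import subprocess

# Script location as given in this document; adjust if the plugin lives elsewhere.
SCRIPT = os.path.expanduser(
    "~/.claude/plugins/gemini-imagen/skills/gemini-imagen/scripts/generate_image.py"
)


def build_command(prompt, output, model=None):
    """Build the argument list for generate_image.py without running it."""
    cmd = ["python3", SCRIPT, "--prompt", prompt, "--output", str(output)]
    if model is not None:
        cmd += ["--model", model]
    return cmd


def generate(prompt, output, model=None) -> int:
    """Run the script and return its exit code (0 means success)."""
    return subprocess.run(build_command(prompt, output, model), check=False).returncode
```

Passing the arguments as a list avoids shell quoting problems when a prompt contains quotes or other special characters.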

Step 4: Verify Output

bash

ls -la /path/to/output.png
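
`ls` confirms only that a file exists. A slightly stronger check, sketched below, reads the first eight bytes and compares them with the standard PNG signature. It assumes the script writes PNG data, as the `.png` output path suggests; `looks_like_png` is a hypothetical helper.

```python
from pathlib import Path

# Every PNG file starts with this fixed 8-byte signature.
PNG_SIGNATURE = b"\x89PNG\r\n\x1a\n"


def looks_like_png(path) -> bool:
    """Return True if `path` is an existing file that starts with the PNG signature."""
    p = Path(path)
    if not p.is_file():
        return False
    with p.open("rb") as fh:
        return fh.read(len(PNG_SIGNATURE)) == PNG_SIGNATURE
```

This does not validate the whole image, but it catches empty files and error text saved under a `.png` name.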

Model Selection

Use Case	Model	Why
Iterating on prompts	`gemini-2.5-flash-image`	Fast feedback (2-5s)
Final asset	`gemini-3-pro-image-preview`	Best quality
Game sprites	`gemini-2.5-flash-image`	Many images, consistent
Text in image	`gemini-3-pro-image-preview`	Better typography
Batch generation	`gemini-2.5-flash-image`	Cost effective

Post-Processing Options

Remove Watermarks (`--remove-watermark`)

Removes bright pixels from the corners of the image, clearing corner watermarks from generated output.
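
The document does not show how the script does this, so the following is only an illustration of the general idea on a plain grid of RGB tuples. The corner size, brightness threshold, fill color, and the name `clear_bright_corners` are all assumptions made for the sketch.

```python
def clear_bright_corners(pixels, size=2, threshold=200, fill=(0, 0, 0)):
    """Replace bright pixels inside the four corner squares with `fill`.

    `pixels` is a list of rows, each row a list of (r, g, b) tuples.
    A pixel counts as bright when every channel exceeds `threshold`.
    Returns a new grid; the input is left untouched.
    """
    height = len(pixels)
    width = len(pixels[0]) if height else 0
    out = [list(row) for row in pixels]
    # Rows in the top and bottom bands, columns in the left and right bands;
    # their intersections are exactly the four corner squares.
    rows = set(range(min(size, height))) | set(range(max(height - size, 0), height))
    cols = set(range(min(size, width))) | set(range(max(width - size, 0), width))
    for y in rows:
        for x in cols:
            if min(out[y][x]) > threshold:
                out[y][x] = fill
    return out
```

Restricting the test to the corner squares leaves bright areas elsewhere in the image untouched.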

Background Transparency (`--transparent-bg`)

Converts a solid-color background to transparent, which is useful for sprites and icons.

bash

python3 generate_image.py \
  --prompt "Character on gray background" \
  --output char.png \
  --remove-watermark \
  --transparent-bg
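
Background transparency of this kind is usually a color-key operation. The sketch below shows the idea on a flat list of RGB tuples, assuming that the keyed color corresponds to `--bg-color` (default `#3a3a3a`). The tolerance value and the function names are assumptions, not the script's actual implementation.

```python
def hex_to_rgb(value):
    """Convert a hex color such as '#3a3a3a' to an (r, g, b) tuple."""
    value = value.lstrip("#")
    return tuple(int(value[i:i + 2], 16) for i in (0, 2, 4))


def key_out_background(pixels, bg_color="#3a3a3a", tolerance=10):
    """Return RGBA tuples with alpha 0 wherever a pixel is close to `bg_color`.

    `pixels` is an iterable of (r, g, b) tuples. A pixel matches when every
    channel is within `tolerance` of the background color.
    """
    bg = hex_to_rgb(bg_color)
    result = []
    for r, g, b in pixels:
        is_bg = all(abs(c - t) <= tolerance for c, t in zip((r, g, b), bg))
        result.append((r, g, b, 0 if is_bg else 255))
    return result
```

A small tolerance absorbs slight color noise in the generated background, at the cost of also clearing subject pixels that happen to be near the key color.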

Batch Generation

Generate multiple images from a file:

bash

# prompts.txt contains one prompt per line
python3 generate_image.py \
  --batch prompts.txt \
  --output-dir ./images/
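
To make the one-prompt-per-line format concrete, here is a sketch of how such a file could be parsed and mapped to output names. Skipping blank lines and the `NNN-slug.png` naming scheme are assumptions for illustration; the document does not say how the script names batch output.

```python
import re


def read_prompts(text):
    """Return the stripped, non-empty lines of a prompts file's contents."""
    return [line.strip() for line in text.splitlines() if line.strip()]


def output_name(index, prompt, max_len=40):
    """Build a hypothetical file name such as '001-a-cute-cartoon-cat.png'."""
    slug = re.sub(r"[^a-z0-9]+", "-", prompt.lower())[:max_len].strip("-")
    return f"{index:03d}-{slug or 'image'}.png"
```

For example, `output_name(1, "A cute cartoon cat")` yields `001-a-cute-cartoon-cat.png`.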

Error Handling

Error	Solution
`GEMINI_API_KEY not set`	Run `/imagen:setup`
`Rate limit (429)`	Wait 60s; the script retries automatically
`Content policy (400)`	Modify prompt
`No image in response`	Add more detail to prompt
`Pillow not installed`	Run `pip install Pillow`
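
The rate-limit row describes a wait-and-retry pattern. A minimal sketch of that pattern follows; `RateLimitError` is a hypothetical stand-in for whatever a 429 response raises, and the attempt count is an assumption (only the 60-second wait comes from the table).

```python
import time


class RateLimitError(Exception):
    """Hypothetical stand-in for an HTTP 429 response."""


def call_with_retry(fn, attempts=3, wait_seconds=60.0, sleep=time.sleep):
    """Call `fn`, waiting and retrying when it raises RateLimitError.

    The final failure is re-raised so the caller can report it.
    `sleep` is injectable so the wait can be skipped in tests.
    """
    for attempt in range(1, attempts + 1):
        try:
            return fn()
        except RateLimitError:
            if attempt == attempts:
                raise
            sleep(wait_seconds)
```

Only rate-limit errors are retried; content-policy errors need a changed prompt, so repeating the same request would not help.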

Script Reference

Location: scripts/generate_image.py

Argument	Required	Description
`--prompt`	Yes*	Text prompt
`--output`	Yes*	Output file path (.png)
`--model`	No	Model (default: gemini-3-pro-image-preview)
`--remove-watermark`	No	Remove corner watermarks
`--transparent-bg`	No	Make background transparent
`--bg-color`	No	Background hex color (default: #3a3a3a)
`--batch`	No	Prompts file (one per line)
`--output-dir`	No	Directory for batch output

*Required unless using --batch
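
The argument table maps naturally onto `argparse`. The sketch below mirrors the flags and defaults listed above. It is not the script's actual source, and the `error` override (so that bad arguments exit with status 3 instead of argparse's default of 2) is an assumption about how the documented exit codes could be met.

```python
import argparse
import sys


class Parser(argparse.ArgumentParser):
    def error(self, message):
        # argparse exits with status 2 by default; use 3 for invalid arguments.
        self.print_usage(sys.stderr)
        self.exit(3, f"{self.prog}: error: {message}\n")


def build_parser():
    parser = Parser(prog="generate_image.py")
    parser.add_argument("--prompt", help="Text prompt")
    parser.add_argument("--output", help="Output file path (.png)")
    parser.add_argument("--model", default="gemini-3-pro-image-preview")
    parser.add_argument("--remove-watermark", action="store_true")
    parser.add_argument("--transparent-bg", action="store_true")
    parser.add_argument("--bg-color", default="#3a3a3a")
    parser.add_argument("--batch", help="Prompts file (one per line)")
    parser.add_argument("--output-dir", help="Directory for batch output")
    return parser


def parse_args(argv):
    parser = build_parser()
    args = parser.parse_args(argv)
    # --prompt and --output are required unless --batch is given.
    if not args.batch and not (args.prompt and args.output):
        parser.error("--prompt and --output are required unless --batch is given")
    return args
```

Checking the prompt/output requirement after parsing keeps both flags optional in batch mode.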

Exit Codes:

0: Success
1: Missing API key
2: Generation failed
3: Invalid arguments
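
A caller that runs the script as a subprocess can translate these codes into messages. This is a small sketch, assuming the four codes above are the only ones used; `describe_exit` is a hypothetical helper.

```python
# Exit codes as documented above.
EXIT_MEANINGS = {
    0: "Success",
    1: "Missing API key",
    2: "Generation failed",
    3: "Invalid arguments",
}


def describe_exit(code: int) -> str:
    """Return the documented meaning of an exit code, or a fallback message."""
    return EXIT_MEANINGS.get(code, f"Unknown exit code {code}")
```

The fallback covers statuses the table does not list, such as a process killed by a signal.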

What This Plugin CAN Do

Generate images from text prompts
Select between fast and quality models
Remove watermarks from images
Make backgrounds transparent
Batch generate multiple images

What This Plugin CANNOT Do

Use non-Gemini models (DALL-E, Midjourney, Stable Diffusion)
Generate video or audio
Bypass content policy restrictions

Maintainer

notque Core maintainer

Source details

Full Name: notque/gemini-imagen
Branch: main
Path in repo: skills/gemini-imagen
License: MIT License
