Agent skill

generate-image

Generate or edit images using AI models (FLUX, Gemini). Use for scientific illustrations, diagrams, schematics, infographics, concept visualizations, and artistic images. Supports image editing to modify existing images (change colors, add/remove elements, style transfer). Useful for figures, posters, and visual explanations.


Install this agent skill to your Project

npx add-skill https://github.com/Microck/ordinary-claude-skills/tree/main/skills_all/claude-scientific-skills/scientific-skills/generate-image

SKILL.md

Generate Image

Generate and edit high-quality images using OpenRouter's image generation models, including FLUX.2 Pro and Nano Banana Pro (Gemini 3 Pro).

Quick Start

Use the scripts/generate_image.py script to generate or edit images:

```bash
# Generate a new image
python scripts/generate_image.py "A beautiful sunset over mountains"

# Edit an existing image
python scripts/generate_image.py "Make the sky purple" --input photo.jpg
```

The generated or edited image is saved as generated_image.png in the current directory unless you pass --output.

API Key Setup

CRITICAL: The script requires an OpenRouter API key. Before running, check if the user has configured their API key:

1. Look for a .env file in the project directory or its parent directories.
2. Check the .env file for an OPENROUTER_API_KEY=<key> entry.
3. If no key is found, tell the user to get an API key from https://openrouter.ai/keys and then either:
   - Create a .env file containing OPENROUTER_API_KEY=your-api-key-here
   - Or set the environment variable: export OPENROUTER_API_KEY=your-api-key-here

The script will automatically detect the .env file and provide clear error messages if the API key is missing.
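The lookup described above can be sketched in a few lines of Python. This is an illustration only, not the code in scripts/generate_image.py: the helper names (find_env_file, read_key_from_env_file, resolve_api_key) are hypothetical, and checking the environment variable before the .env file is an assumption, since the skill does not state which one takes precedence.

```python
import os
from pathlib import Path
from typing import Optional


def find_env_file(start: Path) -> Optional[Path]:
    """Return the nearest .env file in `start` or any of its parent directories."""
    start = start.resolve()
    for directory in (start, *start.parents):
        candidate = directory / ".env"
        if candidate.is_file():
            return candidate
    return None


def read_key_from_env_file(env_file: Path, name: str = "OPENROUTER_API_KEY") -> Optional[str]:
    """Parse simple NAME=value lines, skipping blank lines and # comments."""
    for raw_line in env_file.read_text(encoding="utf-8").splitlines():
        line = raw_line.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        if key.strip() == name:
            # Drop optional surrounding quotes around the value.
            value = value.strip().strip("'\"")
            return value or None
    return None


def resolve_api_key(start: Path) -> Optional[str]:
    """Assumed order: environment variable first, then the nearest .env file."""
    key = os.environ.get("OPENROUTER_API_KEY")
    if key:
        return key
    env_file = find_env_file(start)
    return read_key_from_env_file(env_file) if env_file else None
```

Calling resolve_api_key(Path.cwd()) before running the script gives a quick way to tell whether the user still needs to configure a key.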

Model Selection

Default model: google/gemini-3-pro-image-preview (high quality, recommended)

Available models for generation and editing:

google/gemini-3-pro-image-preview - High quality, supports generation + editing
black-forest-labs/flux.2-pro - Fast, high quality, supports generation + editing

Generation only:

black-forest-labs/flux.2-dev - Development version, generation only

Select based on:

Quality: Use gemini-3-pro or flux.2-pro
Editing: Use gemini-3-pro or flux.2-pro (both support image editing)
Cost: Use flux.2-dev if cost matters and you only need generation (it does not support editing)
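The selection criteria above could be encoded as a small helper. This is only a sketch: choose_model and supports_editing are hypothetical names that do not exist in the skill, and the only facts used are the model IDs and capabilities listed in this section.

```python
# Model IDs as listed in this section.
DEFAULT_MODEL = "google/gemini-3-pro-image-preview"
GENERATION_AND_EDITING = (
    "google/gemini-3-pro-image-preview",
    "black-forest-labs/flux.2-pro",
)
GENERATION_ONLY = ("black-forest-labs/flux.2-dev",)


def supports_editing(model: str) -> bool:
    """True if the model is listed as supporting generation + editing."""
    return model in GENERATION_AND_EDITING


def choose_model(editing: bool = False, prefer_low_cost: bool = False) -> str:
    """Pick a model ID following the quality / editing / cost guidance."""
    if editing:
        # Editing requires one of the generation + editing models.
        return DEFAULT_MODEL
    if prefer_low_cost:
        # flux.2-dev is suggested for cost, and is generation only.
        return GENERATION_ONLY[0]
    return DEFAULT_MODEL
```

The returned ID can be passed to the script through --model.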

Common Usage Patterns

Basic generation

```bash
python scripts/generate_image.py "Your prompt here"
```

Specify model

```bash
python scripts/generate_image.py "A cat in space" --model "black-forest-labs/flux.2-pro"
```

Custom output path

```bash
python scripts/generate_image.py "Abstract art" --output artwork.png
```

Edit an existing image

```bash
python scripts/generate_image.py "Make the background blue" --input photo.jpg
```

Edit with a specific model

```bash
python scripts/generate_image.py "Add sunglasses to the person" --input portrait.png --model "black-forest-labs/flux.2-pro"
```

Edit with custom output

```bash
python scripts/generate_image.py "Remove the text from the image" --input screenshot.png --output cleaned.png
```

Multiple images

Run the script once per image, giving each run its own --output path so that later runs do not overwrite the default generated_image.png:

```bash
python scripts/generate_image.py "Image 1 description" --output image1.png
python scripts/generate_image.py "Image 2 description" --output image2.png
```
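For more than a handful of images, the same pattern can be driven from Python. The sketch below is an assumption-laden illustration: build_command and generate_all are hypothetical helpers, the script path is taken from the examples above, and check=True only helps if the script exits with a non-zero status on failure, which the skill does not confirm.

```python
import subprocess
import sys
from typing import Dict, List


def build_command(prompt: str, output: str,
                  script: str = "scripts/generate_image.py") -> List[str]:
    """Build the argument list for one run of the script."""
    return [sys.executable, script, prompt, "--output", output]


def generate_all(jobs: Dict[str, str], dry_run: bool = True) -> List[List[str]]:
    """Run one command per (output path -> prompt) entry.

    With dry_run=True the commands are only built and returned, not executed.
    """
    commands = [build_command(prompt, output) for output, prompt in jobs.items()]
    if not dry_run:
        for command in commands:
            # Raises CalledProcessError if the script exits non-zero (assumed behavior).
            subprocess.run(command, check=True)
    return commands
```

Keying the jobs by output path keeps the output paths unique, so no run overwrites another run's file.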

Script Parameters

prompt (required): Text description of the image to generate, or editing instructions
--input or -i: Input image path for editing (enables edit mode)
--model or -m: OpenRouter model ID (default: google/gemini-3-pro-image-preview)
--output or -o: Output file path (default: generated_image.png)
--api-key: OpenRouter API key (overrides .env file)
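A command-line interface with these parameters could be declared with argparse roughly as follows. This is a reconstruction from the list above rather than the script's actual source: the function name build_parser and the help strings are assumptions, and only the flag names and defaults come from this section.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Declare the documented parameters (hypothetical reconstruction)."""
    parser = argparse.ArgumentParser(
        description="Generate or edit an image through OpenRouter"
    )
    parser.add_argument("prompt",
                        help="Image description, or editing instructions")
    parser.add_argument("--input", "-i",
                        help="Input image path; enables edit mode")
    parser.add_argument("--model", "-m",
                        default="google/gemini-3-pro-image-preview",
                        help="OpenRouter model ID")
    parser.add_argument("--output", "-o",
                        default="generated_image.png",
                        help="Output file path")
    parser.add_argument("--api-key",
                        help="OpenRouter API key; overrides the .env file")
    return parser
```

Parsing ["Make the sky purple", "-i", "photo.jpg"] with this parser yields input set to photo.jpg and the documented defaults for model and output.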

Error Handling

The script provides clear error messages for:

Missing API key (with setup instructions)
API errors (with status codes)
Unexpected response formats
Missing dependencies (requests library)

If the script fails, read the error message and address the issue before retrying.
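When the script is driven from another program, the "read the error message before retrying" step might look like the sketch below. The helper name run_script is hypothetical, and the sketch assumes the script signals failure through a non-zero exit status and writes its message to stderr or stdout; neither detail is confirmed by the skill.

```python
import subprocess
from typing import List, Tuple


def run_script(command: List[str]) -> Tuple[bool, str]:
    """Run a command and return (succeeded, message) for the caller to inspect."""
    result = subprocess.run(command, capture_output=True, text=True)
    if result.returncode == 0:
        return True, result.stdout.strip()
    # Prefer stderr for the failure message, falling back to stdout.
    message = result.stderr.strip() or result.stdout.strip()
    return False, message
```

A caller would surface the returned message to the user and fix the reported problem (for example, a missing API key) instead of retrying blindly.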

Notes

Images are returned as base64-encoded data URLs and automatically saved as PNG files
The script handles both the images and content response formats returned by different OpenRouter models
Generation time varies by model (typically 5-30 seconds)
For image editing, the input image is encoded as base64 and sent to the model
Supported input image formats: PNG, JPEG, GIF, WebP
Check OpenRouter pricing for cost information: https://openrouter.ai/models
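The base64 handling mentioned in these notes can be illustrated with the standard library alone. The two helpers below are hypothetical and are not taken from scripts/generate_image.py; they only show how an input image might be turned into a data URL and how a returned data URL might be turned back into bytes.

```python
import base64
import mimetypes
from pathlib import Path
from typing import Tuple


def encode_image_as_data_url(path: Path) -> str:
    """Read an image file and wrap its bytes in a base64 data URL."""
    mime_type, _ = mimetypes.guess_type(path.name)
    if mime_type is None:
        raise ValueError(f"cannot determine image type of {path.name}")
    encoded = base64.b64encode(path.read_bytes()).decode("ascii")
    return f"data:{mime_type};base64,{encoded}"


def decode_data_url(data_url: str) -> Tuple[str, bytes]:
    """Split a base64 data URL into its MIME type and raw bytes."""
    header, separator, payload = data_url.partition(",")
    if not separator or not header.startswith("data:") or not header.endswith(";base64"):
        raise ValueError("expected a base64 data URL")
    mime_type = header[len("data:"):-len(";base64")]
    return mime_type, base64.b64decode(payload)
```

Writing the decoded bytes to the output path with Path.write_bytes would complete the save step.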

Image Editing Tips

Be specific about what changes you want (e.g., "change the sky to sunset colors" vs "edit the sky")
Reference specific elements in the image when possible
For best results, use clear and detailed editing instructions
Both Gemini 3 Pro and FLUX.2 Pro support image editing through OpenRouter

Maintainer

Microck Core maintainer

Source details

Full Name: Microck/ordinary-claude-skills
Branch: main
Path in repo: skills_all/claude-scientific-skills/scientific-skills/generate-image
License: Other
Topics: claude-code claude claude-skills collection list
