Agent skill

gemini-image-generator

Generate, edit, or transform images with Gemini Nano Banana (Flash or Pro) using the bundled Python scripts. Supports aspect ratio and resolution control, image-to-image edits, logo overlays, and reference images. Use when users request image generation, image edits, image-to-image transformations, logo placement, or specific aspect ratios or resolutions.

Install this agent skill to your Project

npx add-skill https://github.com/feed-mob/agent-skills/tree/main/skills/gemini-image-generator

SKILL.md

Gemini Image Generator

Use this skill to turn a user prompt (and optional images) into Gemini image generation calls via the bundled Python scripts.

Workflow

  1. Collect the user prompt and any images (local paths or URLs).
  2. Infer the operation mode and translate parameters into CLI flags.
  3. Run the appropriate script to generate or edit images.
  4. Return the output file paths.

Defaults and Rules

  • Default model: gemini-2.5-flash-image (CLI value flash).
  • Default aspect ratio: 9:16.
  • Default count: 1 (max 3).
  • Default image size: 1K, but only apply it for the Pro model.
  • If the user specifies a size (1K|2K|4K), switch to Pro (gemini-3-pro-image-preview).
  • If the user explicitly asks for Pro or higher quality, use Pro.
  • If the user supplies multiple reference images (2+), switch to Pro.
  • Logo overlay always uses Pro (even if the user asks for Flash).
  • Only set --size when using Pro.
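
The rules above can be read as a small decision function. The following is a minimal sketch, not the skill's actual implementation: the `Request` class and the helper names are hypothetical, and only the `flash`/`pro` CLI values and the 1K default come from the rules themselves.

```python
from dataclasses import dataclass
from typing import Optional

FLASH = "flash"  # CLI value for gemini-2.5-flash-image
PRO = "pro"      # CLI value for gemini-3-pro-image-preview


@dataclass
class Request:
    """Hypothetical container for what the user asked for."""
    size: Optional[str] = None   # "1K", "2K", "4K", or None if unspecified
    wants_pro: bool = False      # user asked for Pro or higher quality
    reference_count: int = 0     # number of reference images supplied
    logo_overlay: bool = False   # True when the task is a logo overlay


def choose_model(req: Request) -> str:
    """Return the --model value implied by the rules above."""
    if req.logo_overlay:          # logo overlay always uses Pro
        return PRO
    if req.size is not None:      # an explicit size forces Pro
        return PRO
    if req.wants_pro:             # explicit request for Pro or higher quality
        return PRO
    if req.reference_count >= 2:  # multiple reference images force Pro
        return PRO
    return FLASH


def effective_size(req: Request, model: str) -> Optional[str]:
    """Size applies only to Pro; it defaults to 1K when unspecified."""
    if model != PRO:
        return None
    return req.size or "1K"
```

For example, `choose_model(Request(reference_count=2))` yields `"pro"`, while a bare `Request()` stays on `"flash"` with no size.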

Allowed Values

  • Aspect ratios: 1:1, 2:3, 3:2, 3:4, 4:3, 4:5, 5:4, 9:16, 16:9, 21:9.
  • Image sizes (Pro only): 1K, 2K, 4K.
  • Reference images (Pro): up to 14 total. Gemini guidance: up to 6 object images + up to 5 human images.
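
Checking a request against these allowed values before calling a script could look like the sketch below. The `validate` function and its parameter names are hypothetical; the bundled scripts may do their own validation, and only the value sets and limits are taken from this document.

```python
ASPECT_RATIOS = {"1:1", "2:3", "3:2", "3:4", "4:3", "4:5", "5:4", "9:16", "16:9", "21:9"}
IMAGE_SIZES = {"1K", "2K", "4K"}  # Pro only
MAX_REFERENCES = 14               # Pro only
MAX_COUNT = 3


def validate(aspect="9:16", size=None, count=1, references=0):
    """Return a list of human-readable problems; an empty list means OK."""
    problems = []
    if aspect not in ASPECT_RATIOS:
        problems.append(f"unsupported aspect ratio: {aspect}")
    if size is not None and size not in IMAGE_SIZES:
        problems.append(f"unsupported image size: {size}")
    if not 1 <= count <= MAX_COUNT:
        problems.append(f"count must be between 1 and {MAX_COUNT}, got {count}")
    if references > MAX_REFERENCES:
        problems.append(f"too many reference images: {references} > {MAX_REFERENCES}")
    return problems
```

Returning a list of problems rather than raising on the first one lets the agent report every issue to the user in a single reply.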

Scripts

Text-to-image

Run:

python scripts/generate_image.py \
  --prompt "<user prompt>" \
  --aspect 9:16 \
  --count 1 \
  --model flash \
  --out-dir outputs

Pass only the flags the user asked for; anything omitted falls back to the defaults listed above. The script reads GEMINI_API_KEY from the environment.
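
One way to translate parameters into CLI flags is to build an argument list and append a flag only when a value was given. This is a sketch under that assumption: `build_generate_command` is a hypothetical helper, and it uses only the flags that appear in this document (`--prompt`, `--aspect`, `--count`, `--model`, `--size`, `--out-dir`).

```python
import shlex


def build_generate_command(prompt, aspect=None, count=None, model=None,
                           size=None, out_dir=None):
    """Build the argv list for scripts/generate_image.py.

    A flag is added only when its value is not None, so omitted
    options are left to the script's own defaults.
    """
    cmd = ["python", "scripts/generate_image.py", "--prompt", prompt]
    optional = [
        ("--aspect", aspect),
        ("--count", count),
        ("--model", model),
        ("--size", size),
        ("--out-dir", out_dir),
    ]
    for flag, value in optional:
        if value is not None:
            cmd += [flag, str(value)]
    return cmd


cmd = build_generate_command("A neon city at night",
                             aspect="16:9", size="2K", count=3, model="pro")
print(shlex.join(cmd))  # shell-quoted form, handy for logging
```

Passing the list to `subprocess.run(cmd, check=True)` rather than a shell string avoids quoting problems when the prompt contains quotes or other special characters.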

Image editing / image-to-image

Use when the user supplies a base image to edit or transform.

Run:

python scripts/edit_image.py \
  --input /path/to/base.png \
  --prompt "<edit instructions>" \
  --reference /path/to/ref1.png \
  --reference https://example.com/ref2.png \
  --aspect 9:16 \
  --count 1 \
  --model pro \
  --out-dir outputs
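
The edit command repeats `--reference` once per reference image. The sketch below shows one way to assemble it, and applies the rule from Defaults and Rules that two or more reference images require Pro. `build_edit_command` is a hypothetical helper, not part of the bundled scripts.

```python
def build_edit_command(input_path, prompt, references=(), model=None,
                       aspect=None, count=None, out_dir=None):
    """Build the argv list for scripts/edit_image.py."""
    references = list(references)
    # Two or more reference images require Pro, per the rules in this document.
    if len(references) >= 2:
        model = "pro"

    cmd = ["python", "scripts/edit_image.py",
           "--input", input_path, "--prompt", prompt]
    for ref in references:  # one --reference flag per image (path or URL)
        cmd += ["--reference", ref]
    for flag, value in [("--aspect", aspect), ("--count", count),
                        ("--model", model), ("--out-dir", out_dir)]:
        if value is not None:
            cmd += [flag, str(value)]
    return cmd


edit_cmd = build_edit_command(
    "/path/to/base.png",
    "Match the lighting of the reference images",
    references=["/path/to/ref1.png", "https://example.com/ref2.png"],
)
```

With a single reference (or none) and no explicit model, the helper adds no `--model` flag and leaves the choice to the script's default.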

Logo overlay

Use when the user wants to place a logo onto a base image.

Run:

python scripts/logo_overlay.py \
  --base /path/to/base.png \
  --logo /path/to/logo.png \
  --aspect 9:16 \
  --count 1 \
  --model pro \
  --out-dir outputs

Examples

User: "Generate a portrait of a dancer in a foggy forest."
Claude:

  • Use defaults (flash, 9:16, count 1).
  • Run: python scripts/generate_image.py --prompt "Generate a portrait of a dancer in a foggy forest."

User: "Make a 2K 16:9 cinematic still of a neon city, give me 3 options."
Claude:

  • Use Pro with size 2K, aspect 16:9, count 3.
  • Run: python scripts/generate_image.py --prompt "Make a 2K 16:9 cinematic still of a neon city" --aspect 16:9 --size 2K --count 3 --model pro

User: "Edit this image to remove the background and make it studio white." (with one image)
Claude:

  • Use edit script with Flash.
  • Run: python scripts/edit_image.py --input /path/to/image.png --prompt "Remove the background and make it studio white."

User: "Put this logo on the shirt in the photo." (with base + logo images)
Claude:

  • Use logo overlay script (Pro).
  • Run: python scripts/logo_overlay.py --base /path/to/photo.png --logo /path/to/logo.png

Notes

  • If the script fails with a missing module, install google-genai and retry.
  • Dependencies live in scripts/requirements.txt (install with pip install -r scripts/requirements.txt).
  • Output files are written into the outputs/ directory using timestamped names.
  • For prompt best practices and templates, read references/prompt-guide.md.
  • For logo-specific guidance, read references/logo-overlay.md.
  • For edit/image-to-image guidance, read references/image-editing.md.
  • For watermarking guidance, read references/watermarking.md.
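
A quick preflight check can catch the two most likely setup failures mentioned in this document (a missing API key and a missing google-genai package) before any script runs. This is a sketch only: `preflight` is a hypothetical helper, and it assumes the package is imported as `google.genai`.

```python
import importlib.util
import os


def preflight(env=None):
    """Return a list of setup problems; an empty list means ready to run."""
    env = os.environ if env is None else env
    problems = []

    if not env.get("GEMINI_API_KEY"):
        problems.append("GEMINI_API_KEY is not set")

    try:
        found = importlib.util.find_spec("google.genai") is not None
    except ModuleNotFoundError:
        # Raised when the parent "google" package itself is absent.
        found = False
    if not found:
        problems.append(
            "google-genai is not installed; "
            "run: pip install -r scripts/requirements.txt"
        )
    return problems
```

Calling `preflight()` with no arguments inspects the real environment; passing a dict makes the key check easy to exercise in isolation.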
