Agent skill

nano-banana

Generate images with Google Gemini native image models via inference.sh CLI. Models: Gemini 3 Pro Image, Gemini 2.5 Flash Image. Capabilities: text-to-image, image editing, multi-image input. Triggers: nano banana, gemini image, gemini 3 pro image, gemini 2.5 flash image, google image generation, native image generation, gemini native image

Stars 0
Forks 0

Install this agent skill to your Project

npx add-skill https://github.com/autohandai/community-skills/tree/main/nano-banana

SKILL.md

Nano Banana - Gemini Native Image Generation

Generate images with Google Gemini native image models via inference.sh CLI.

Nano Banana

Quick Start

Requires inference.sh CLI (infsh). Get installation instructions: npx skills add inference-sh/skills@agent-tools

bash
infsh login

infsh app run google/gemini-3-pro-image-preview --input '{"prompt": "a banana in space, photorealistic"}'

Models

Model App ID Speed Quality
Gemini 3 Pro Image google/gemini-3-pro-image-preview Slower Best
Gemini 2.5 Flash Image google/gemini-2-5-flash-image Fast Excellent

Search Gemini Image Apps

bash
infsh app list --search "gemini image"

Examples

Basic Text-to-Image

bash
infsh app run google/gemini-3-pro-image-preview --input '{
  "prompt": "A futuristic cityscape at sunset with flying cars"
}'

Multiple Images

bash
infsh app run google/gemini-2-5-flash-image --input '{
  "prompt": "Minimalist logo design for a coffee shop",
  "num_images": 4
}'

Custom Aspect Ratio

bash
infsh app run google/gemini-3-pro-image-preview --input '{
  "prompt": "Panoramic mountain landscape with northern lights",
  "aspect_ratio": "16:9"
}'

Image Editing (with input image)

bash
infsh app run google/gemini-2-5-flash-image --input '{
  "prompt": "Add a rainbow in the sky",
  "images": ["https://example.com/landscape.jpg"]
}'

High Resolution (4K)

bash
infsh app run google/gemini-3-pro-image-preview --input '{
  "prompt": "Detailed illustration of a medieval castle",
  "resolution": "4K"
}'

With Google Search Grounding

bash
infsh app run google/gemini-3-pro-image-preview --input '{
  "prompt": "Current weather in Tokyo visualized as an artistic scene",
  "enable_google_search": true
}'

Input Options

Parameter Type Description
prompt string Required. What to generate or change
images array Input images for editing (up to 14)
num_images integer Number of images to generate
aspect_ratio string Output ratio: "1:1", "16:9", "9:16", "4:3", "3:4", "auto"
resolution string "1K", "2K", "4K" (Gemini 3 Pro only)
output_format string Output format for images
enable_google_search boolean Enable real-time info grounding

Prompt Tips

Styles: photorealistic, illustration, watercolor, oil painting, digital art, anime, 3D render

Composition: close-up, wide shot, aerial view, macro, portrait, landscape

Lighting: natural light, studio lighting, golden hour, dramatic shadows, neon

Details: add specific details about textures, colors, mood, atmosphere

Sample Workflow

bash
# 1. Generate sample input to see all options
infsh app sample google/gemini-3-pro-image-preview --save input.json

# 2. Edit the prompt
# 3. Run
infsh app run google/gemini-3-pro-image-preview --input input.json

Related Skills

bash
# Full platform skill (all 150+ apps)
npx skills add inference-sh/skills@agent-tools

# All image generation models
npx skills add inference-sh/skills@ai-image-generation

# Video generation (for image-to-video)
npx skills add inference-sh/skills@ai-video-generation

Browse all image apps: infsh app list --category image

Documentation

Expand your agent's capabilities with these related and highly-rated skills.

autohandai/community-skills

mapping-mitre-attack-techniques

Maps observed adversary behaviors, security alerts, and detection rules to MITRE ATT&CK techniques and sub-techniques to quantify detection coverage and guide control prioritization. Use when building an ATT&CK-based coverage heatmap, tagging SIEM alerts with technique IDs, aligning security controls to adversary playbooks, or reporting threat exposure to executives. Activates for requests involving ATT&CK Navigator, Sigma rules, MITRE D3FEND, or coverage gap analysis.

0 0
Explore
autohandai/community-skills

hunting-for-spearphishing-indicators

Hunt for spearphishing campaign indicators across email logs, endpoint telemetry, and network data to detect targeted email attacks.

0 0
Explore
autohandai/community-skills

analyzing-malicious-url-with-urlscan

URLScan.io is a free service for scanning and analyzing suspicious URLs. It captures screenshots, DOM content, HTTP transactions, JavaScript behavior, and network connections of web pages in an isolat

0 0
Explore
autohandai/community-skills

implementing-zero-standing-privilege-with-cyberark

Deploy CyberArk Secure Cloud Access to eliminate standing privileges in hybrid and multi-cloud environments using just-in-time access with time, entitlement, and approval controls.

0 0
Explore
autohandai/community-skills

implementing-pam-for-database-access

Deploy privileged access management for database systems including Oracle, SQL Server, PostgreSQL, and MySQL. Covers session proxy configuration, credential vaulting, query auditing, dynamic credentia

0 0
Explore
autohandai/community-skills

detecting-t1003-credential-dumping-with-edr

Detect OS credential dumping techniques targeting LSASS memory, SAM database, NTDS.dit, and cached credentials using EDR telemetry, Sysmon process access monitoring, and Windows security event correlation.

0 0
Explore

Didn't find tool you were looking for?

Be as detailed as possible for better results