Agent skill
nano-banana
Generate images with Google Gemini native image models via inference.sh CLI. Models: Gemini 3 Pro Image, Gemini 2.5 Flash Image. Capabilities: text-to-image, image editing, multi-image input. Triggers: nano banana, gemini image, gemini 3 pro image, gemini 2.5 flash image, google image generation, native image generation, gemini native image
Install this agent skill to your Project
npx add-skill https://github.com/autohandai/community-skills/tree/main/nano-banana
SKILL.md
Nano Banana - Gemini Native Image Generation
Generate images with Google Gemini native image models via inference.sh CLI.
Quick Start
Requires inference.sh CLI (infsh). Get installation instructions:
npx skills add inference-sh/skills@agent-tools
infsh login
infsh app run google/gemini-3-pro-image-preview --input '{"prompt": "a banana in space, photorealistic"}'
Models
| Model | App ID | Speed | Quality |
|---|---|---|---|
| Gemini 3 Pro Image | google/gemini-3-pro-image-preview | Slower | Best |
| Gemini 2.5 Flash Image | google/gemini-2-5-flash-image | Fast | Excellent |
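The speed and quality trade-off in the table can be captured in a small shell helper. This is only a sketch: `pick_model` and its `quality`/`speed` keywords are hypothetical names introduced here, while the app IDs are the ones listed above.

```shell
# Hypothetical helper (not part of infsh): map a priority to an app ID from the table.
pick_model() {
  case "$1" in
    quality) echo "google/gemini-3-pro-image-preview" ;;  # slower, best quality
    speed)   echo "google/gemini-2-5-flash-image" ;;      # fast, excellent quality
    *)       echo "usage: pick_model quality|speed" >&2; return 1 ;;
  esac
}

# Example: print the app ID to pass to `infsh app run`.
pick_model speed
```

The helper only prints an app ID, so it can be tried without the CLI installed.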
Search Gemini Image Apps
infsh app list --search "gemini image"
Examples
Basic Text-to-Image
infsh app run google/gemini-3-pro-image-preview --input '{
"prompt": "A futuristic cityscape at sunset with flying cars"
}'
Multiple Images
infsh app run google/gemini-2-5-flash-image --input '{
"prompt": "Minimalist logo design for a coffee shop",
"num_images": 4
}'
Custom Aspect Ratio
infsh app run google/gemini-3-pro-image-preview --input '{
"prompt": "Panoramic mountain landscape with northern lights",
"aspect_ratio": "16:9"
}'
Image Editing (with input image)
infsh app run google/gemini-2-5-flash-image --input '{
"prompt": "Add a rainbow in the sky",
"images": ["https://example.com/landscape.jpg"]
}'
High Resolution (4K)
infsh app run google/gemini-3-pro-image-preview --input '{
"prompt": "Detailed illustration of a medieval castle",
"resolution": "4K"
}'
With Google Search Grounding
infsh app run google/gemini-3-pro-image-preview --input '{
"prompt": "Current weather in Tokyo visualized as an artistic scene",
"enable_google_search": true
}'
Input Options
| Parameter | Type | Description |
|---|---|---|
| prompt | string | Required. What to generate or change |
| images | array | Input images for editing (up to 14) |
| num_images | integer | Number of images to generate |
| aspect_ratio | string | Output ratio: "1:1", "16:9", "9:16", "4:3", "3:4", "auto" |
| resolution | string | "1K", "2K", "4K" (Gemini 3 Pro only) |
| output_format | string | Output format for images |
| enable_google_search | boolean | Enable real-time info grounding |
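When the input JSON is assembled in a script rather than typed by hand, it can help to check values against the table before calling the CLI. The sketch below assumes a hypothetical helper named `build_input`. It only covers `prompt` and `aspect_ratio`, and it escapes backslashes and double quotes but not newlines, so it is a starting point rather than a complete JSON encoder.

```shell
# Hypothetical helper (not part of infsh): build the JSON passed to --input.
# Usage: build_input "<prompt>" "<aspect_ratio>"
build_input() {
  prompt=$1
  ratio=$2
  # Accept only the aspect ratios listed in the Input Options table.
  case "$ratio" in
    1:1|16:9|9:16|4:3|3:4|auto) ;;
    *) echo "unsupported aspect_ratio: $ratio" >&2; return 1 ;;
  esac
  # Escape backslashes first, then double quotes, so the prompt stays a valid JSON string.
  escaped=$(printf '%s' "$prompt" | sed -e 's/\\/\\\\/g' -e 's/"/\\"/g')
  printf '{"prompt": "%s", "aspect_ratio": "%s"}\n' "$escaped" "$ratio"
}

# Possible use (commented out because it needs the infsh CLI and a login):
# infsh app run google/gemini-3-pro-image-preview --input "$(build_input 'Panoramic mountain landscape' 16:9)"
```

Rejecting an unknown ratio locally gives a faster and clearer error than waiting for the request to fail.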
Prompt Tips
Styles: photorealistic, illustration, watercolor, oil painting, digital art, anime, 3D render
Composition: close-up, wide shot, aerial view, macro, portrait, landscape
Lighting: natural light, studio lighting, golden hour, dramatic shadows, neon
Details: add specific details about textures, colors, mood, atmosphere
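One way to apply these tips consistently is to build the prompt from a subject plus a list of modifiers. The function below is a minimal sketch; `compose_prompt` is a hypothetical name, and joining terms with commas is a convention chosen here, not something the models require.

```shell
# Hypothetical helper: join a subject with style, composition, lighting and detail terms.
# Usage: compose_prompt "<subject>" [term ...]
compose_prompt() {
  prompt=$1
  shift
  for term in "$@"; do
    # Skip empty terms so optional modifiers can be left blank.
    if [ -n "$term" ]; then
      prompt="$prompt, $term"
    fi
  done
  printf '%s\n' "$prompt"
}

compose_prompt "a banana in space" "photorealistic" "wide shot" "golden hour"
```

The resulting string can be used as the value of the prompt field in any of the examples above.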
Sample Workflow
# 1. Generate sample input to see all options
infsh app sample google/gemini-3-pro-image-preview --save input.json
# 2. Edit the prompt
# 3. Run
infsh app run google/gemini-3-pro-image-preview --input input.json
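For step 2, the input file can also be written by hand instead of editing the generated sample. The sketch below assumes the file is a plain JSON object using field names from the Input Options table; the actual layout of the sample produced by `infsh app sample` may include more fields.

```shell
# Write an input file by hand; field names are taken from the Input Options table.
cat > input.json <<'EOF'
{
  "prompt": "Detailed illustration of a medieval castle",
  "aspect_ratio": "16:9",
  "resolution": "2K"
}
EOF

# Then run it as in step 3 (commented out because it needs the infsh CLI and a login):
# infsh app run google/gemini-3-pro-image-preview --input input.json
```

Keeping the input in a file makes it easy to rerun the same request or keep it under version control.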
Related Skills
# Full platform skill (all 150+ apps)
npx skills add inference-sh/skills@agent-tools
# All image generation models
npx skills add inference-sh/skills@ai-image-generation
# Video generation (for image-to-video)
npx skills add inference-sh/skills@ai-video-generation
Browse all image apps: infsh app list --category image
Documentation
- Running Apps - How to run apps via CLI
- Streaming Results - Real-time progress updates
- File Handling - Working with images