Agent skill
nano-banana
Generate images with Google Gemini native image models via inference.sh CLI. Models: Gemini 3 Pro Image, Gemini 2.5 Flash Image. Capabilities: text-to-image, image editing, multi-image input. Triggers: nano banana, gemini image, gemini 3 pro image, gemini 2.5 flash image, google image generation, native image generation, gemini native image
Install this agent skill to your Project
npx add-skill https://github.com/aiskillstore/marketplace/tree/main/skills/inferen-sh/nano-banana
SKILL.md
Nano Banana - Gemini Native Image Generation
Generate images with Google Gemini native image models via inference.sh CLI.
Quick Start
Requires inference.sh CLI (
infsh). Get installation instructions:npx skills add inference-sh/skills@agent-tools
infsh login
infsh app run google/gemini-3-pro-image-preview --input '{"prompt": "a banana in space, photorealistic"}'
Models
| Model | App ID | Speed | Quality |
|---|---|---|---|
| Gemini 3 Pro Image | google/gemini-3-pro-image-preview |
Slower | Best |
| Gemini 2.5 Flash Image | google/gemini-2-5-flash-image |
Fast | Excellent |
Search Gemini Image Apps
infsh app list --search "gemini image"
Examples
Basic Text-to-Image
infsh app run google/gemini-3-pro-image-preview --input '{
"prompt": "A futuristic cityscape at sunset with flying cars"
}'
Multiple Images
infsh app run google/gemini-2-5-flash-image --input '{
"prompt": "Minimalist logo design for a coffee shop",
"num_images": 4
}'
Custom Aspect Ratio
infsh app run google/gemini-3-pro-image-preview --input '{
"prompt": "Panoramic mountain landscape with northern lights",
"aspect_ratio": "16:9"
}'
Image Editing (with input image)
infsh app run google/gemini-2-5-flash-image --input '{
"prompt": "Add a rainbow in the sky",
"images": ["https://example.com/landscape.jpg"]
}'
High Resolution (4K)
infsh app run google/gemini-3-pro-image-preview --input '{
"prompt": "Detailed illustration of a medieval castle",
"resolution": "4K"
}'
With Google Search Grounding
infsh app run google/gemini-3-pro-image-preview --input '{
"prompt": "Current weather in Tokyo visualized as an artistic scene",
"enable_google_search": true
}'
Input Options
| Parameter | Type | Description |
|---|---|---|
prompt |
string | Required. What to generate or change |
images |
array | Input images for editing (up to 14) |
num_images |
integer | Number of images to generate |
aspect_ratio |
string | Output ratio: "1:1", "16:9", "9:16", "4:3", "3:4", "auto" |
resolution |
string | "1K", "2K", "4K" (Gemini 3 Pro only) |
output_format |
string | Output format for images |
enable_google_search |
boolean | Enable real-time info grounding |
Prompt Tips
Styles: photorealistic, illustration, watercolor, oil painting, digital art, anime, 3D render
Composition: close-up, wide shot, aerial view, macro, portrait, landscape
Lighting: natural light, studio lighting, golden hour, dramatic shadows, neon
Details: add specific details about textures, colors, mood, atmosphere
Sample Workflow
# 1. Generate sample input to see all options
infsh app sample google/gemini-3-pro-image-preview --save input.json
# 2. Edit the prompt
# 3. Run
infsh app run google/gemini-3-pro-image-preview --input input.json
Related Skills
# Full platform skill (all 150+ apps)
npx skills add inference-sh/skills@agent-tools
# All image generation models
npx skills add inference-sh/skills@ai-image-generation
# Video generation (for image-to-video)
npx skills add inference-sh/skills@ai-video-generation
Browse all image apps: infsh app list --category image
Documentation
- Running Apps - How to run apps via CLI
- Streaming Results - Real-time progress updates
- File Handling - Working with images
Recommended Agent Skills
Expand your agent's capabilities with these related and highly-rated skills.
perigon-backend
Perigon ASP.NET Core + EF Core + Aspire conventions
perigon-agent
Pointers for Copilot/agents to apply Perigon conventions
perigon-angular
Angular 21+ standalone/Material/signal conventions for Perigon WebApp
fastapi-mastery
Comprehensive FastAPI development skill covering REST API creation, routing, request/response handling, validation, authentication, database integration, middleware, and deployment. Use when working with FastAPI projects, building APIs, implementing CRUD operations, setting up authentication/authorization, integrating databases (SQL/NoSQL), adding middleware, handling WebSockets, or deploying FastAPI applications. Triggered by requests involving .py files with FastAPI code, API endpoint creation, Pydantic models, or FastAPI-specific features.
context7-efficient
Token-efficient library documentation fetcher using Context7 MCP with 86.8% token savings through intelligent shell pipeline filtering. Fetches code examples, API references, and best practices for JavaScript, Python, Go, Rust, and other libraries. Use when users ask about library documentation, need code examples, want API usage patterns, are learning a new framework, need syntax reference, or troubleshooting with library-specific information. Triggers include questions like "Show me React hooks", "How do I use Prisma", "What's the Next.js routing syntax", or any request for library/framework documentation.
browser-use
Browser automation using Playwright MCP. Navigate websites, fill forms, click elements, take screenshots, and extract data. Use when tasks require web browsing, form submission, web scraping, UI testing, or any browser interaction.
Didn't find tool you were looking for?