Agent skill
ai-image-generation
Generate AI images with FLUX, Gemini, Grok, Seedream, Reve and 50+ models via inference.sh CLI. Models: FLUX Dev LoRA, FLUX.2 Klein LoRA, Gemini 3 Pro Image, Grok Imagine, Seedream 4.5, Reve, ImagineArt. Capabilities: text-to-image, image-to-image, inpainting, LoRA, image editing, upscaling, text rendering. Use for: AI art, product mockups, concept art, social media graphics, marketing visuals, illustrations. Triggers: flux, image generation, ai image, text to image, stable diffusion, generate image, ai art, midjourney alternative, dall-e alternative, text2img, t2i, image generator, ai picture, create image with ai, generative ai, ai illustration, grok image, gemini image
Install this agent skill to your Project
npx add-skill https://github.com/aiskillstore/marketplace/tree/main/skills/inference-sh/ai-image-generation
SKILL.md
AI Image Generation
Generate images with 50+ AI models via inference.sh CLI.
Quick Start
# Install CLI
curl -fsSL https://cli.inference.sh | sh && infsh login
# Generate an image with FLUX
infsh app run falai/flux-dev-lora --input '{"prompt": "a cat astronaut in space"}'
Available Models
| Model | App ID | Best For |
|---|---|---|
| FLUX Dev LoRA | falai/flux-dev-lora |
High quality with custom styles |
| FLUX.2 Klein LoRA | falai/flux-2-klein-lora |
Fast with LoRA support (4B/9B) |
| Gemini 3 Pro | google/gemini-3-pro-image-preview |
Google's latest |
| Gemini 2.5 Flash | google/gemini-2-5-flash-image |
Fast Google model |
| Grok Imagine | xai/grok-imagine-image |
xAI's model, multiple aspects |
| Seedream 4.5 | bytedance/seedream-4-5 |
2K-4K cinematic quality |
| Seedream 4.0 | bytedance/seedream-4-0 |
High quality 2K-4K |
| Seedream 3.0 | bytedance/seedream-3-0-t2i |
Accurate text rendering |
| Reve | falai/reve |
Natural language editing, text rendering |
| ImagineArt 1.5 Pro | falai/imagine-art-1-5-pro-preview |
Ultra-high-fidelity 4K |
| Topaz Upscaler | falai/topaz-image-upscaler |
Professional upscaling |
Browse All Image Apps
infsh app list --category image
Examples
Text-to-Image with FLUX
infsh app run falai/flux-dev-lora --input '{
"prompt": "professional product photo of a coffee mug, studio lighting"
}'
Fast Generation with FLUX Klein
infsh app run falai/flux-2-klein-lora --input '{"prompt": "sunset over mountains"}'
Google Gemini 3 Pro
infsh app run google/gemini-3-pro-image-preview --input '{
"prompt": "photorealistic landscape with mountains and lake"
}'
Grok Imagine
infsh app run xai/grok-imagine-image --input '{
"prompt": "cyberpunk city at night",
"aspect_ratio": "16:9"
}'
Reve (with Text Rendering)
infsh app run falai/reve --input '{
"prompt": "A poster that says HELLO WORLD in bold letters"
}'
Seedream 4.5 (4K Quality)
infsh app run bytedance/seedream-4-5 --input '{
"prompt": "cinematic portrait of a woman, golden hour lighting"
}'
Image Upscaling
infsh app run falai/topaz-image-upscaler --input '{"image_url": "https://..."}'
Stitch Multiple Images
infsh app run infsh/stitch-images --input '{
"images": ["https://img1.jpg", "https://img2.jpg"],
"direction": "horizontal"
}'
Related Skills
# Full platform skill (all 150+ apps)
npx skills add inference-sh/skills@inference-sh
# FLUX-specific skill
npx skills add inference-sh/skills@flux-image
# Upscaling & enhancement
npx skills add inference-sh/skills@image-upscaling
# Background removal
npx skills add inference-sh/skills@background-removal
# Video generation
npx skills add inference-sh/skills@ai-video-generation
# AI avatars from images
npx skills add inference-sh/skills@ai-avatar-video
Browse all apps: infsh app list
Documentation
- Running Apps - How to run apps via CLI
- Image Generation Example - Complete image generation guide
- Apps Overview - Understanding the app ecosystem
Recommended Agent Skills
Expand your agent's capabilities with these related and highly-rated skills.
perigon-backend
Perigon ASP.NET Core + EF Core + Aspire conventions
perigon-agent
Pointers for Copilot/agents to apply Perigon conventions
perigon-angular
Angular 21+ standalone/Material/signal conventions for Perigon WebApp
fastapi-mastery
Comprehensive FastAPI development skill covering REST API creation, routing, request/response handling, validation, authentication, database integration, middleware, and deployment. Use when working with FastAPI projects, building APIs, implementing CRUD operations, setting up authentication/authorization, integrating databases (SQL/NoSQL), adding middleware, handling WebSockets, or deploying FastAPI applications. Triggered by requests involving .py files with FastAPI code, API endpoint creation, Pydantic models, or FastAPI-specific features.
context7-efficient
Token-efficient library documentation fetcher using Context7 MCP with 86.8% token savings through intelligent shell pipeline filtering. Fetches code examples, API references, and best practices for JavaScript, Python, Go, Rust, and other libraries. Use when users ask about library documentation, need code examples, want API usage patterns, are learning a new framework, need syntax reference, or troubleshooting with library-specific information. Triggers include questions like "Show me React hooks", "How do I use Prisma", "What's the Next.js routing syntax", or any request for library/framework documentation.
browser-use
Browser automation using Playwright MCP. Navigate websites, fill forms, click elements, take screenshots, and extract data. Use when tasks require web browsing, form submission, web scraping, UI testing, or any browser interaction.
Didn't find tool you were looking for?