Agent skill
fal-ai-image
Generate/edit images via fal.ai nano-banana-pro. Supports reference images (Edit mode) and text rendering in any language including Cyrillic. ALWAYS read SKILL.md before first use.
Install this agent skill to your Project
npx add-skill https://github.com/artwist-polyakov/polyakov-claude-skills/tree/main/plugins/fal-ai-image/skills/fal-ai-image
SKILL.md
Generate images via fal.ai nano-banana-pro (Gemini 3 Pro Image). Best for: infographics, text rendering, complex compositions.
STOP — Read Before Acting
- DO NOT use Pillow, ImageMagick, or any post-processing for text/logo overlay — this model renders text natively, including Cyrillic and CJK
- DO NOT use Generate mode when user provides reference images — use Edit mode
- DO NOT launch general-purpose subagents; use `Task` with `subagent_type: "Bash"` and `model: "haiku"`
- DO NOT skip uploading local files; run `upload.sh` first to get URLs for `edit.sh`
- DO NOT guess script parameters; check the tables below
Quick Start Decision
User gave reference images? → Edit mode (upload.sh → edit.sh)
User wants text-only gen? → Generate mode (generate.sh)
Multiple images needed? → Parallel Bash/haiku subagents
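The decision above can be sketched as a small POSIX sh helper. The name `choose_mode` and its single argument (the number of reference images) are hypothetical and not part of the skill's scripts; the function only encodes the rule that any reference image means Edit mode.

```sh
# Hypothetical helper: pick the mode from the number of reference images.
choose_mode() {
    ref_count=$1
    if [ "$ref_count" -gt 0 ]; then
        echo "edit"      # references supplied: upload.sh, then edit.sh
    else
        echo "generate"  # text-only prompt: generate.sh
    fi
}

choose_mode 0   # prints "generate"
choose_mode 2   # prints "edit"
```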
Model Capabilities
- Excellent text rendering (Latin, Cyrillic, CJK) — no post-processing needed
- Composes logos, product photos, and text into banners in a single pass
- Understands layout instructions ("left side text, right side product photo")
- Handles complex infographics, charts, and diagrams
- Edit mode blends reference images naturally with prompt guidance
Compatibility
Scripts are POSIX sh compatible — work in cloud sandboxes (/bin/sh) and locally (bash).
No bashisms: [[ ]], ${BASH_SOURCE}, source etc. are NOT used.
Config
Requires FAL_KEY in config/.env or environment.
Get key: https://fal.ai/dashboard/keys
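A minimal sketch of one way the key lookup could work, assuming `config/.env` holds a plain `FAL_KEY=value` line and that the environment variable takes precedence. Both points are assumptions, the helper name `resolve_fal_key` is hypothetical, and the skill's own scripts may load the key differently.

```sh
# Hypothetical helper: print FAL_KEY from the environment, else from an env file.
# Assumes the file contains a plain line of the form FAL_KEY=value.
resolve_fal_key() {
    env_file=${1:-config/.env}
    if [ -n "${FAL_KEY:-}" ]; then
        printf '%s\n' "$FAL_KEY"
        return 0
    fi
    if [ -f "$env_file" ]; then
        # Take the first FAL_KEY= line, drop the name, strip optional quotes.
        key=$(sed -n 's/^FAL_KEY=//p' "$env_file" | head -n 1 | tr -d "\"'")
        if [ -n "$key" ]; then
            printf '%s\n' "$key"
            return 0
        fi
    fi
    echo "FAL_KEY not set; get one at https://fal.ai/dashboard/keys" >&2
    return 1
}
```

A caller could then fail early with `FAL_KEY=$(resolve_fal_key) || exit 1` before starting any paid request.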
Two Modes
1. Generate (text-to-image)
Create images from text prompt only.
Script: scripts/generate.sh
2. Edit (image-to-image)
Create images using reference images (up to 14).
Script: scripts/edit.sh
Workflow
IMPORTANT: Run generation via Task tool with Haiku subagent to avoid blocking main context.
For Generate mode:
1. Clarify params (if not specified):
   - Aspect ratio: `1:1`, `16:9`, `9:16`, `4:3`, etc.
   - Resolution: `1K` (default), `2K`, `4K`
2. Propose save path based on project structure:
   - Check for `./images/`, `./assets/`, `./static/`
   - Suggest: "Save to `./images/infographic_coffee.png`?"
3. Show price & confirm:
   Cost: $0.15/image (4K: $0.30)
   Confirm? (yes/no)
4. Launch subagent:
   Task tool:
   - subagent_type: "Bash"
   - model: "haiku"
   - run_in_background: true
   - prompt: Run generate.sh with params
5. Report result: Parse JSON output, show image URL, read saved file.
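The save-path step of the Generate workflow could be sketched as follows. The helper name `suggest_output_dir` is hypothetical, and falling back to `images` when none of the three directories exists is an assumption rather than documented behavior.

```sh
# Hypothetical helper: pick the first existing asset directory under a project root.
suggest_output_dir() {
    root=${1:-.}
    for dir in images assets static; do
        if [ -d "$root/$dir" ]; then
            printf '%s\n' "$root/$dir"
            return 0
        fi
    done
    # Assumption: fall back to <root>/images when no candidate exists.
    printf '%s\n' "$root/images"
}

out_dir=$(suggest_output_dir .)
echo "Save to $out_dir/infographic_coffee.png?"
```

The result is only a suggestion to show the user; the chosen directory is then passed to `--output-dir`.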
For Edit mode:
1. Get reference images:
   - If URLs provided → use directly
   - If local files → run `upload.sh` first to get URLs
2. Clarify prompt: What to do with references?
3. Show price & confirm: Same as generate
4. Launch subagent with `edit.sh`
5. Report result
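The first Edit step (URLs used directly, local files uploaded first) could be sketched like this. `upload_file` and `build_image_urls` are hypothetical helper names; the sketch relies only on `upload.sh --file` printing a URL, as shown in the Scripts section, and on the 14-reference limit stated for Edit mode.

```sh
# Hypothetical wrapper around the skill's upload script; prints the hosted URL.
upload_file() {
    sh scripts/upload.sh --file "$1"
}

# Hypothetical helper: turn a mix of URLs and local paths into the
# comma-separated value expected by --image-urls.
build_image_urls() {
    if [ "$#" -gt 14 ]; then
        echo "too many reference images (max 14)" >&2
        return 1
    fi
    urls=""
    for ref in "$@"; do
        case $ref in
            http://*|https://*) url=$ref ;;              # already a URL
            *) url=$(upload_file "$ref") || return 1 ;;  # local file: upload first
        esac
        urls=${urls:+$urls,}$url
    done
    printf '%s\n' "$urls"
}

# Usage (not run here, needs FAL_KEY and network access):
#   IMAGE_URLS=$(build_image_urls ./logo.png https://example.com/img1.png)
#   sh scripts/edit.sh --prompt "combine these into a collage" --image-urls "$IMAGE_URLS"
```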
Scripts
generate.sh
sh scripts/generate.sh \
--prompt "infographic about coffee brewing" \
--aspect-ratio "9:16" \
--resolution "1K" \
--output-dir "./images" \
--filename "coffee_infographic"
| Param | Required | Default | Values |
|---|---|---|---|
| `--prompt` | yes | - | text |
| `--aspect-ratio` | no | 1:1 | 21:9, 16:9, 3:2, 4:3, 5:4, 1:1, 4:5, 3:4, 2:3, 9:16 |
| `--resolution` | no | 1K | 1K, 2K, 4K |
| `--num-images` | no | 1 | 1-4 |
| `--output-dir` | no | - | path |
| `--filename` | no | generated | base name |
| `--web-search` | no | false | flag |
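Since parameters should not be guessed, a pre-flight check against the generate.sh values can catch typos before a paid request is made. This is a sketch only: `valid_aspect_ratio` and `valid_resolution` are hypothetical helpers that mirror the listed values, and generate.sh may perform its own validation.

```sh
# Hypothetical pre-flight checks mirroring the documented generate.sh values.
valid_aspect_ratio() {
    case $1 in
        21:9|16:9|3:2|4:3|5:4|1:1|4:5|3:4|2:3|9:16) return 0 ;;
        *) return 1 ;;
    esac
}

valid_resolution() {
    case $1 in
        1K|2K|4K) return 0 ;;
        *) return 1 ;;
    esac
}

if valid_aspect_ratio "9:16" && valid_resolution "1K"; then
    echo "params ok"
fi
```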
edit.sh
sh scripts/edit.sh \
--prompt "combine these into a collage" \
--image-urls "https://example.com/img1.png,https://example.com/img2.png" \
--aspect-ratio "16:9" \
--output-dir "./images" \
--filename "collage"
| Param | Required | Default | Values |
|---|---|---|---|
| `--prompt` | yes | - | text |
| `--image-urls` | yes | - | comma-separated URLs (max 14) |
| `--aspect-ratio` | no | auto | auto, 1:1, 16:9, etc. |
| `--resolution` | no | 1K | 1K, 2K, 4K |
| `--num-images` | no | 1 | 1-4 variations |
| `--output-dir` | no | - | path |
| `--filename` | no | edited | base name |
upload.sh (for local files)
# Get URL for local file
URL=$(sh scripts/upload.sh --file /path/to/image.png)
# Or get base64 data URI (for small files)
URI=$(sh scripts/upload.sh --file /path/to/image.png --base64)
Parallel Generation
For multiple images — launch several subagents in parallel:
Task 1: generate "cat in space" → cat_space.png
Task 2: generate "dog on moon" → dog_moon.png
Task 3: generate "bird in ocean" → bird_ocean.png
Each task runs independently via Haiku; results are collected as the tasks finish.
Pricing
- Generate: $0.15/image
- Edit: $0.15/edit
- 4K resolution: $0.30 (2x)
- Web search: +$0.015
Formula: price = num_images * (resolution == "4K" ? 0.30 : 0.15) + (web_search ? 0.015 : 0)
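The formula can be evaluated in plain POSIX sh before asking the user to confirm. The helper name `estimate_price` is hypothetical; it works in thousandths of a dollar so that integer arithmetic is enough.

```sh
# Hypothetical helper: estimate cost in USD from the pricing formula.
# Amounts are kept in thousandths of a dollar to avoid floating point.
estimate_price() {
    num_images=$1
    resolution=$2
    web_search=${3:-false}
    if [ "$resolution" = "4K" ]; then per_image=300; else per_image=150; fi
    if [ "$web_search" = "true" ]; then extra=15; else extra=0; fi
    total=$((num_images * per_image + extra))
    printf '%d.%03d\n' $((total / 1000)) $((total % 1000))
}

estimate_price 1 1K          # prints 0.150
estimate_price 2 4K true     # prints 0.615
```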
Notes
- URLs expire in ~1 hour — save locally if needed
- Uploaded files are stored on fal.ai for 7 days, then auto-deleted
- Model excels at text rendering and infographics