Agent skill

create-image

Create images using AI generation (FLUX.1-schnell, Ollama), Mermaid diagrams, or placeholders. Supports multiple backends with automatic fallback.

View SKILL.md on GitHub Repository

Stars 163

Forks 31

Install this agent skill to your Project

npx add-skill https://github.com/majiayu000/claude-skill-registry/tree/main/skills/data/create-image

Metadata

Additional technical details for this skill

short description: Create images (AI-generated, Mermaid, placeholders)

SKILL.md

create-image

Generate images using FREE AI image generation backends.

Features

Ollama (local) - Z-Image Turbo or FLUX2-Klein via Ollama (FREE, no internet)
FLUX.1-schnell - AI-generated images via HuggingFace (FREE remote)
Mermaid diagrams - Flowcharts and architecture diagrams (FREE)
Placeholder images - Random grayscale from picsum.photos (FREE)
Solid color - Gray box with text label (always works)
Size control - Specify dimensions for PDF embedding (auto-resized)

Quick Start

bash

cd .pi/skills/create-image

# Generate an AI image (uses FLUX.1-schnell)
uv run --script generate.py "hardware verification flowchart for microprocessor" \
  --output test_figure.png \
  --size 400x600

# Generate with specific backend
uv run --script generate.py "network security architecture" \
  --output security_arch.png \
  --size 800x600 \
  --backend flux

# Use placeholder fallback
uv run --script generate.py "placeholder" \
  --output placeholder.png \
  --size 400x300 \
  --backend placeholder

Commands

`generate` - Create an image

bash

uv run --script generate.py "<prompt>" [options]

Arguments:

Argument	Description
`prompt`	Description of the image to generate

Options:

Option	Short	Description	Default
`--output`	`-o`	Output file path	`fixture_image.png`
`--size`	`-s`	Image dimensions (WxH)	`512x512`
`--backend`	`-b`	Generation backend	`auto`

Backends

Backend	Description	Requires	Cost
`ollama`	Z-Image/FLUX2 local generation	Ollama + model	FREE (local)
`flux`	FLUX.1-schnell AI generation	`HF_TOKEN`	FREE (remote)
`mermaid`	Flowchart/diagram generation	`mmdc` CLI	FREE
`placeholder`	picsum.photos (grayscale)	Nothing	FREE
`solid`	Gray box with text label	Pillow	FREE
`auto`	Try backends in order	Any available	-

Setup

Option 1: Ollama (macOS only - MLX framework)

Note: Ollama image generation currently only works on macOS (Apple Silicon). Linux/NVIDIA support is "coming soon" per Ollama docs.

bash

# macOS only
ollama pull x/z-image-turbo
# or
ollama pull x/flux2-klein

Option 2: HuggingFace Token (FREE Remote)

Get a FREE HuggingFace token from huggingface.co/settings/tokens:

bash

export HF_TOKEN="hf_your_token_here"

Option 3: Mermaid (for Diagrams)

bash

npm install -g @mermaid-js/mermaid-cli

Example Prompts

Security Documents

bash

"APT attack kill chain diagram with reconnaissance, weaponization, delivery, exploitation phases"
"network intrusion detection system architecture"
"malware analysis workflow flowchart"

Engineering Documents

bash

"hardware verification flow for microprocessor with RTL, synthesis, and timing analysis"
"FPGA design pipeline from HDL to bitstream"
"embedded systems boot sequence diagram"

Scientific Documents

bash

"machine learning pipeline with data preprocessing, training, and inference stages"
"experimental methodology flowchart"
"system architecture diagram with numbered components"

Cached Images (Reuse Before Generating)

Pre-generated images are available in cached_images/ - use these first to avoid unnecessary API calls:

File	Description	Size
`decorative.png`	Abstract cover/decorative illustration	512x512
`flowchart.png`	Technical workflow/process diagram	512x512
`network_arch.png`	Network/system architecture diagram	512x512

bash

# Copy cached image instead of generating
cp cached_images/flowchart.png /path/to/output.png

Integration with PDF Generation

After generating images, embed them in PDFs:

python

import fitz  # PyMuPDF

doc = fitz.open()
page = doc.new_page()

# Insert generated image
img_rect = fitz.Rect(50, 200, 450, 500)  # x0, y0, x1, y1
page.insert_image(img_rect, filename="test_figure.png")

doc.save("fixture_with_figure.pdf")

Dependencies

toml

dependencies = [
    "huggingface_hub>=0.26.0",
    "httpx",
    "typer",
    "pillow",
]

Maintainer

majiayu000 Core maintainer

Source details

Full Name: majiayu000/claude-skill-registry
Branch: main
Path in repo: skills/data/create-image
License: MIT License

Featured Tools

Join Our Newsletter

Stay updated with the latest AI tools, news, and offers by subscribing to our weekly newsletter.

Recommended Agent Skills

Expand your agent's capabilities with these related and highly-rated skills.

majiayu000/claude-skill-registry

agent-ops-spec

Manage specification documents in .agent/specs/. Use when user provides requirements, acceptance criteria, or feature descriptions that need to be tracked and validated against implementation.

163 31

Explore

majiayu000/claude-skill-registry

agent-ops-state

Maintain .agent state files. Use at session start, after meaningful steps, and before concluding: read/update constitution/memory/focus/issues/baseline consistently.

163 31

Explore

majiayu000/claude-skill-registry

agent-ops-spec

Manage specification documents in .agent/specs/. Use when user provides requirements, acceptance criteria, or feature descriptions that need to be tracked and validated against implementation.

163 31

Explore

majiayu000/claude-skill-registry

agent-ops-testing

Test strategy, execution, and coverage analysis. Use when designing tests, running test suites, or analyzing test results beyond baseline checks.

163 31

Explore

majiayu000/claude-skill-registry

agent-ops-testing

Test strategy, execution, and coverage analysis. Use when designing tests, running test suites, or analyzing test results beyond baseline checks.

163 31

Explore

majiayu000/claude-skill-registry

agent-ops-state

Maintain .agent state files. Use at session start, after meaningful steps, and before concluding: read/update constitution/memory/focus/issues/baseline consistently.

163 31

Explore

Didn't find tool you were looking for?

Search AI Tools

Install this agent skill to your Project

Metadata

SKILL.md

create-image

Features

Quick Start

Commands

generate - Create an image

Backends

Setup

Option 1: Ollama (macOS only - MLX framework)

Option 2: HuggingFace Token (FREE Remote)

Option 3: Mermaid (for Diagrams)

Example Prompts

Security Documents

Engineering Documents

Scientific Documents

Cached Images (Reuse Before Generating)

Integration with PDF Generation

Dependencies

Recommended Agent Skills

agent-ops-spec

agent-ops-state

agent-ops-spec

agent-ops-testing

agent-ops-testing

agent-ops-state

`generate` - Create an image