Agent skill

openai-image-gen

Batch-generate images via OpenAI Images API. Random prompt sampler + `index.html` gallery.

Stars 163
Forks 31

Install this agent skill to your Project

npx add-skill https://github.com/majiayu000/claude-skill-registry/tree/main/skills/data/openai-image-gen-youjjwal-clawdbot

Metadata

Additional technical details for this skill

moltbot
{
    "emoji": "\ud83d\uddbc\ufe0f",
    "install": [
        {
            "id": "python-brew",
            "bins": [
                "python3"
            ],
            "kind": "brew",
            "label": "Install Python (brew)",
            "formula": "python"
        }
    ],
    "requires": {
        "env": [
            "OPENAI_API_KEY"
        ],
        "bins": [
            "python3"
        ]
    },
    "primaryEnv": "OPENAI_API_KEY"
}

SKILL.md

OpenAI Image Gen

Generate a handful of “random but structured” prompts and render them via the OpenAI Images API.

Run

bash
python3 {baseDir}/scripts/gen.py
open ~/Projects/tmp/openai-image-gen-*/index.html  # if ~/Projects/tmp exists; else ./tmp/...

Useful flags:

bash
# GPT image models with various options
python3 {baseDir}/scripts/gen.py --count 16 --model gpt-image-1
python3 {baseDir}/scripts/gen.py --prompt "ultra-detailed studio photo of a lobster astronaut" --count 4
python3 {baseDir}/scripts/gen.py --size 1536x1024 --quality high --out-dir ./out/images
python3 {baseDir}/scripts/gen.py --model gpt-image-1.5 --background transparent --output-format webp

# DALL-E 3 (note: count is automatically limited to 1)
python3 {baseDir}/scripts/gen.py --model dall-e-3 --quality hd --size 1792x1024 --style vivid
python3 {baseDir}/scripts/gen.py --model dall-e-3 --style natural --prompt "serene mountain landscape"

# DALL-E 2
python3 {baseDir}/scripts/gen.py --model dall-e-2 --size 512x512 --count 4

Model-Specific Parameters

Different models support different parameter values. The script automatically selects appropriate defaults based on the model.

Size

  • GPT image models (gpt-image-1, gpt-image-1-mini, gpt-image-1.5): 1024x1024, 1536x1024 (landscape), 1024x1536 (portrait), or auto
    • Default: 1024x1024
  • dall-e-3: 1024x1024, 1792x1024, or 1024x1792
    • Default: 1024x1024
  • dall-e-2: 256x256, 512x512, or 1024x1024
    • Default: 1024x1024

Quality

  • GPT image models: auto, high, medium, or low
    • Default: high
  • dall-e-3: hd or standard
    • Default: standard
  • dall-e-2: standard only
    • Default: standard

Other Notable Differences

  • dall-e-3 only supports generating 1 image at a time (n=1). The script automatically limits count to 1 when using this model.
  • GPT image models support additional parameters:
    • --background: transparent, opaque, or auto (default)
    • --output-format: png (default), jpeg, or webp
    • Note: stream and moderation are available via API but not yet implemented in this script
  • dall-e-3 has a --style parameter: vivid (hyper-real, dramatic) or natural (more natural looking)

Output

  • *.png, *.jpeg, or *.webp images (output format depends on model + --output-format)
  • prompts.json (prompt → file mapping)
  • index.html (thumbnail gallery)

Didn't find tool you were looking for?

Be as detailed as possible for better results