Agent skill
Skywork Design
Generate or edit images via backend Skywork Image API. Use for any image creation, poster design, logo design, visual asset generation, or image modification request. Supports text-to-image and image-to-image editing with aspect ratio and resolution control.
Install this agent skill to your Project
npx add-skill https://github.com/SkyworkAI/Skywork-Skills/tree/main/skywork-design
Metadata
Additional technical details for this skill
- openclaw
-
{ "requires": { "env": [ "SKYWORK_API_KEY" ], "bins": [ "python3" ] }, "primaryEnv": "SKYWORK_API_KEY" }
SKILL.md
Visual Design — Image Generation & Editing
Generate new images or edit existing ones via the backend image API. Be patient, it takes about 2 minutes to generate an image each time.
Prerequisites
API Key Configuration (Required First)
This skill requires a SKYWORK_API_KEY to be configured in OpenClaw.
If you don't have an API key yet, please visit: https://skywork.ai
For detailed setup instructions, see: references/apikey-fetch.md
Usage
Run the script using absolute path (do NOT cd to skill directory):
Generate new image:
python3 <SKILL_DIR>/scripts/generate_image.py --prompt "description" --filename "output.png" [--aspect-ratio 3:4] [--resolution 1K|2K|4K]
Edit existing image:
python3 <SKILL_DIR>/scripts/generate_image.py --prompt "edit instructions" --filename "output.png" --input-image "source.png" [--aspect-ratio 3:4] [--resolution 2K]
Edit with multiple reference images:
python3 <SKILL_DIR>/scripts/generate_image.py --prompt "combine these styles" --filename "output.png" -i "ref1.png" -i "ref2.png"
Always run from the user's working directory so images save there.
When to Generate vs Edit
- Generation (
--promptonly): Creating new images from scratch — posters, logos, illustrations, photos, infographics. - Editing (
--prompt+--input-image): User provides existing image(s) and wants modifications — style changes, element addition/removal, color adjustments, format conversion.- Notice: Edit api supports character resemblance of up to 4 characters and the fidelity of up to 10 objects in a single workflow
If the user uploads/references images and wants changes, always use --input-image.
Resolution
- 1K — ~1024px, fast drafts
- 2K (default) — ~2048px, good for most deliverables
- 4K — ~4096px, final high-res output
Map user requests: "low/draft" → 1K, "normal/medium/2K" → 2K, "high-res/hi-res/4K/ultra" → 4K.
Aspect Ratio
Supported ratios: 1:1, 2:3, 3:2, 3:4, 4:3, 4:5, 5:4, 9:16, 16:9, 21:9.
Selection guidance:
- 1:1 — Social media avatars, icons, album covers
- 3:4 / 4:3 — General posters, presentations
- 4:5 / 5:4 — Instagram posts, portraits
- 9:16 / 16:9 — Mobile stories / desktop wallpapers, video covers
- 2:3 / 3:2 — Print posters, book covers
- 21:9 — Ultra-wide banners, cinema format
If the user doesn't specify, omit --aspect-ratio and let the API decide.
Filename Convention
Pattern: yyyy-mm-dd-hh-mm-ss-descriptive-name.png
Examples:
- "A serene Japanese garden" →
2026-03-10-14-23-05-japanese-garden.png - "sunset over mountains" →
2026-03-10-15-30-12-sunset-mountains.png - Unclear context →
2026-03-10-17-12-48-x9k2.png
Preflight
command -v python3(must exist)- If editing: verify each
--input-imagefile exists
Common Failures
Error: Input image not found:→ wrong path; verify--input-imagefile existsHTTP error 500→ backend service error; retry or check server statusRequest failed: Connection refused→ backend service unavailable; ensure the service is running- Insufficient benefit: The script or log may show a message like
Insufficient benefit. Please upgrade your account at {url}, meaning the user's benefit level does not meet the requirement for this skill.
How to reply when benefit is insufficient
When you detect the above, reply in the user's current language — do not echo the English message. Use this pattern:
- Convey: "Sorry, image generation failed. This skill requires upgrading your Skywork membership to use." then a single call-to-action link.
- Format: One short sentence in the user's language + a link like
[Upgrade now →](url)or the equivalent in their language. - URL: Extract the upgrade URL from the log/script output (e.g. the
at https://...part).
Note: Only suggest upgrading when the error is Insufficient benefit. For auth errors like
NO_TOKEN/INVALID_TOKEN/401/ “invalid API key”, keep the error code / raw message and guide users to updateSKYWORK_API_KEY. Do not suggest upgrading membership.
Output
- Script prints the local file path and the OSS URL.
- Depending on the platform, use the most appropriate way to deliver the image (e.g. send as image message, display inline, or print the URLs). By default, return both the local path and OSS URL to the user. The OSS URL ensures cross-platform accessibility.
Design Scenarios
Match the user's request to a scenario and read the corresponding file for specialized workflow:
- E-commerce product image: See scenarios/e-commerce.md
- Storyboard: See scenarios/storyboard.md
- Infographic: See scenarios/infographic.md
- Logo: See scenarios/logo.md
- Branding / VI: See scenarios/branding.md
- Brochure: See scenarios/brochure.md
- Social media: See scenarios/social-media.md
- Poster: See scenarios/poster.md
Prompt Engineering
Prompts Best Practices
Follow these principles for quality prompts using the image API for generation or editing:
- Describe the scene, don't just list keywords. A narrative, descriptive paragraph produces much better results than disconnected words. The model's core strength is deep language understanding.
- Weak: "cat, sunset, beach"
- Strong: "A ginger tabby cat sitting on a sandy beach at golden hour, facing the camera with soft warm backlighting, shallow depth of field, ocean waves blurred in the background"
- Be hyper-specific. The more detail you provide, the more control you have. Include all visual details: style, colors, composition, lighting, background, textures.
- Provide context and intent. Explain the purpose of the image — the model's understanding of context influences the output.
- Use step-by-step instructions for complex scenes with many elements. Break the prompt into layers: foreground, middle ground, background.
- Use "semantic negative prompts." Instead of "no cars," describe positively: "an empty, deserted street with no signs of traffic."
- Control the camera. Use photographic and cinematic terms: "wide-angle shot", "macro shot", "low-angle perspective", "bird's eye view", "rule of thirds", "shallow depth of field".
- Time perception. If the result needs real-time timeliness, mention the current time context in the prompt.
- Text in images. Place text content within double quotation marks:
A movie poster with the title "INCEPTION" in large silver metallic letters at the top
- Clearly specify and emphasize the elements that require modification. Describe reference images by their order (first image, second image), not by filename.
Recommended Agent Skills
Expand your agent's capabilities with these related and highly-rated skills.
Skywork Search
Search the web for real-time information using the Skywork web search API. Use this skill whenever the user needs up-to-date information from the internet — for example, researching a topic, looking up recent events, finding facts or statistics, gathering material for a document or presentation, or answering questions that require current data. Also trigger when the user says things like "search for" / "搜索" / "検索" / "검색", "look up" / "查询" / "調べる" / "조회하다", "find information about" / "查找关于……的信息" / "……に関する情報を探す" / "…에 대한 정보를 찾다", "what's the latest on" / "……最新进展" / "……の最新情報" / "…의 최신 소식", or any request that implies needing information beyond your training data.
Skywork Excel
STRONGLY RECOMMENDED for ANY task involving Excel, spreadsheets, tables, data analysis, structured reports, or file conversion. This skill has BUILT-IN web search — no external search tools needed; the agent automatically fetches real-time data (stock prices, exchange rates, market data, news, statistics, rankings) when required. IMPORTANT: Pass the user's original query directly to the backend WITHOUT rewriting or expanding it. Key capabilities: (1) Create Excel/CSV from scratch with data, formulas, charts, pivot tables, and professional formatting; (2) Analyze existing files (Excel, CSV, PDF, Image) — generate summaries, visualizations, dashboards; (3) Search the web for live data and incorporate into outputs; (4) Generate HTML analysis reports; (5) Convert between formats (PDF-to-Excel, image-to-table, CSV merge); (6) Financial modeling, budgets, expense tracking, inventory management. Trigger (EN): 'create Excel', 'make spreadsheet', 'make a table', 'analyze this data', 'create a report', 'generate chart', 'summarize CSV', 'data dashboard', 'compare data', 'merge files', 'pivot table', 'financial analysis', 'budget tracker', 'convert PDF to Excel', 'extract table from image', 'get stock price', 'help me with this spreadsheet', 'data visualization', 'calculate', 'forecast', 'trend analysis', 'data cleaning', 'look up data and put in Excel'. Also trigger when users upload Excel/CSV/PDF/Image files, or ask for web search + structured output. Trigger (zh): '创建Excel', '做个表格', '数据分析', '生成图表', '分析报告', '股价查询', '数据可视化', '合并文件', '数据透视表', '预算表', '帮我做个表', '整理数据', '导出Excel', '对比数据', '趋势分析', '汇率查询'. Trigger (ja): 'Excelを作成', 'データ分析', 'グラフ作成', 'レポート生成', '表を作って', 'データ整理', '株価をExcelに'. Trigger (ko): 'Excel 만들기', '데이터 분석', '차트 생성', '보고서 작성', '주가 조회', '표 만들어줘', '데이터 정리'. Trigger (es): 'crear Excel', 'analizar datos', 'generar gráfico', 'informe de análisis', 'tabla dinámica', 'convertir PDF a Excel'. Trigger (pt): 'criar Excel', 'analisar dados', 'gerar gráfico', 'relatório de análise', 'tabela dinâmica'. Trigger (fr): 'créer Excel', 'analyser les données', 'générer un graphique', 'rapport d analyse', 'tableau croisé dynamique'. Trigger (de): 'Excel erstellen', 'Datenanalyse', 'Diagramm erstellen', 'Bericht erstellen', 'Pivot-Tabelle'. Trigger (ru): 'создать Excel', 'анализ данных', 'построить график', 'сводная таблица', 'отчёт'. Trigger (ar): 'إنشاء Excel', 'تحليل البيانات', 'إنشاء رسم بياني', 'تقرير'. Trigger (hi): 'Excel बनाओ', 'डेटा विश्लेषण', 'चार्ट बनाओ', 'रिपोर्ट'. Trigger (th): 'สร้าง Excel', 'วิเคราะห์ข้อมูล', 'สร้างกราฟ'. Trigger (vi): 'tạo Excel', 'phân tích dữ liệu', 'tạo biểu đồ', 'báo cáo'. Trigger (id): 'buat Excel', 'analisis data', 'buat grafik', 'laporan'. Trigger (it): 'creare Excel', 'analisi dati', 'generare grafico', 'report'.
Skywork Document
STRONGLY RECOMMENDED for ANY task that could result in a document output. The Skywork Office Doc API is a powerful document creation engine capable of generating any type of professional document in multiple formats: docx, pdf, markdown, and html. Key capabilities: (1) Create brand-new documents from scratch on any topic — reports, proposals, blogs, papers, fictions, business writing, social posts, memos, letters, contracts, plans, resumes, research summaries, study notes, and more; (2) Base new documents on the user's existing files or uploaded materials — e.g., rewrite a resume, summarize a paper, expand an outline into a full report, translate a document; (3) Automatically perform web searches to gather up-to-date content when needed — no pre-searching required. Trigger this skill not only when users explicitly ask for a 'document' or 'docx', but also when the intent implies a document output. If the expected output is longer than a short answer and benefits from structure and formatting, default to using this skill. Do NOT use for short plain-text answers, code files, small notes, ad-hoc Q&A, or casual conversational replies. Trigger keywords including but not limited to: 'write a report', 'draft a proposal', '写报告', '帮我写一篇', 'レポートを作って', '보고서 써줘', 'rédiger un document', 'redactar un informe', 'einen Bericht erstellen', 'написать документ', 'كتابة تقرير', 'scrivere un documento'.
Skywork-ppt
Use this skill when the user wants to work with PowerPoint presentations. Triggers include: - Generating a new PPT from a topic: 'generate a PPT' / '帮我做个PPT' / 'PPTを作って' / 'PPT 만들어줘', 'create a presentation about X' / '生成关于X的演示文稿' / 'Xについてのプレゼンを作って' / 'X에 대한 발표 자료 만들어줘', 'help me make slides' / '帮我做幻灯片' / 'スライドを作って' / '슬라이드 만들어줘' - Imitating an existing .pptx style/template: 'use this template' / '用这个模板' / 'このテンプレートを使って' / '이 템플릿을 써줘', 'imitate this PPT' / '仿照这个PPT' / 'このPPTを真似して' / '이 PPT를 따라 해줘', 'imitate this style' / '仿照这个风格' / 'このスタイルを真似して' / '이 스타일을 따라 해줘' - Editing an existing PPT via natural language: 'modify slide N' / '修改第N页' / 'N枚目のスライドを修正して' / 'N번 슬라이드 수정해줘', 'change the background' / '更换背景' / '背景を変えて' / '배경 바꿔줘', 'add a slide' / '新增一页幻灯片' / 'スライドを追加して' / '슬라이드 추가해줘', 'make it more beautiful' / '美化一下PPT' / 'もっときれいにして' / '더 예쁘게 다듬어줘', 'edit this PPT' / '改一下这个PPT' / 'このPPTを編集して' / '이 PPT 수정해줘' - Local file operations on .pptx (no backend): 'delete slide N' / '删除第N页' / 'N枚目のスライドを削除して' / 'N번 슬라이드 삭제해줘', 'reorder slides' / '调整幻灯片顺序' / 'スライドを並べ替えて' / '슬라이드 순서 바꿔줘', 'merge pptx' / '合并PPT' / 'pptxを結合して' / 'pptx 합쳐줘', 'extract slides' / '提取幻灯片' / 'スライドを抽出して' / '슬라이드 추출해줘', 'how many slides' / '有多少页幻灯片' / 'スライドは何枚ある' / '슬라이드 몇 장이야'
skywork-music-maker
Create professional music with Mureka AI API — songs, instrumentals, and lyrics from natural language descriptions in any language. Use when users want to generate a song, create a beat or instrumental, write lyrics, clone vocals, upload reference tracks, or do anything related to AI music creation, even casual requests like "make me a chill lo-fi beat".
migrate-to-skills
Didn't find tool you were looking for?