Agent skill
tts-skill
Multi-engine text-to-speech skill. Supports Qwen3-TTS local voice cloning, VoiceCraft online TTS, and OpenAI TTS.
Install this agent skill to your Project
npx add-skill https://github.com/aiskillstore/marketplace/tree/main/skills/tabortao/tts-skill
SKILL.md
đď¸ TTS-Skill â Multi-Engine Text-to-Speech
TTS-Skill provides a single entrypoint for generating speech using multiple backends, with consistent output naming and progress feedback for long-running jobs.
Engines
- qwen3-tts: local voice cloning with a reference audio + transcript
- edge-tts: online voices with speed/pitch/style controls
- openai-tts: OpenAI speech generation via API
Command Syntax
/tts-skill [engine] [text] --voice [voice-keyword] [other options]
If you use the Python entrypoint:
python tts-skill.py [engine] [text] --voice [voice-keyword]
Text Input
Pass text as a positional argument, or use --text-file / -f to read from a file.
Example:
python tts-skill.py qwen3-tts --text-file "input\\text.txt" --voice ĺŻĺ°ĺ°ć
Notes:
--text-filesupports relative and absolute paths; relative paths are resolved from your current working directory- If both positional text and
--text-fileare provided,--text-filetakes priority - UTF-8 is recommended (UTF-8 BOM is supported); on decode error it falls back to GBK
You can also call engine scripts directly:
python engines/qwen3-tts-cli.py --text-file "input\\text.txt" --voice ĺŻĺ°ĺ°ć
python engines/edge-tts-cli.py --text-file "input\\text.txt" --voice xiaoxiao
python engines/openai-tts-cli.py --text-file "input\\text.txt" --voice alloy
Local Voice Assets (Qwen3-TTS)
To add a clone voice, put a matching pair of files in assets/:
assets/Lei.wav
assets/Lei.txt
Supported audio formats: .wav, .mp3, .m4a, .flac.
Then:
python tts-skill.py qwen3-tts "ćľčŻććŹ" --voice Lei
Output
If --output is not provided:
- Output directory:
output/ - Filename pattern:
YYYYMMDD_HHMMSS_<first-6-chars>.<ext>
Progress & Timing (Qwen3-TTS)
Qwen3-TTS jobs print a live progress bar with ETA. After completion, tts-skill.py prints:
- total runtime
- total chars and Chinese chars
- average seconds per Chinese character (or per char if no Chinese)
Project Layout
tts-skill/
âââ .trae/
â âââ plans/
âââ assets/
â âââ Lei.txt
â âââ ĺŻĺ°ĺ°ć.txt
â âââ ĺ¸éč¨.txt
â âââ 达俥.txt
âââ engines/
â âââ edge-tts-cli.py
â âââ edge-tts.config
â âââ openai-tts-cli.py
â âââ openai-tts.config
â âââ qwen3-tts-cli.py
â âââ qwen3-tts.config
âââ input/
â âââ text.txt
âââ output/
âââ tts-skill.py
âââ INSTALL.md
âââ INSTALL.zh-CN.md
âââ README.md
âââ README.zh-CN.md
âââ SKILL.md
âââ SKILL.zh-CN.md
Chinese Spec
See SKILL.zh-CN.md.
Recommended Agent Skills
Expand your agent's capabilities with these related and highly-rated skills.
perigon-backend
Perigon ASP.NET Core + EF Core + Aspire conventions
perigon-agent
Pointers for Copilot/agents to apply Perigon conventions
perigon-angular
Angular 21+ standalone/Material/signal conventions for Perigon WebApp
fastapi-mastery
Comprehensive FastAPI development skill covering REST API creation, routing, request/response handling, validation, authentication, database integration, middleware, and deployment. Use when working with FastAPI projects, building APIs, implementing CRUD operations, setting up authentication/authorization, integrating databases (SQL/NoSQL), adding middleware, handling WebSockets, or deploying FastAPI applications. Triggered by requests involving .py files with FastAPI code, API endpoint creation, Pydantic models, or FastAPI-specific features.
context7-efficient
Token-efficient library documentation fetcher using Context7 MCP with 86.8% token savings through intelligent shell pipeline filtering. Fetches code examples, API references, and best practices for JavaScript, Python, Go, Rust, and other libraries. Use when users ask about library documentation, need code examples, want API usage patterns, are learning a new framework, need syntax reference, or troubleshooting with library-specific information. Triggers include questions like "Show me React hooks", "How do I use Prisma", "What's the Next.js routing syntax", or any request for library/framework documentation.
browser-use
Browser automation using Playwright MCP. Navigate websites, fill forms, click elements, take screenshots, and extract data. Use when tasks require web browsing, form submission, web scraping, UI testing, or any browser interaction.
Didn't find tool you were looking for?