Agent skill
tts-skill
Multi-engine text-to-speech skill. Supports Qwen3-TTS local voice cloning, VoiceCraft online TTS, and OpenAI TTS.
Install this agent skill to your Project
npx add-skill https://github.com/aiskillstore/marketplace/tree/main/skills/tabortao/tts-skill
SKILL.md
đď¸ TTS-Skill â Multi-Engine Text-to-Speech
TTS-Skill provides a single entrypoint for generating speech using multiple backends, with consistent output naming and progress feedback for long-running jobs.
Engines
- qwen3-tts: local voice cloning with a reference audio + transcript
- edge-tts: online voices with speed/pitch/style controls
- openai-tts: OpenAI speech generation via API
Command Syntax
/tts-skill [engine] [text] --voice [voice-keyword] [other options]
If you use the Python entrypoint:
python tts-skill.py [engine] [text] --voice [voice-keyword]
Text Input
Pass text as a positional argument, or use --text-file / -f to read from a file.
Example:
python tts-skill.py qwen3-tts --text-file "input\\text.txt" --voice ĺŻĺ°ĺ°ć
Notes:
--text-filesupports relative and absolute paths; relative paths are resolved from your current working directory- If both positional text and
--text-fileare provided,--text-filetakes priority - UTF-8 is recommended (UTF-8 BOM is supported); on decode error it falls back to GBK
You can also call engine scripts directly:
python engines/qwen3-tts-cli.py --text-file "input\\text.txt" --voice ĺŻĺ°ĺ°ć
python engines/edge-tts-cli.py --text-file "input\\text.txt" --voice xiaoxiao
python engines/openai-tts-cli.py --text-file "input\\text.txt" --voice alloy
Local Voice Assets (Qwen3-TTS)
To add a clone voice, put a matching pair of files in assets/:
assets/Lei.wav
assets/Lei.txt
Supported audio formats: .wav, .mp3, .m4a, .flac.
Then:
python tts-skill.py qwen3-tts "ćľčŻććŹ" --voice Lei
Output
If --output is not provided:
- Output directory:
output/ - Filename pattern:
YYYYMMDD_HHMMSS_<first-6-chars>.<ext>
Progress & Timing (Qwen3-TTS)
Qwen3-TTS jobs print a live progress bar with ETA. After completion, tts-skill.py prints:
- total runtime
- total chars and Chinese chars
- average seconds per Chinese character (or per char if no Chinese)
Project Layout
tts-skill/
âââ .trae/
â âââ plans/
âââ assets/
â âââ Lei.txt
â âââ ĺŻĺ°ĺ°ć.txt
â âââ ĺ¸éč¨.txt
â âââ 达俥.txt
âââ engines/
â âââ edge-tts-cli.py
â âââ edge-tts.config
â âââ openai-tts-cli.py
â âââ openai-tts.config
â âââ qwen3-tts-cli.py
â âââ qwen3-tts.config
âââ input/
â âââ text.txt
âââ output/
âââ tts-skill.py
âââ INSTALL.md
âââ INSTALL.zh-CN.md
âââ README.md
âââ README.zh-CN.md
âââ SKILL.md
âââ SKILL.zh-CN.md
Chinese Spec
See SKILL.zh-CN.md.
Didn't find tool you were looking for?