Agent skill

zai-tts

Text-to-speech conversion using GLM-TTS service via the `uvx zai-tts` command for generating audio from text. Use when (1) User requests audio/voice output with the "tts" trigger or keyword. (2) Content needs to be spoken rather than read (multitasking, accessibility, podcast, driving, cooking). (3) Using pre-cloned voices for speech.

View SKILL.md on GitHub Repository

Stars 110

Forks 12

Install this agent skill to your Project

npx add-skill https://github.com/aahl/skills/tree/main/skills/zai-tts

SKILL.md

Zai-TTS

Generate high-quality text-to-speech audio using GLM-TTS service via the uvx zai-tts command. Before using this skill, you need to configure the environment variables ZAI_AUDIO_USERID and ZAI_AUDIO_TOKEN, which can be obtained by login audio.z.ai and executing localStorage['auth-storage'] in the console via F12 Developer Tools.

Usage

shell

uvx zai-tts -t "{msg}" -o {tempdir}/{filename}.wav
uvx zai-tts -f path/to/file.txt -o {tempdir}/{filename}.wav

Changing speed, volume

shell

uvx zai-tts -t "{msg}" -o {tempdir}/{filename}.wav --speed 1.5
uvx zai-tts -t "{msg}" -o {tempdir}/{filename}.wav --speed 1.5 --volume 2

Changing the voice

shell

uvx zai-tts -t "{msg}" -o {tempdir}/{filename}.wav --voice system_002

Available voices

system_001: Lila. A cheerful, standard-pronunciation female voice system_002: Chloe. A gentle, elegant, intelligent female voice system_003: Ethan. A sunny, standard-pronunciation male voice

Retrieve all available voices using shell commands:

shell

uvx zai-tts -l

If you want to use custom voices, please complete voice cloning on the website audio.z.ai first.

Maintainer

aahl Core maintainer

Source details

Full Name: aahl/skills
Branch: main
Path in repo: skills/zai-tts
License: MIT License
Topics: agent-skills mcp ai-agents skills llm claude-skills home-assistant feishu lark

Featured Tools

Join Our Newsletter

Stay updated with the latest AI tools, news, and offers by subscribing to our weekly newsletter.

Recommended Agent Skills

Expand your agent's capabilities with these related and highly-rated skills.

aahl/skills

qwen-asr

Transcribe audio files using Qwen ASR. Use when the user sends voice messages and wants them converted to text.

110 12

Explore

aahl/skills

maishou

商品价格全网对比技能，获取商品在淘宝(Taobao)、天猫(TMall)、京东(JD.com)、拼多多(PinDuoDuo)、抖音(Douyin)、快手(KaiShou)的最优价格、优惠券，当用户想购物或者获取优惠信息时使用。Get the best price, coupons for goods on Chinese e-commerce platforms, compare product prices, and use when users want to shop or get discount information.

110 12

Explore

aahl/skills

tianqi

查询中国地区的天气预报，支持15天及逐小时预报、各种生活指数

110 12

Explore

aahl/skills

mcp-lark

Based on FeiShu(飞书) / Lark's OpenAPI MCP server, manage user information, chats, emails, cloud documents, multidimensional tables, tasks, calendars, etc.

110 12

Explore

aahl/skills

edge-tts

Text-to-speech conversion using `uvx edge-tts` for generating audio from text. Use when (1) User requests audio/voice output with the "tts" trigger or keyword. (2) Content needs to be spoken rather than read (multitasking, accessibility, driving, cooking). (3) User wants a specific voice, speed, pitch, or format for TTS output.

110 12

Explore

aahl/skills