Agent skill

voice

Convert text to speech audio using mb voice CLI. Use when the user asks you to speak, say something aloud, generate audio, or produce a voice recording.

View SKILL.md on GitHub Repository

Stars 596

Forks 79

Install this agent skill to your Project

npx add-skill https://github.com/xvirobotics/metabot/tree/main/src/skills/voice

SKILL.md

Text-to-Speech (Voice Output)

Generate MP3 audio from text using the mb voice CLI.

Quick Commands

bash

# Generate MP3, prints file path to stdout
mb voice "Hello, this is a test"

# Generate and play immediately
mb voice "Hello" --play

# Save to specific file
mb voice "Hello" -o greeting.mp3

# Override provider and voice
mb voice "Hello" --provider doubao --voice zh_female_wanqudashu_moon_bigtts

# Pipe text (useful for long content)
echo "Long text here" | mb voice
echo "Long text" | mb voice -o output.mp3

When to Use

User asks you to "say", "speak", "read aloud", or "generate audio/voice"
User wants a voice recording or audio version of text
User requests TTS (text-to-speech) output

Available Providers & Voices

Edge TTS (default, free, no key needed):

zh-CN-XiaoyiNeural (default) — Female Chinese
zh-CN-YunxiNeural — Male Chinese
zh-CN-XiaoxiaoNeural — Female Chinese
en-US-JennyNeural — Female English

Doubao (default when Volcengine keys configured):

zh_female_wanqudashu_moon_bigtts (default) — Female Chinese
Other Volcengine voice IDs from the TTS console

OpenAI (when OPENAI_API_KEY set):

alloy (default), echo, fable, onyx, nova, shimmer

ElevenLabs (when ELEVENLABS_API_KEY set):

Voice IDs from the ElevenLabs console

Text Limits

Doubao: ~300 Chinese characters (longer text is auto-truncated)
OpenAI / ElevenLabs / Edge: ~4000 characters

Guidelines

For short text (greetings, alerts), use inline: mb voice "text"
For longer text, pipe through stdin: echo "..." | mb voice
The output file is MP3 format
Use --play only when the user explicitly wants to hear the audio (it blocks until playback completes)
When saving files for the user, use -o with a descriptive filename
To send the audio to the user in Feishu, copy the file to the outputs directory: cp /tmp/mb-voice-xxx.mp3 /tmp/metabot-outputs/<chatId>/

Maintainer

xvirobotics Core maintainer

Source details

Full Name: xvirobotics/metabot
Branch: main
Path in repo: src/skills/voice
License: MIT License
Topics: claude-code anthropic ai-coding openclaw ai-agent multi-agent chatbot feishu lark telegram-bot voice-assistant claude-agent-sdk feishu-bot agent-native self-improving agent-organization metamemory metaskill task-scheduler wechat-bot

Featured Tools

Join Our Newsletter

Stay updated with the latest AI tools, news, and offers by subscribing to our weekly newsletter.

Recommended Agent Skills

Expand your agent's capabilities with these related and highly-rated skills.

xvirobotics/metabot

frontend-design

Create distinctive, production-grade frontend interfaces with high design quality. Use this skill when the user asks to build web components, pages, or applications. Generates creative, polished code that avoids generic AI aesthetics.

596 79

Explore

xvirobotics/metabot

doubao-tts

Generate high-quality speech audio using Doubao (豆包/Volcengine) TTS API. Use this skill when the user asks to generate audio, podcasts, voiceovers, or text-to-speech output.

596 79

Explore

xvirobotics/metabot

metamemory

Read and write shared memory documents. Use this when you need to save knowledge, notes, research findings, or project context for future reference across sessions. Also use it to look up previously stored information.

596 79

Explore

xvirobotics/metabot

metaskill

The meta-skill: create AI agent teams, individual agents, or custom skills for any project. Use when the user wants to generate a complete .claude/ agent team, create a single agent, or create a single skill.

596 79

Explore

xvirobotics/metabot

metabot

MetaBot HTTP API for agent collaboration: talk to other bots, schedule tasks, manage bots and peers. Use when the user wants to delegate work to another bot, schedule tasks, create/remove bots, or check peer status.

596 79

Explore

xvirobotics/metabot

skill-hub

Discover, search, and install shared skills from the Skill Hub registry. Use when the user wants to find available skills, share a skill with other bots, or install a skill from the hub.

596 79

Explore

Didn't find tool you were looking for?

Search AI Tools

Install this agent skill to your Project

SKILL.md

Text-to-Speech (Voice Output)

Quick Commands

When to Use

Available Providers & Voices

Text Limits

Guidelines

Recommended Agent Skills

frontend-design

doubao-tts

metamemory

metaskill

metabot

skill-hub