Agent skill
minimax
MiniMax API for AI models. Use when user mentions "MiniMax", "Chinese AI", or asks about MiniMax language models.
Install this agent skill to your Project
npx add-skill https://github.com/vm0-ai/vm0-skills/tree/main/minimax
SKILL.md
MiniMax API
Use the MiniMax API via direct curl calls for AI chat completion, text-to-speech, and video generation.
Official docs:
https://platform.minimax.io/docs
When to Use
Use this skill when you need to:
- Chat completion with Chinese-optimized LLM (MiniMax-M1/M2)
- Text-to-speech with natural voices and emotion control
- Video generation from text prompts (T2V)
- Image-to-video conversion (I2V)
Prerequisites
- Sign up at MiniMax Platform
- Go to Account Management > API Keys to create an API key
- Note: Global users should use
api.minimaxi.chat(with extra "i")
export MINIMAX_API_KEY="your-api-key"
API Hosts
| Region | Base URL |
|---|---|
| China | https://api.minimax.io |
| Global | https://api.minimaxi.chat |
How to Use
All examples below assume you have MINIMAX_API_KEY set.
Authentication uses Bearer token in the Authorization header.
1. Basic Chat Completion
Send a chat message:
Write to /tmp/minimax_request.json:
{
"model": "MiniMax-Text-01",
"messages": [
{"role": "system", "content": "You are a helpful assistant."},
{"role": "user", "content": "Hello, who are you?"}
]
}
Then run:
curl -s "https://api.minimax.io/v1/text/chatcompletion_v2" -X POST -H "Authorization: Bearer $(printenv MINIMAX_API_KEY)" -H "Content-Type: application/json" -d @/tmp/minimax_request.json | jq '.choices[0].message.content'
Available models:
MiniMax-M2: Reasoning model (best quality)MiniMax-M1: Reasoning model (balanced)MiniMax-Text-01: Standard model (fastest)
2. Chat with Temperature Control
Adjust creativity:
Write to /tmp/minimax_request.json:
{
"model": "MiniMax-Text-01",
"messages": [
{"role": "user", "content": "Write a short poem about AI."}
],
"temperature": 0.7,
"max_tokens": 200
}
Then run:
curl -s "https://api.minimax.io/v1/text/chatcompletion_v2" -X POST -H "Authorization: Bearer $(printenv MINIMAX_API_KEY)" -H "Content-Type: application/json" -d @/tmp/minimax_request.json | jq '.choices[0].message.content'
Parameters:
temperature(0-1): Higher = more creativetop_p(0-1, default 0.95): Sampling diversitymax_tokens: Maximum output tokens
3. Streaming Response
Get real-time output:
Write to /tmp/minimax_request.json:
{
"model": "MiniMax-M1",
"messages": [
{"role": "user", "content": "Explain quantum computing."}
],
"stream": true
}
Then run:
curl -s "https://api.minimax.io/v1/text/chatcompletion_v2" -X POST -H "Authorization: Bearer $(printenv MINIMAX_API_KEY)" -H "Content-Type: application/json" -d @/tmp/minimax_request.json
Streaming is recommended for reasoning models (M1/M2).
4. Reasoning Model (M1/M2)
Use reasoning models for complex tasks:
Write to /tmp/minimax_request.json:
{
"model": "MiniMax-M1",
"messages": [
{"role": "user", "content": "Solve step by step: A train travels 120km in 2 hours. What is its average speed in m/s?"}
],
"stream": true
}
Then run:
curl -s "https://api.minimax.io/v1/text/chatcompletion_v2" -X POST -H "Authorization: Bearer $(printenv MINIMAX_API_KEY)" -H "Content-Type: application/json" -d @/tmp/minimax_request.json
Response includes reasoning_content field with thought process.
5. Text-to-Speech (Basic)
Convert text to speech:
Write to /tmp/minimax_request.json:
{
"model": "speech-02-hd",
"text": "Hello, this is a test of MiniMax text to speech.",
"voice_id": "male-qn-qingse",
"speed": 1.0,
"format": "mp3"
}
Then run:
curl -s "https://api.minimax.io/v1/t2a_v2" -X POST -H "Authorization: Bearer $(printenv MINIMAX_API_KEY)" -H "Content-Type: application/json" -d @/tmp/minimax_request.json --output speech.mp3
6. TTS with Emotion
Add emotion to speech (speech-02 models):
Write to /tmp/minimax_request.json:
{
"model": "speech-02-hd",
"text": "I am so happy to meet you today!",
"voice_id": "female-shaonv",
"emotion": "happy",
"speed": 1.0,
"format": "mp3"
}
Then run:
curl -s "https://api.minimax.io/v1/t2a_v2" -X POST -H "Authorization: Bearer $(printenv MINIMAX_API_KEY)" -H "Content-Type: application/json" -d @/tmp/minimax_request.json --output happy_speech.mp3
Emotion options: happy, sad, angry, fearful, disgusted, surprised, neutral
7. TTS with Audio Settings
Fine-tune audio output:
Write to /tmp/minimax_request.json:
{
"model": "speech-02-hd",
"text": "High quality audio test.",
"voice_id": "male-qn-qingse",
"speed": 1.0,
"vol": 1.0,
"pitch": 0,
"audio_sample_rate": 32000,
"bitrate": 128000,
"format": "mp3"
}
Then run:
curl -s "https://api.minimax.io/v1/t2a_v2" -X POST -H "Authorization: Bearer $(printenv MINIMAX_API_KEY)" -H "Content-Type: application/json" -d @/tmp/minimax_request.json --output hq_speech.mp3
TTS models:
speech-02-hd: High definition (best quality)speech-02-turbo: Fast generationspeech-01-hd: Previous gen HDspeech-01-turbo: Previous gen fast
8. Text-to-Video (T2V)
Generate video from text prompt:
Write to /tmp/minimax_request.json:
{
"model": "T2V-01-Director",
"prompt": "A cat playing with a ball of yarn [Static shot].",
"duration": 6,
"resolution": "1080P"
}
Then run:
curl -s "https://api.minimax.io/v1/video_generation" -X POST -H "Authorization: Bearer $(printenv MINIMAX_API_KEY)" -H "Content-Type: application/json" -d @/tmp/minimax_request.json | jq '.task_id'
Video generation is async - returns a task ID to poll for completion.
9. T2V with Camera Control
Control camera movement in videos:
Write to /tmp/minimax_request.json:
{
"model": "MiniMax-Hailuo-2.3",
"prompt": "A person walking through a forest [Tracking shot], then stops to look at a bird [Push in].",
"duration": 6,
"resolution": "1080P"
}
Then run:
curl -s "https://api.minimax.io/v1/video_generation" -X POST -H "Authorization: Bearer $(printenv MINIMAX_API_KEY)" -H "Content-Type: application/json" -d @/tmp/minimax_request.json | jq '.task_id'
Camera commands (in brackets):
- Movement:
Truck left/right,Pan left/right,Push in/Pull out - Vertical:
Pedestal up/down,Tilt up/down - Zoom:
Zoom in/out - Special:
Shake,Tracking shot,Static shot
Combine with [Pan left, Pedestal up] (max 3 simultaneous).
10. Image-to-Video (I2V)
Generate video from an image:
Note: For I2V, use
MiniMax-Hailuo-2.3orS2V-01model which supportsfirst_frame_image. TheT2V-01-Directormodel is text-to-video only.
Write to /tmp/minimax_request.json:
{
"model": "MiniMax-Hailuo-2.3",
"prompt": "The scene comes to life with gentle movement [Static shot].",
"first_frame_image": "https://example.com/image.jpg",
"duration": 6,
"resolution": "1080P"
}
Then run:
curl -s "https://api.minimax.io/v1/video_generation" -X POST -H "Authorization: Bearer $(printenv MINIMAX_API_KEY)" -H "Content-Type: application/json" -d @/tmp/minimax_request.json | jq '.task_id'
Provide first_frame_image as URL or base64-encoded image.
11. Function Calling (Tools)
Use tools with chat:
Write to /tmp/minimax_request.json:
{
"model": "MiniMax-Text-01",
"messages": [
{"role": "user", "content": "What is the weather in Beijing?"}
],
"tools": [
{
"type": "function",
"function": {
"name": "get_weather",
"description": "Get weather for a location",
"parameters": {
"type": "object",
"properties": {
"location": {"type": "string", "description": "City name"}
},
"required": ["location"]
}
}
}
],
"tool_choice": "auto"
}
Then run:
curl -s "https://api.minimax.io/v1/text/chatcompletion_v2" -X POST -H "Authorization: Bearer $(printenv MINIMAX_API_KEY)" -H "Content-Type: application/json" -d @/tmp/minimax_request.json | jq '.choices[0]'
Response Format
Chat Completion
{
"id": "string",
"choices": [{
"message": {
"role": "assistant",
"content": "Response text",
"reasoning_content": "Thought process (M1/M2 only)"
},
"finish_reason": "stop"
}],
"usage": {
"prompt_tokens": 10,
"completion_tokens": 50,
"total_tokens": 60
}
}
Guidelines
- Use correct host: China uses
api.minimax.io, global usesapi.minimaxi.chat - Streaming for reasoning: M1/M2 models work best with
stream: true - Camera syntax: Video commands go in
[brackets]within prompts - Emotion in TTS: Only works with
speech-02-*andspeech-01-*models - Async video: Video generation returns task ID - poll for completion
- Chinese optimized: MiniMax excels at Chinese language tasks
Recommended Agent Skills
Expand your agent's capabilities with these related and highly-rated skills.
brave-search
Brave Search API for web search. Use when user says "search web", "Brave search", or asks to "find on web" without specifying Google.
supadata
Supadata API for YouTube/web data. Use when user mentions "Supadata", "YouTube data", "channel stats", or web scraping data.
roadmap-planning
Build and prioritize product roadmaps using scoring models like RICE, ICE, and value-effort matrices. Activate when creating a product roadmap, prioritizing features, sequencing initiatives, mapping dependencies, balancing team capacity, choosing between Now/Next/Later or quarterly planning, or communicating roadmap tradeoffs to executives and stakeholders.
qdrant
Qdrant API for vector search. Use when user mentions "Qdrant", "vector database", "semantic search", or embeddings storage.
calendly
Calendly scheduling API. Use when user mentions "Calendly", "calendly.com", "schedule a meeting", "booking link", "event types", or asks about interview scheduling.
stripe
Stripe API for payments. Use when user mentions "Stripe", "payment", "subscription", "billing", "invoice", or asks about payment processing.
Didn't find tool you were looking for?