Agent skill
openai-whisper-api
Transcribe audio via OpenAI Audio Transcriptions API (Whisper).
Stars
355,710
Forks
72,004
Install this agent skill in your project
npx add-skill https://github.com/openclaw/openclaw/tree/main/skills/openai-whisper-api
Metadata
Additional technical details for this skill
- openclaw: { "emoji": "\ud83c\udf10", "install": [ { "id": "brew", "bins": [ "curl" ], "kind": "brew", "label": "Install curl (brew)", "formula": "curl" } ], "requires": { "env": [ "OPENAI_API_KEY" ], "bins": [ "curl" ] }, "primaryEnv": "OPENAI_API_KEY" }
SKILL.md
OpenAI Whisper API (curl)
Transcribe an audio file via OpenAI’s /v1/audio/transcriptions endpoint. Set OPENAI_BASE_URL to use an OpenAI-compatible proxy or local gateway.
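For orientation, a direct call to the endpoint might look like the sketch below. It is not the skill's actual script: the function names `transcription_url` and `transcribe_once` are hypothetical, and the default base URL `https://api.openai.com/v1` is an assumption used when OPENAI_BASE_URL is unset.

```shell
# Sketch only: helper names are hypothetical, not taken from the skill's script.

# Resolve the endpoint URL, honoring OPENAI_BASE_URL when it is set.
# Assumption: the base URL already ends in /v1, as in the proxy example.
transcription_url() {
  local base="${OPENAI_BASE_URL:-https://api.openai.com/v1}"
  # Strip one trailing slash so the joined URL has no double slash.
  printf '%s/audio/transcriptions\n' "${base%/}"
}

# Upload one audio file as multipart form data and print the plain-text transcript.
transcribe_once() {
  local audio="$1"
  curl -sS "$(transcription_url)" \
    -H "Authorization: Bearer ${OPENAI_API_KEY:?OPENAI_API_KEY is not set}" \
    -F "file=@${audio}" \
    -F "model=whisper-1" \
    -F "response_format=text"
}

# Usage (performs a network request):
#   transcribe_once /path/to/audio.m4a > /tmp/transcript.txt
```

Keeping URL resolution in its own function makes the proxy override easy to check without sending a request.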
Quick start
bash
{baseDir}/scripts/transcribe.sh /path/to/audio.m4a
Defaults:
- Model: whisper-1
- Output: <input>.txt
Useful flags
bash
{baseDir}/scripts/transcribe.sh /path/to/audio.ogg --model whisper-1 --out /tmp/transcript.txt
{baseDir}/scripts/transcribe.sh /path/to/audio.m4a --language en
{baseDir}/scripts/transcribe.sh /path/to/audio.m4a --prompt "Speaker names: Peter, Daniel"
{baseDir}/scripts/transcribe.sh /path/to/audio.m4a --json --out /tmp/transcript.json
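The flags above plausibly correspond to form fields of the transcription request (`model`, `language`, `prompt`, `response_format`). The sketch below illustrates one such mapping; the helper `form_fields` is hypothetical, and the assumptions that `--json` selects `response_format=json` and that the default is `text` are inferred from the `.txt` default output, not confirmed by the script.

```shell
# Sketch only: form_fields is a hypothetical helper, not part of the skill.
# It prints one multipart form field per line for the given flags.
form_fields() {
  local model="whisper-1" format="text" language="" prompt=""
  while [ "$#" -gt 0 ]; do
    case "$1" in
      --model|--language|--prompt|--out)
        # Flags that take a value: refuse to continue if the value is missing.
        [ "$#" -ge 2 ] || { echo "missing value for $1" >&2; return 1; }
        case "$1" in
          --model)    model="$2" ;;
          --language) language="$2" ;;
          --prompt)   prompt="$2" ;;
          --out)      : ;;  # local output path, never sent to the API
        esac
        shift 2 ;;
      --json)
        format="json"; shift ;;
      *)
        echo "unknown flag: $1" >&2; return 1 ;;
    esac
  done
  printf 'model=%s\n' "$model"
  printf 'response_format=%s\n' "$format"
  # Optional fields are emitted only when the caller supplied them.
  if [ -n "$language" ]; then printf 'language=%s\n' "$language"; fi
  if [ -n "$prompt" ]; then printf 'prompt=%s\n' "$prompt"; fi
}
```

Each printed line could then be passed to curl as a `-F` argument alongside `-F "file=@<path>"`.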
API key
Set OPENAI_API_KEY in the environment. Optionally set OPENAI_BASE_URL (for example http://127.0.0.1:51805/v1) to use an OpenAI-compatible proxy or local gateway. Alternatively, configure the key in ~/.openclaw/openclaw.json:
json5
{
  skills: {
    "openai-whisper-api": {
      apiKey: "OPENAI_KEY_HERE",
    },
  },
}
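Before transcribing, it can help to confirm the two requirements listed in the metadata: `curl` on the PATH and OPENAI_API_KEY in the environment. The sketch below is a minimal check under those assumptions. The function `preflight` and its optional binary-name argument are hypothetical, and it inspects only the environment variable; how a key stored in openclaw.json reaches the script is not covered here.

```shell
# Sketch only: preflight is a hypothetical helper, not part of the skill.
# Usage: preflight [binary]   (binary defaults to curl)
preflight() {
  local bin="${1:-curl}"
  # The skill metadata lists curl as a required binary.
  if ! command -v "$bin" >/dev/null 2>&1; then
    echo "$bin not found on PATH" >&2
    return 1
  fi
  # The skill metadata lists OPENAI_API_KEY as a required environment variable.
  if [ -z "${OPENAI_API_KEY:-}" ]; then
    echo "OPENAI_API_KEY is not set" >&2
    return 1
  fi
}

# Usage:
#   preflight && {baseDir}/scripts/transcribe.sh /path/to/audio.m4a
```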