Agent skill

openai-whisper-api

Transcribe audio via the OpenAI Audio Transcriptions API (Whisper).

Stars 355,710
Forks 72,004

Install this agent skill in your project

npx add-skill https://github.com/openclaw/openclaw/tree/main/skills/openai-whisper-api

Metadata

Additional technical details for this skill

openclaw
{
    "emoji": "\ud83c\udf10",
    "install": [
        {
            "id": "brew",
            "bins": [
                "curl"
            ],
            "kind": "brew",
            "label": "Install curl (brew)",
            "formula": "curl"
        }
    ],
    "requires": {
        "env": [
            "OPENAI_API_KEY"
        ],
        "bins": [
            "curl"
        ]
    },
    "primaryEnv": "OPENAI_API_KEY"
}

SKILL.md

OpenAI Whisper API (curl)

Transcribe an audio file via OpenAI’s /v1/audio/transcriptions endpoint. Set OPENAI_BASE_URL to use an OpenAI-compatible proxy or local gateway.
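As a rough illustration of the request described above, the following sketch resolves the endpoint and uploads one file with curl. The function names `transcription_url` and `transcribe_file` are hypothetical and not part of the skill, and the fallback to `https://api.openai.com/v1` when OPENAI_BASE_URL is unset is an assumption about how the bundled script behaves. The form fields (`file`, `model`, `response_format`) are those of the public transcriptions endpoint.

```bash
#!/usr/bin/env bash
# Sketch only: helper names are hypothetical, not taken from transcribe.sh.

# Resolve the transcription endpoint. Assumes the public OpenAI base URL
# when OPENAI_BASE_URL is unset or empty.
transcription_url() {
  local base="${OPENAI_BASE_URL:-https://api.openai.com/v1}"
  base="${base%/}"   # drop one trailing slash so the join never yields "//"
  printf '%s/audio/transcriptions\n' "$base"
}

# Upload one audio file and print the transcript as plain text.
transcribe_file() {
  local audio="$1"
  if [ -z "${OPENAI_API_KEY:-}" ]; then
    echo "OPENAI_API_KEY is not set" >&2
    return 1
  fi
  curl -sS --fail "$(transcription_url)" \
    -H "Authorization: Bearer ${OPENAI_API_KEY}" \
    -F "file=@${audio}" \
    -F "model=whisper-1" \
    -F "response_format=text"
}
```

Under these assumptions, `transcribe_file /path/to/audio.m4a > transcript.txt` would write the transcript to a file of your choosing.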

Quick start

bash
{baseDir}/scripts/transcribe.sh /path/to/audio.m4a

Defaults:

  • Model: whisper-1
  • Output: <input>.txt

Useful flags

bash
{baseDir}/scripts/transcribe.sh /path/to/audio.ogg --model whisper-1 --out /tmp/transcript.txt
{baseDir}/scripts/transcribe.sh /path/to/audio.m4a --language en
{baseDir}/scripts/transcribe.sh /path/to/audio.m4a --prompt "Speaker names: Peter, Daniel"
{baseDir}/scripts/transcribe.sh /path/to/audio.m4a --json --out /tmp/transcript.json
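To make the flags above more concrete, here is a minimal sketch of how they might translate into multipart form fields for the API. It is not the source of transcribe.sh: the function name `build_form_args` is hypothetical, and the choice of `response_format=text` as the default (with `--json` switching to `json`) is an assumption inferred from the default `.txt` output. The field names `model`, `language`, `prompt`, and `response_format` are those of the public transcriptions endpoint.

```bash
#!/usr/bin/env bash
# Sketch only: an assumed mapping from CLI flags to form fields.

# Print one form field per line for the given audio file and flags.
build_form_args() {
  local file="$1"; shift
  local model="whisper-1" language="" prompt="" format="text"
  while [ "$#" -gt 0 ]; do
    case "$1" in
      --json)
        format="json"; shift; continue ;;
      --model|--language|--prompt|--out)
        # Every remaining flag takes a value; fail early if it is missing.
        [ "$#" -ge 2 ] || { echo "missing value for $1" >&2; return 2; } ;;
      *)
        echo "unknown flag: $1" >&2; return 2 ;;
    esac
    case "$1" in
      --model)    model="$2" ;;
      --language) language="$2" ;;
      --prompt)   prompt="$2" ;;
      --out)      : ;;  # output path is local only; nothing is sent to the API
    esac
    shift 2
  done
  printf '%s\n' "file=@${file}" "model=${model}" "response_format=${format}"
  if [ -n "$language" ]; then printf '%s\n' "language=${language}"; fi
  if [ -n "$prompt" ]; then printf '%s\n' "prompt=${prompt}"; fi
  return 0
}
```

Each printed line could then be passed to curl as its own `-F` argument. For free-text values such as the prompt, curl's `--form-string` may be the safer choice, since it does not give special meaning to a leading `@` or `<`.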

API key

Optionally set OPENAI_BASE_URL (for example http://127.0.0.1:51805/v1) to use an OpenAI-compatible proxy or local gateway. Set OPENAI_API_KEY in the environment, or configure the key in ~/.openclaw/openclaw.json:

json5
{
  skills: {
    "openai-whisper-api": {
      apiKey: "OPENAI_KEY_HERE",
    },
  },
}
