Transcribe Vaam Video with Gemini
Transcribe Vaam videos using Google Gemini AI. Takes a Vaam share URL, downloads the video, and returns a full text transcription. Supports any language without translation.
Install this agent skill to your Project
npx add-skill https://github.com/majiayu000/claude-skill-registry/tree/main/skills/data/transcribe-vaam-video-with-gemini
SKILL.md
IMPORTANT - Path Resolution:
This skill can be installed in different locations. Before executing any commands, determine the skill directory based on where you loaded this SKILL.md file, and use that path in all commands below. Replace $SKILL_DIR with the actual discovered path.
Transcribe Vaam Video with Gemini
Transcribe Vaam video recordings using Google Gemini AI. Provide a Vaam share URL and get back a complete text transcription.
Setup (First Time Only)
cd $SKILL_DIR && bun install
This installs the required dependencies (Google Generative AI SDK).
Requirements
Environment Variable:
- GEMINI_API_KEY: Required. Your Google Gemini API key.
  - Get one at: https://aistudio.google.com/apikey
Usage
cd $SKILL_DIR && bun lib/transcribe-with-gemini.ts <vaam-url>
Options
| Option | Description |
|---|---|
| --help, -h | Show help message |
| --verbose, -v | Enable progress logging (only use for debugging) |
Examples
# Basic transcription
cd $SKILL_DIR && bun lib/transcribe-with-gemini.ts https://app.vaam.io/share/abc123
# With verbose output (shows progress)
cd $SKILL_DIR && bun lib/transcribe-with-gemini.ts --verbose https://app.vaam.io/share/abc123
Input
- Vaam Share URL: Must be in the format https://app.vaam.io/share/[id]
  - The ID can contain letters, numbers, and hyphens
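The format rule above can be checked with a single regular expression. This is a minimal sketch, not the skill's actual implementation: the helper name `parseVaamShareUrl` and the exact pattern are assumptions derived from the documented format.

```typescript
// Assumed pattern for https://app.vaam.io/share/[id], where the ID
// consists of letters, numbers, and hyphens (per the Input section).
const VAAM_SHARE_URL = /^https:\/\/app\.vaam\.io\/share\/([A-Za-z0-9-]+)$/;

// Hypothetical helper: returns the capture ID, or null if the URL
// does not match the documented format.
function parseVaamShareUrl(url: string): string | null {
  const match = VAAM_SHARE_URL.exec(url.trim());
  return match?.[1] ?? null;
}
```

A `null` result corresponds to the case the skill reports as `INVALID_URL`. The real script may accept or reject edge cases (trailing slashes, query strings) differently from this sketch.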
Output
On Success:
- Plain text transcription to stdout
- Exit code: 0
On Error:
- JSON error object to stdout with structure:
  {
    "success": false,
    "error": {
      "code": "ERROR_CODE",
      "message": "Human-readable message",
      "details": "Technical details",
      "suggestion": "How to fix it"
    }
  }
- Exit code: 1 (validation errors) or 2 (runtime errors)
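Because success and failure both write to stdout, a caller has to branch on the exit code before deciding how to read the output. The sketch below shows one way to do that. The names `SkillErrorOutput`, `SkillResult`, and `interpretResult` are hypothetical, and the `UNPARSEABLE_OUTPUT` fallback code is invented for this example; it is not one of the skill's error codes.

```typescript
// Mirrors the documented error object structure.
interface SkillErrorOutput {
  success: false;
  error: { code: string; message: string; details: string; suggestion: string };
}

type SkillResult =
  | { ok: true; transcription: string }
  | { ok: false; exitCode: number; error: SkillErrorOutput["error"] };

// Hypothetical helper: interprets the script's exit code and stdout.
function interpretResult(exitCode: number, stdout: string): SkillResult {
  if (exitCode === 0) {
    // Success: stdout is the plain-text transcription.
    return { ok: true, transcription: stdout.trim() };
  }
  try {
    const parsed = JSON.parse(stdout) as Partial<SkillErrorOutput>;
    if (parsed && parsed.error && typeof parsed.error.code === "string") {
      return { ok: false, exitCode, error: parsed.error };
    }
  } catch {
    // stdout was not valid JSON; fall through to the generic error.
  }
  return {
    ok: false,
    exitCode,
    error: {
      code: "UNPARSEABLE_OUTPUT", // invented for this sketch
      message: "The script failed without a readable error object",
      details: stdout,
      suggestion: "Re-run with --verbose to see progress logging",
    },
  };
}
```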
Error Codes
| Code | Description | Exit Code |
|---|---|---|
| MISSING_ARGUMENT | No URL provided | 1 |
| INVALID_URL | URL doesn't match expected format | 1 |
| MISSING_API_KEY | GEMINI_API_KEY not set | 1 |
| VIDEO_EXTRACTION_FAILED | Couldn't get video from Vaam | 2 |
| VIDEO_DOWNLOAD_FAILED | Couldn't download video file | 2 |
| TRANSCRIPTION_FAILED | Gemini API error | 2 |
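For callers that want to handle these codes in a type-safe way, the table can be written as a lookup. This is only a sketch of the documented mapping; the names `EXIT_CODES`, `ErrorCode`, and `isRuntimeError` are hypothetical and do not come from the skill's source.

```typescript
// Error code to exit code, as listed in the table above.
const EXIT_CODES = {
  MISSING_ARGUMENT: 1,
  INVALID_URL: 1,
  MISSING_API_KEY: 1,
  VIDEO_EXTRACTION_FAILED: 2,
  VIDEO_DOWNLOAD_FAILED: 2,
  TRANSCRIPTION_FAILED: 2,
} as const;

type ErrorCode = keyof typeof EXIT_CODES;

// Exit code 1 marks validation errors (fix the input or environment);
// exit code 2 marks runtime errors (Vaam, download, or Gemini failures).
function isRuntimeError(code: ErrorCode): boolean {
  return EXIT_CODES[code] === 2;
}
```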
How It Works
- Parses the Vaam share URL to extract the capture ID
- Fetches the video URL from Vaam's API
- Downloads the video to a temporary file
- Sends the video to Gemini for transcription
- Returns the transcription text
- Cleans up temporary files
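The steps above can be sketched as a small pipeline in which cleanup sits in a `finally` block, so the temporary file is removed whether or not transcription succeeds. The real script's internals are not shown in this document, so every name here (`PipelineSteps`, `transcribeVaamVideo`, and each step function) is hypothetical, and the Vaam and Gemini calls are left as injected functions rather than guessed at.

```typescript
// Hypothetical step signatures standing in for the script's internals.
interface PipelineSteps {
  parseCaptureId(shareUrl: string): string;
  fetchVideoUrl(captureId: string): Promise<string>;
  downloadToTempFile(videoUrl: string): Promise<string>; // resolves to a temp file path
  transcribe(filePath: string): Promise<string>;
  removeTempFile(filePath: string): Promise<void>;
}

async function transcribeVaamVideo(
  shareUrl: string,
  steps: PipelineSteps,
): Promise<string> {
  const captureId = steps.parseCaptureId(shareUrl);
  const videoUrl = await steps.fetchVideoUrl(captureId);
  const tempPath = await steps.downloadToTempFile(videoUrl);
  try {
    return await steps.transcribe(tempPath);
  } finally {
    // Runs after transcription resolves or rejects.
    await steps.removeTempFile(tempPath);
  }
}
```

Injecting the steps keeps the control flow testable without network access or an API key. Under this sketch, a failed download leaves nothing for the pipeline to clean up, so the download step would be responsible for removing any partial file it created.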
Notes
- Videos are transcribed in their original language (no automatic translation)
- Includes spoken content and important visual information shown on screen
- Timestamps are not included in the transcription
- Temporary files are automatically cleaned up after processing
- Gemini supports videos up to 2GB