Agent skill

genmedia-producer

Expert media production assistant. Use when requested to help with storyboarding, podcast creation, audio assembly, or complex multi-step media workflows using the GenMedia MCP servers (Veo, Lyria, Gemini TTS, NanoBanana).

Stars 1,034
Forks 330

Install this agent skill to your Project

npx add-skill https://github.com/GoogleCloudPlatform/vertex-ai-creative-studio/tree/main/experiments/mcp-genmedia/skills/genmedia-producer

Metadata

Additional technical details for this skill

SKILL.md

GenMedia Producer Skill

You are a highly capable media production assistant. Use this skill when asked to help with storyboarding, podcast creation, or complex multi-step media workflows using the Google GenMedia MCP servers.

Core Audio Production Workflow

  1. Script Preparation: Remove markdown formatting (*, #) and replace structure with spoken language.
  2. Generation: Gemini TTS is the preferred tool for high-fidelity speech synthesis. Use gemini_audio_tts for core synthesis. Fallback to chirp_tts for specialized voices. For long text, split into manageable chunks.
  3. Assembly: Use ffmpeg_concatenate_media_files to assemble mixed-source audio.
  4. Bumpers: Create 5-second intro/outro music using lyria_generate_music (with the lyria-3-clip-preview model), and ensure a smooth transition with afade.

Storyboarding

For video >8 seconds, construct a scene-by-scene narrative that can be segmented into 5-8 second clips. Use nanobanana_image_generation to create visual references for each scene.

Veo Video Generation (Veo 3.1)

  • Use the Five-Part Formula for prompts: Cinematography, Subject, Action, Context, and Style.
  • Soundstage Direction: Use quotation marks for dialogue and specific labels (e.g., [loud thunder]) for sound effects.
  • Advanced Modalities: Use veo_first_last_to_video for transitions, veo_ingredients_to_video for character/style consistency across scenes, and veo-3.1-lite-generate-001 for faster, 720p/1080p generation.
  • If a request times out, retry once. If it fails again, reduce the duration parameter and inform the user.
  • For voiceovers, ensure the video total runtime matches the audio duration (use ffmpeg_get_media_info).
  • The bucket parameter must be a full GCS URI (gs://...).

Expand your agent's capabilities with these related and highly-rated skills.

GoogleCloudPlatform/vertex-ai-creative-studio

genmedia-audio-engineer

Expert in audio synthesis, music generation, and mixing. Use when creating podcasts, background scores, or multi-track audio layering using mcp-chirp3-go, mcp-lyria-go, mcp-gemini-go, mcp-nanobanana-go, and mcp-avtool-go.

1,034 330
Explore
GoogleCloudPlatform/vertex-ai-creative-studio

agent-aware-cli

Guide for designing and implementing command-line interfaces (CLIs) that are equally usable by human developers and automated coding agents. Use when the user wants to build a CLI, apply CLI best practices, or use Go with Cobra and Viper.

1,034 330
Explore
GoogleCloudPlatform/vertex-ai-creative-studio

genmedia-voice-director

Expert in casting, directing, and generating expressive text-to-speech using Gemini TTS. Use this when the user needs virtual voice actor personas, expressive speech generation, or multiple variations of a voiceover (like "take 3 on the bounce").

1,034 330
Explore
GoogleCloudPlatform/vertex-ai-creative-studio

genmedia-image-artist

Expert in AI image generation and editing. Use when the user needs high-quality textures, character-consistent visuals, or image-to-image editing using mcp-nanobanana-go.

1,034 330
Explore
GoogleCloudPlatform/vertex-ai-creative-studio

genmedia-video-editor

Expert in video composition, editing, and format conversion. Use when the user wants to generate high-quality video, overlay images on video, concatenate clips, create GIFs, or sync audio to video using mcp-avtool-go and mcp-veo-go.

1,034 330
Explore
GoogleCloudPlatform/vertex-ai-creative-studio

genmedia-producer

Expert media production assistant. Use when requested to help with storyboarding, podcast creation, audio assembly, or using the GenMedia MCP tools (Veo, Lyria, Gemini TTS, NanoBanana).

1,034 330
Explore

Didn't find tool you were looking for?

Be as detailed as possible for better results