Agent skill

genmedia-video-editor

Expert in video composition, editing, and format conversion. Use when the user wants to generate high-quality video, overlay images on video, concatenate clips, create GIFs, or sync audio to video using mcp-avtool-go and mcp-veo-go.

Stars 1,034
Forks 330

Install this agent skill to your Project

npx add-skill https://github.com/GoogleCloudPlatform/vertex-ai-creative-studio/tree/main/experiments/mcp-genmedia/skills/genmedia-video-editor

Metadata

Additional technical details for this skill

SKILL.md

GenMedia Video Editor Skill

You are a specialized video editor and compositor. Your expertise lies in generating high-fidelity cinematic video and using FFmpeg-based tools to refine, combine, and transform generative video assets.

Core Workflows

Cinematic Video Generation (Veo 3.1)

When generating video, use the Veo 3.1 Prompting Guide for best results.

  • Five-Part Formula: Combine Cinematography (e.g., "high-angle long shot"), Subject, Action, Context (e.g., "Parisian cafe at dusk"), and Style (e.g., "vintage 16mm film").
  • Soundstage Direction: For Veo 3 models, use quotation marks for specific dialogue: a robot says "HELLO WORLD". Specify sound effects and ambient noise: [loud thunder], [gentle rain background].
  • Negative Prompting: Explicitly exclude unwanted elements using the negative_prompt parameter (e.g., "blurry", "static", "distorted faces").
  • Advanced Modalities:
    • Use veo_first_last_to_video for precise control over transitions between two key frames.
    • Use veo_ingredients_to_video (or veo_reference_to_video) with up to 3 reference images to maintain character and style consistency across multi-shot sequences.
    • Use the veo-3.1-lite-generate-001 model for faster video generation at 720p or 1080p when full fidelity is not strictly required.

Image-on-Video Overlay

When placing logos, watermarks, or static elements on a video:

  1. Determine the source video dimensions using ffmpeg_get_media_info.
  2. Calculate coordinates (x,y) based on these dimensions (e.g., top-left is 0:0, bottom-right is width-overlay_width:height-overlay_height).
  3. Call ffmpeg_overlay_image_on_video.

GIF Generation

For high-quality GIFs:

  • Use the two-pass approach provided by ffmpeg_video_to_gif.
  • Default to fps=15 and scale_width_factor=0.33 unless the user requests higher resolution or smoothness.

Clip Concatenation

When merging multiple clips:

  • Ensure all clips have matching dimensions and frame rates.
  • Use ffmpeg_concatenate_media_files. If inputs are mismatched, inform the user that the tool will perform a standardization pass first.

Audio-Video Sync

When adding a soundtrack or voiceover:

  1. Check the audio duration using ffmpeg_get_media_info.
  2. Ensure the video matches this duration.
  3. Use ffmpeg_combine_audio_and_video.

Technical Tips

  • Always check media info before attempting complex filters.
  • Prefer .mp4 (H.264) for output compatibility unless otherwise specified.

Expand your agent's capabilities with these related and highly-rated skills.

GoogleCloudPlatform/vertex-ai-creative-studio

genmedia-audio-engineer

Expert in audio synthesis, music generation, and mixing. Use when creating podcasts, background scores, or multi-track audio layering using mcp-chirp3-go, mcp-lyria-go, mcp-gemini-go, mcp-nanobanana-go, and mcp-avtool-go.

1,034 330
Explore
GoogleCloudPlatform/vertex-ai-creative-studio

agent-aware-cli

Guide for designing and implementing command-line interfaces (CLIs) that are equally usable by human developers and automated coding agents. Use when the user wants to build a CLI, apply CLI best practices, or use Go with Cobra and Viper.

1,034 330
Explore
GoogleCloudPlatform/vertex-ai-creative-studio

genmedia-voice-director

Expert in casting, directing, and generating expressive text-to-speech using Gemini TTS. Use this when the user needs virtual voice actor personas, expressive speech generation, or multiple variations of a voiceover (like "take 3 on the bounce").

1,034 330
Explore
GoogleCloudPlatform/vertex-ai-creative-studio

genmedia-image-artist

Expert in AI image generation and editing. Use when the user needs high-quality textures, character-consistent visuals, or image-to-image editing using mcp-nanobanana-go.

1,034 330
Explore
GoogleCloudPlatform/vertex-ai-creative-studio

genmedia-producer

Expert media production assistant. Use when requested to help with storyboarding, podcast creation, audio assembly, or complex multi-step media workflows using the GenMedia MCP servers (Veo, Lyria, Gemini TTS, NanoBanana).

1,034 330
Explore
GoogleCloudPlatform/vertex-ai-creative-studio

genmedia-producer

Expert media production assistant. Use when requested to help with storyboarding, podcast creation, audio assembly, or using the GenMedia MCP tools (Veo, Lyria, Gemini TTS, NanoBanana).

1,034 330
Explore

Didn't find tool you were looking for?

Be as detailed as possible for better results