Agent skill

muapi-cinema-director

Direct high-fidelity cinematic video with AI — translates creative intent into technical cinematographic directives for Veo3, Kling, and Luma video models via muapi.ai

Stars 3,018
Forks 333

Install this agent skill to your Project

npx add-skill https://github.com/SamurAIGPT/Generative-Media-Skills/tree/main/library/motion/cinema-director

SKILL.md

🎬 AI Cinema Director Skill

A specialized skill for AI Agents to direct high-fidelity cinematic video. The Cinema Director skill translates high-level creative intent into technical cinematographic directives for state-of-the-art video models (Veo3, Kling, Luma).

Core Competencies

  1. Shot Composition Analysis: Mapping emotional beats to appropriate framing (e.g., Extreme Close-Up for intimacy, Wide Shot for isolation).
  2. Camera Movement Orchestration: Directing complex physical movements (Dolly, Truck, Crane) and lens-based effects (Rack Focus, Dolly Zoom).
  3. Lighting & Atmosphere Design: Specifying temporal and stylistic lighting (Golden Hour, Chiaroscuro, Volumetric God Rays).
  4. Technical Parameter Optimization: Automatically selecting optimal frame rates, aspect ratios, and model-specific biases.

🏗️ Technical Specification

1. Intent Mapping Table

Creative Intent Framing Movement Lighting
Heroic Reveal Low Angle / Wide Crane Up / Orbit Rim Lighting / High Contrast
Tense/Uneasy Dutch Angle Handheld Shake Low Key / Harsh Shadows
Introspective Close-Up Slow Push In Soft Rembrandt / Window Light
Majestic/Epic Extreme Wide Drone Flyover Golden Hour / Volumetric
Melancholic Profile / Medium Slow Pull Out Blue Hour / Desaturated

2. Physical Camera Movements

  • Dolly In/Out: Physical camera movement on a track toward/away from the subject.
  • Truck Left/Right: Lateral physical movement.
  • Crane/Jib: Sweeping vertical movement from a height.
  • Orbit: Circular movement around a center point.
  • Pedestal: Vertical elevation change (without tilting).

3. Lens & Optical Controls

  • Shallow DOF: Background blur (Bokeh).
  • Anamorphic: Horizontal flares and wide-screen cinematic feel.
  • Rack Focus: Shifting focus between planes within the shot.

🧠 Prompt Optimization Protocol (Agent Instruction)

Before calling the script, the Agent MUST expand the user's raw prompt using these Director's Rules:

  1. TECHNICAL INFUSION: Transform "cool action" into a technical shot.
    • Formula: [Shot Type] + [Subject/Action] + [Environment] + [Lighting] + [Camera Movement] + [Lens Effect]
  2. MOTION DYNAMICS: Use cinematic verbs: Dolly In (intimacy), Crane Up (majestic), Orbit (heroic), Truck (parallel motion).
  3. LIGHTING RECIPES: Apply specific illumination: Volumetric God Rays, Teal-and-Orange Grade, Cyberpunk Rim Lighting, Rembrandt Portrait Lighting.
  4. PHYSICS LOGIC: Describe light relationships (e.g., "Neon reflections shimmering on rain-slicked asphalt") to trigger model reasoning.

🚀 Protocol: Using the Cinema Director

Step 1: Define the Creative Brief

Provide the agent with a subject and a "Director's Intent."

Step 2: Invoke the Script

The generate-film.sh script accepts a --brief which it expands using its internal knowledge of cinematography.

bash
# Directing a scene
bash scripts/generate-film.sh 
  --subject "A lone samurai in a blizzard" 
  --intent "epic reveal" 
  --model "kling-master"

Step 3: Handle the Async Response

Video generation is asynchronous. Use the returned request_id to poll for completion via core/platform/check-result.sh.


⚠️ Constraints & Guardrails

  • Temporal Consistency: Avoid complex subject transformations in a single shot (e.g., "man turns into a bird").
  • Movement Collisions: Do not combine contradictory movements (e.g., "Dolly In" and "Dolly Out" simultaneously).
  • Physical Realism: Prefer movements possible with real-world equipment for a more professional "film" look.
  • Model Bias:
    • Veo3: Best for slow, high-quality aesthetic shots.
    • Kling: Best for complex character motion and physics.
    • Luma: Best for fast-paced, high-action cinematic sequences.

⚙️ Implementation Details

This skill acts as an "Expert Translator" for the core/media/generate-video.sh primitive. It maintains a dictionary of cinematic styles and injects technical directives into the prompt before execution.

Expand your agent's capabilities with these related and highly-rated skills.

SamurAIGPT/Generative-Media-Skills

muapi-platform

Setup and utility scripts for muapi.ai — configure API keys, test connectivity, and poll for async generation results

3,018 333
Explore
SamurAIGPT/Generative-Media-Skills

muapi-media-generation

Generate AI images, videos, music, and audio from the terminal via muapi.ai — supports 100+ models including Flux, Midjourney v7, Kling 3.0, Veo3, and Suno V5

3,018 333
Explore
SamurAIGPT/Generative-Media-Skills

muapi-media-editing

Edit and enhance images and videos with AI via muapi.ai — prompt-based editing, upscaling, background removal, face swap, lipsync, video effects, and more

3,018 333
Explore
SamurAIGPT/Generative-Media-Skills

muapi-seedance-2

Expert Cinema Director skill for Seedance 2.0 (ByteDance) — high-fidelity video generation using technical camera grammar and multimodal references. Supports text-to-video, image-to-video, video extension, beat-matching, dialogue, and e-commerce patterns.

3,018 333
Explore
SamurAIGPT/Generative-Media-Skills

muapi-workflow

Build, run, and visualize multi-step AI generation workflows. The AI architect translates natural language descriptions into connected node graphs — chain image generation, video creation, enhancement, and editing into automated pipelines.

3,018 333
Explore
SamurAIGPT/Generative-Media-Skills

muapi-ui-design

Generate high-fidelity UI/UX mockups for mobile and web apps using Atomic Design principles — creates wireframes and design systems via muapi.ai

3,018 333
Explore

Didn't find tool you were looking for?

Be as detailed as possible for better results