Agent skill

media-processing

Process multimedia files with FFmpeg (video/audio encoding, conversion, streaming, filtering, hardware acceleration), ImageMagick (image manipulation, format conversion, batch processing, effects, composition), and RMBG (AI-powered background removal). Use when converting media formats, encoding videos with specific codecs (H.264, H.265, VP9), resizing/cropping images, removing backgrounds from images, extracting audio from video, applying filters and effects, optimizing file sizes, creating streaming manifests (HLS/DASH), generating thumbnails, batch processing images, creating composite images, or implementing media processing pipelines. Supports 100+ formats, hardware acceleration (NVENC, QSV), and complex filtergraphs.

Stars 65
Forks 23

Install this agent skill to your Project

npx add-skill https://github.com/binjuhor/shadcn-lar/tree/main/.claude/skills/media-processing

SKILL.md

Media Processing Skill

Process video, audio, and images using FFmpeg, ImageMagick, and RMBG CLI tools.

Tool Selection

Task Tool Reason
Video encoding/conversion FFmpeg Native codec support, streaming
Audio extraction/conversion FFmpeg Direct stream manipulation
Image resize/effects ImageMagick Optimized for still images
Background removal RMBG AI-powered, local processing
Batch images ImageMagick mogrify for in-place edits
Video thumbnails FFmpeg Frame extraction built-in
GIF creation FFmpeg/ImageMagick FFmpeg for video, ImageMagick for images

Installation

bash
# macOS
brew install ffmpeg imagemagick
npm install -g rmbg-cli

# Ubuntu/Debian
sudo apt-get install ffmpeg imagemagick
npm install -g rmbg-cli

# Verify
ffmpeg -version && magick -version && rmbg --version

Essential Commands

bash
# Video: Convert/re-encode
ffmpeg -i input.mkv -c copy output.mp4
ffmpeg -i input.avi -c:v libx264 -crf 22 -c:a aac output.mp4

# Video: Extract audio
ffmpeg -i video.mp4 -vn -c:a copy audio.m4a

# Image: Convert/resize
magick input.png output.jpg
magick input.jpg -resize 800x600 output.jpg

# Image: Batch resize
mogrify -resize 800x -quality 85 *.jpg

# Background removal
rmbg input.jpg                          # Basic (modnet)
rmbg input.jpg -m briaai -o output.png  # High quality
rmbg input.jpg -m u2netp -o output.png  # Fast

Key Parameters

FFmpeg:

  • -c:v libx264 - H.264 codec
  • -crf 22 - Quality (0-51, lower=better)
  • -preset slow - Speed/compression balance
  • -c:a aac - Audio codec

ImageMagick:

  • 800x600 - Fit within (maintains aspect)
  • 800x600^ - Fill (may crop)
  • -quality 85 - JPEG quality
  • -strip - Remove metadata

RMBG:

  • -m briaai - High quality model
  • -m u2netp - Fast model
  • -r 4096 - Max resolution

References

Detailed guides in references/:

  • ffmpeg-encoding.md - Codecs, quality, hardware acceleration
  • ffmpeg-streaming.md - HLS/DASH, live streaming
  • ffmpeg-filters.md - Filters, complex filtergraphs
  • imagemagick-editing.md - Effects, transformations
  • imagemagick-batch.md - Batch processing, parallel ops
  • rmbg-background-removal.md - AI models, CLI usage
  • common-workflows.md - Video optimization, responsive images, GIF creation
  • troubleshooting.md - Error fixes, performance tips
  • format-compatibility.md - Format support, codec recommendations

Expand your agent's capabilities with these related and highly-rated skills.

binjuhor/shadcn-lar

Debugging

Systematic debugging framework ensuring root cause investigation before fixes. Includes four-phase debugging process, backward call stack tracing, multi-layer validation, and verification protocols. Use when encountering bugs, test failures, unexpected behavior, performance issues, or before claiming work complete. Prevents random fixes, masks over symptoms, and false completion claims.

65 23
Explore
binjuhor/shadcn-lar

frontend-design-pro

Creates jaw-dropping, production-ready frontend interfaces AND delivers perfectly matched real photos (Unsplash/Pexels direct links) OR flawless custom image-generation prompts for hero images, backgrounds, and illustrations. Zero AI slop, zero fake URLs.

65 23
Explore
binjuhor/shadcn-lar

ui-ux-pro-max

Frontend UI/UX design intelligence - activate FIRST when user requests beautiful, stunning, gorgeous, or aesthetic interfaces. The primary skill for design decisions before implementation. 50 styles, 21 palettes, 50 font pairings, 20 charts, 8 stacks (React, Next.js, Vue, Svelte, SwiftUI, React Native, Flutter, Tailwind). Actions: plan, build, create, design, implement, review, fix, improve, optimize, enhance, refactor, check frontend UI/UX code. Projects: website, landing page, dashboard, admin panel, e-commerce, SaaS, portfolio, blog, mobile app, .html, .tsx, .vue, .svelte. Elements: button, modal, navbar, sidebar, card, table, form, chart. Styles: glassmorphism, claymorphism, minimalism, brutalism, neumorphism, bento grid, dark mode, responsive, skeuomorphism, flat design. Topics: color palette, accessibility, animation, layout, typography, font pairing, spacing, hover, shadow, gradient.

65 23
Explore
binjuhor/shadcn-lar

mcp-builder

Guide for creating high-quality MCP (Model Context Protocol) servers that enable LLMs to interact with external services through well-designed tools. Use when building MCP servers to integrate external APIs or services, whether in Python (FastMCP) or Node/TypeScript (MCP SDK).

65 23
Explore
binjuhor/shadcn-lar

planning

Use when you need to plan technical solutions that are scalable, secure, and maintainable.

65 23
Explore
binjuhor/shadcn-lar

Problem-Solving Techniques

Apply systematic problem-solving techniques for complexity spirals (simplification cascades), innovation blocks (collision-zone thinking), recurring patterns (meta-pattern recognition), assumption constraints (inversion exercise), scale uncertainty (scale game), and dispatch when stuck. Techniques derived from Microsoft Amplifier project patterns adapted for immediate application.

65 23
Explore

Didn't find tool you were looking for?

Be as detailed as possible for better results