Agent skill

screenshot-annotator

Add manual-style annotations (red boxes, arrows, callouts, highlights) to screenshots for technical documentation. Use when creating user manuals, tutorials, or guides that need visual indicators pointing to UI elements.

Stars 163
Forks 31

Install this agent skill to your Project

npx add-skill https://github.com/majiayu000/claude-skill-registry/tree/main/skills/data/screenshot-annotator

SKILL.md

Screenshot Annotator

Add annotations to screenshots without modifying the original image. Annotations are overlaid on top.

Workflow

  1. Analyze the screenshot to identify the target element
  2. Generate annotation overlay using Gemini Vision API
  3. Output annotated image as a separate file

Usage

bash
python scripts/annotate.py "{image_path}" "{instruction}" --style "{style}" --text "{label}" --output "{output_path}"

Parameters

Parameter Required Default Description
image_path Yes - Path to screenshot
instruction Yes - What to annotate (e.g., "the Login button")
--style No red_box Annotation style
--text No - Text label to add
--output No auto Output path

Styles

Style Description
red_box Red rectangle + arrow (default)
arrow Red arrow pointing to element
callout Speech bubble with text
highlight Semi-transparent yellow overlay
circle Red circle around element
number Numbered marker for steps

Examples

bash
# Basic annotation
python scripts/annotate.py "login.png" "the Login button"

# With text label
python scripts/annotate.py "settings.png" "the gear icon" --text "Click here"

# Callout style
python scripts/annotate.py "form.png" "email field" --style callout --text "Enter your email"

Requirements

  • GEMINI_API_KEY or GOOGLE_API_KEY in environment
  • Python packages: google-genai, Pillow, python-dotenv

Expand your agent's capabilities with these related and highly-rated skills.

Didn't find tool you were looking for?

Be as detailed as possible for better results