Agent skill

screenshot-analyzer

Analyze screenshots for error diagnosis and tutorial generation using Gemini Vision API. Detects errors, suggests solutions, or generates step-by-step operation guides. Use when debugging from screenshots or creating how-to documentation.

Stars 163
Forks 31

Install this agent skill to your Project

npx add-skill https://github.com/majiayu000/claude-skill-registry/tree/main/skills/data/screenshot-analyzer

SKILL.md

Screenshot Analyzer

Analyze screenshots for error diagnosis or operation tutorial generation.

Modes

  1. Analyze: Detect errors, identify causes, suggest solutions
  2. Tutorial: Generate step-by-step operation guides with annotations

Usage

bash
# Error analysis (default)
python scripts/analyze.py "{screenshot}" --mode analyze

# Tutorial generation
python scripts/analyze.py "{screenshot}" --mode tutorial

Parameters

Parameter Required Default Description
screenshot Yes - Path to screenshot
--mode, -m No analyze Mode: analyze or tutorial
--output, -o No auto Output HTML path
--no-annotate No false Skip annotation generation

Output

Analyze Mode

  • Screen description
  • Error detection (message, type, location)
  • Root cause analysis
  • Solution suggestions (NextStep)

Tutorial Mode

  • Screen overview
  • Numbered operation steps
  • Annotated screenshots per step
  • Tips and warnings

Examples

bash
# Analyze error screenshot
python scripts/analyze.py "error_console.png"

# Generate tutorial
python scripts/analyze.py "settings_menu.png" --mode tutorial

# Specify output
python scripts/analyze.py "login.png" --mode tutorial --output "docs/login_guide.html"

# Skip annotations
python scripts/analyze.py "error.png" --no-annotate

Requirements

  • GEMINI_API_KEY or GOOGLE_API_KEY in environment
  • Python packages: google-genai, Pillow, python-dotenv

Expand your agent's capabilities with these related and highly-rated skills.

Didn't find tool you were looking for?

Be as detailed as possible for better results