Agent skill

pr-review

Reviews code changes before merging. Use when reviewing PRs, checking staged changes, reviewing diffs, code review, merge readiness check, or validating changes before commit/push.

Stars 5
Forks 0

Install this agent skill to your Project

npx add-skill https://github.com/antoniocascais/claude-code-toolkit/tree/main/skills/pr-review

SKILL.md

PR Review Skill

Reviews code changes with focus on quality, security, and consistency.

Default Assumption: Public Repository

Unless explicitly stated otherwise, assume the repository is publicly available. This means:

  • Any secret, credential, or API key pushed is considered compromised
  • Internal URLs, IPs, hostnames should not be exposed
  • Comments with sensitive internal context should be flagged
  • Error messages should not leak internal architecture
  • Be extra cautious with .env files, config files, CI/CD configs

Phase 1: Determine Scope

STOP. Use AskUserQuestion before anything else.

Ask user to choose review scope:

  • Staged files only
  • Unstaged changes (working directory)
  • All uncommitted (staged + unstaged)
  • Current branch vs main (PR-style)
  • Specific commit or range
  • Other (specify)

Do NOT run any git commands or tools until user responds.

After selection, get the diff:

  • Staged: git diff --cached
  • Unstaged: git diff
  • All uncommitted: git diff HEAD
  • Branch vs main: git diff main...HEAD
  • Commit: git show <hash>
  • Range: git diff <from>..<to>

Also get changed files list: git diff --name-only <appropriate args>

Phase 2: Understand the Problem

STOP. Use AskUserQuestion to confirm before proceeding.

Infer intent from:

  1. Branch name: git branch --show-current
  2. Commit messages: git log main..HEAD --oneline (or relevant range)

Then use AskUserQuestion to confirm:

"Based on branch feature/xyz and commits, this PR appears to [inferred description]. Is this correct?"

  • Yes, proceed
  • No, let me explain

Do NOT proceed until user confirms.

Phase 3: Auto-Detect Stack

Check for presence of:

  • package.json / yarn.lock → Node.js
  • requirements.txt / pyproject.toml → Python
  • go.mod → Go
  • Cargo.toml → Rust
  • Dockerfile → Docker
  • *.tf → Terraform
  • *.yaml in k8s patterns → Kubernetes
  • .github/workflows/ → GitHub Actions

Note detected stack for context-aware analysis.

Phase 4: Run Scanners

Execute relevant scanners (skip silently if not installed):

Always run:

Tool Command
gitleaks gitleaks detect --source . --verbose --no-git
trufflehog trufflehog filesystem . --only-verified

Stack-specific:

Stack Tool Command
Node.js npm audit npm audit --json
Node.js yarn audit yarn audit --json
Python pip-audit pip-audit
Python safety safety check
Docker trivy trivy fs .
Docker hadolint hadolint Dockerfile
Terraform tfsec tfsec .
Terraform checkov checkov -d .
Terraform trivy trivy config .
K8s trivy trivy config .
Shell scripts shellcheck shellcheck <file>

Phase 5: Code Review

Analyze the diff for all categories. Be pragmatic—flag likely issues, skip obvious false positives.

5.1 Code Quality

  • Best practices for detected stack
  • Readability and maintainability
  • Error handling appropriateness
  • Test coverage (if tests exist)
  • Idiomatic patterns
  • Type safety issues

5.2 Codebase Consistency

  • Match existing patterns in the repo
  • Naming conventions alignment
  • File organization consistency
  • Don't introduce a 10th way of doing something

5.3 Security

Manual checks:

  • Hardcoded secrets, API keys, passwords, connection strings
  • SQL injection, XSS, command injection vectors
  • Path traversal risks
  • Auth/authz bypasses
  • Insecure defaults (http vs https, weak crypto)
  • Sensitive data in logs/errors/URLs
  • Container: running as root, privileged mode, unverified base images

5.4 Bug Detection

  • Logic errors, off-by-one
  • Null/undefined handling
  • Race conditions
  • Resource leaks (unclosed handles, connections)
  • Breaking changes to existing APIs

5.5 Dependencies

  • Known vulnerable package versions
  • Outdated dependencies with security patches
  • Unpinned versions
  • Suspicious or typosquatted package names

5.6 Performance

  • N+1 query patterns
  • Sync operations in async contexts
  • Unbounded loops/recursion
  • Memory leaks
  • Missing pagination
  • Blocking I/O in hot paths

5.7 Deprecations & Drift

  • Deprecated APIs, functions, patterns
  • Breaking changes in dependencies
  • Hardcoded values that should be variables
  • Environment-specific configs in shared code
  • Configuration diverging from IaC patterns

Phase 6: Report

Output a succinct markdown report:

markdown
## PR Review: [brief title]

**Problem:** [1-2 sentences on what this PR solves]

**Scope:** [staged/branch/commits reviewed]

**Stack:** [detected tech stack]

### Scanner Results
| Tool | Result |
|------|--------|
| gitleaks | [clean/N findings] |
| ... | ... |

### Findings

#### CRITICAL
- `file:line` - [issue with brief context]

#### HIGH
- `file:line` - [issue]

#### MEDIUM
- `file:line` - [issue]

#### LOW
- `file:line` - [issue]

### Summary
- Critical: X | High: X | Medium: X | Low: X

### Review Score: X/20
[One sentence justification]

### Action Required
| Priority | Item |
|----------|------|
| blocker | ... |
| should fix | ... |
| consider | ... |

Rating Scale

Score Meaning Action
0-10 Blocker issues Reject, needs significant rework
11-15 Acceptable Merge after addressing fixes
16-17 Good Ready to merge, suggestions optional
18-20 Excellent Merge immediately

Style Guidelines

Keep findings concise but contextual:

  • Bad: "should use https here"

  • Good: "http exposes data in transit, use https"

  • Bad: "fix this null check"

  • Good: "user.email accessed without null check - crashes if user not found"

Don't write a 50-page report. Focus on what matters.

Expand your agent's capabilities with these related and highly-rated skills.

antoniocascais/claude-code-toolkit

test-quality

Guides strong, effective unit test generation using proven testing techniques. Use when writing unit tests, reviewing test quality, improving existing tests, generating test cases, checking test coverage strength, or when tests exist but may be weak. Triggers on: unit test, test quality, test coverage, write tests, improve tests, review tests, test strength, mutation testing, boundary testing.

5 0
Explore
antoniocascais/claude-code-toolkit

skill-forge

Creates new Claude Code skills with proper structure and best practices. Use when user wants to create a skill, update an existing skill, add a new command, scaffold a workflow, define skill hooks, or asks "how do I make a skill".

5 0
Explore
antoniocascais/claude-code-toolkit

workflow-review

Reviews Claude Code sessions and proposes workflow improvements. Use when: (1) /workflow-review command, (2) "review my workflow", "how can I improve", (3) after long sessions when nudged, (4) start of session with pending review. Analyzes tool usage patterns, CLAUDE.md configuration, and compares against CC best practices. Proposes: CLAUDE.md updates, new skills, underused CC features. Saves session summaries to .claude/workflow-reviews/ for cross-session continuity.

5 0
Explore
antoniocascais/claude-code-toolkit

git-commit

Plans and executes git commits with optional TICKET_ID prefix. Analyzes staged changes, proposes optimal commit structure (single or multiple), generates descriptive messages with technical context, and executes after user approval. Use when committing code changes, creating atomic commits, or splitting large changesets.

5 0
Explore
antoniocascais/claude-code-toolkit

clarice

Conducts realistic mock interviews with detailed feedback and scoring. Use for interview prep, behavioral questions, technical interviews, STAR practice, system design interviews, or interview coaching.

5 0
Explore
antoniocascais/claude-code-toolkit

voice-mode

Activates voice conversation mode using Pocket TTS Docker container. Use when user says "voice mode", "let's talk", "talk to me", "speak your responses", or wants Claude to respond with spoken audio. Speaks all responses through TTS and plays via speakers.

5 0
Explore

Didn't find tool you were looking for?

Be as detailed as possible for better results