Agent skill

verify

Enforce evidence-based completion claims — require fresh command output before reporting success. Use when completing any task, fixing a bug, finishing a phase, running tests, building, deploying, or making any "it works" claim.

View SKILL.md on GitHub Repository

Stars 1,097

Forks 167

Install this agent skill to your Project

npx add-skill https://github.com/codeaholicguy/ai-devkit/tree/main/skills/verify

SKILL.md

Verify

Prove it works before saying it works.

Hard Rules

Do not claim completion without fresh terminal evidence from this session.
Forbidden words in completion claims: "should", "probably", "seems to", "likely", "I believe", "I think it works". These signal unverified assertions.
Cached, remembered, or previous-session output is not evidence. Run it again.

Gate Function

Every completion claim must pass all 5 steps in order:

Identify — What command proves this claim? If multiple commands are needed, run the gate once per command.
Run — Execute the full command now. No partial runs, no skipping.
Read — Read complete output. Check exit code. Count pass/fail.
Confirm — Does the output prove the exact claim?
Report — State the result, cite command, exit code, and key output.

If any step fails, stop. Fix the issue and restart from step 1.

If no verification command exists (e.g., no test suite), tell the user and ask them how to verify before claiming done.

Verification Patterns

Claim	Required Evidence	Not Sufficient
Tests pass	Test output: 0 failures, exit 0	Previous run, "should pass now"
Build succeeds	Build output: exit 0	Linter passing, partial build
Bug is fixed	Reproduce symptom → now passes	"Changed code, should be fixed"
Linter clean	Linter output: 0 errors	Single file check
Phase complete	Each criterion verified individually	"Tests pass, so done"
Feature works	E2E test or manual walkthrough	Unit tests alone

Regression Verification

For bug fixes, a single pass is not enough:

Write a test covering the bug.
Run → must pass (fix in place).
Revert the fix.
Run → must fail (proves test catches the bug).
Restore the fix.
Run → must pass.

If step 4 passes, the test is wrong. Rewrite it.

Red Flags and Rationalizations

Rationalization	Why It's Wrong	Do Instead
"This change is trivial"	Trivial changes break things constantly	Run the check
"I ran it earlier"	Code changed since then	Run it again now
"The test is flaky"	Flaky ≠ ignorable	Fix the flake first
"It compiles, so it works"	Compilation ≠ correctness	Run the tests
"The CI will catch it"	CI is a safety net, not a substitute	Verify locally first
"The agent said it's done"	Agent claims need verification too	Check diff and run tests

Memory Integration

After a failed verification, store the failure pattern: npx ai-devkit@latest memory store --title "<failure pattern>" --content "<what failed and how to avoid>" --tags "verify,failure-pattern"

Maintainer

codeaholicguy Core maintainer

Source details

Full Name: codeaholicguy/ai-devkit
Branch: main
Path in repo: skills/verify
Topics: ai claude-code agent-skills development antigravity cursor codex ai-assisted-development engineering prd engineering-enablement engineering-experience

Featured Tools

Join Our Newsletter

Stay updated with the latest AI tools, news, and offers by subscribing to our weekly newsletter.

Recommended Agent Skills

Expand your agent's capabilities with these related and highly-rated skills.

codeaholicguy/ai-devkit

simplify-implementation

Analyze and simplify existing implementations to reduce complexity, improve maintainability, and enhance scalability. Use when users ask to simplify code, reduce complexity, refactor for readability, clean up implementations, improve maintainability, reduce technical debt, or make code easier to understand.

1,097 167

Explore

codeaholicguy/ai-devkit

dev-lifecycle

Structured SDLC workflow with 8 phases — requirements, design review, planning, implementation, testing, and code review. Use when the user wants to build a feature end-to-end, or run any individual phase (new requirement, review requirements, review design, execute plan, update planning, check implementation, write tests, code review).

1,097 167

Explore

codeaholicguy/ai-devkit

tdd

Test-driven development — write a failing test before writing production code. Use when implementing new functionality, adding behavior, or fixing bugs during active development.

1,097 167

Explore

codeaholicguy/ai-devkit

debug

Guide structured debugging before code changes by clarifying expected behavior, reproducing issues, identifying likely root causes, and agreeing on a fix plan with validation steps. Use when users ask to debug bugs, investigate regressions, triage incidents, diagnose failing behavior, handle failing tests, analyze production incidents, investigate error spikes, or run root cause analysis (RCA).

1,097 167

Explore

codeaholicguy/ai-devkit

memory

Use AI DevKit memory via CLI commands. Search before non-trivial work, store verified reusable knowledge, update stale entries, and avoid saving transcripts, secrets, or one-off task progress.

1,097 167

Explore

codeaholicguy/ai-devkit

technical-writer

Review and improve documentation for novice users. Use when users ask to review docs, improve documentation, audit README files, evaluate API docs, review guides, or improve technical writing.

1,097 167

Explore

Didn't find tool you were looking for?

Search AI Tools

Install this agent skill to your Project

SKILL.md

Verify

Hard Rules

Gate Function

Verification Patterns

Regression Verification

Red Flags and Rationalizations

Memory Integration

Recommended Agent Skills

simplify-implementation

dev-lifecycle

tdd

debug

memory

technical-writer