Agent skill

verification-before-completion

Verification discipline for completion claims. Use when about to assert success, claim a fix is complete, report tests passing, or before commits and PRs. Enforces evidence-first workflow.

Stars 17
Forks 1

Install this agent skill to your Project

npx add-skill https://github.com/CodingCossack/agent-skills-library/tree/main/skills/verification-before-completion

SKILL.md

Verification Before Completion

NO COMPLETION CLAIMS WITHOUT FRESH VERIFICATION EVIDENCE

Core Protocol

Evidence before claims, always. If you haven't run the verification command in this message, you cannot claim it passes.

BEFORE any completion claim:
1. IDENTIFY: What verification command proves this claim?
2. RUN: Execute the FULL command (fresh, complete)
3. READ: Full output, check exit code, count failures
4. VERIFY: Does output confirm the claim?
   - NO → State actual status with evidence
   - YES → State claim WITH evidence
5. ONLY THEN: Make the claim

Command Selection

When multiple verification options exist (mono-repo, multiple suites):

  • Run the most specific command that covers the changed code
  • When uncertain, run the broadest command (full test suite > single file)
  • Lint ≠ build ≠ test — each verifies different claims

Evidence Format

✅ Ran: npm test
   Exit: 0
   Result: 47 passed, 0 failed
   "All tests pass."

❌ "Tests should pass now" (no command output)

Verification Requirements by Claim Type

Claim Required Evidence Insufficient
Tests pass Test output: 0 failures Previous run, "should pass"
Linter clean Linter output: 0 errors Partial check, extrapolation
Build succeeds Build exit code: 0 Linter passing
Bug fixed Original symptom test passes Code changed
Regression test Red-green cycle verified Single green
Agent completed VCS diff shows changes Agent "success" report
Requirements met Line-by-line checklist Tests passing

Red Flags — STOP

  • Words: "should", "probably", "seems to"
  • Satisfaction before verification: "Great!", "Perfect!", "Done!"
  • About to commit/push/PR without verification
  • Trusting agent success reports
  • Partial verification
  • ANY wording implying success without verification output

Rationalization Prevention

Excuse Response
"Should work now" Run the verification
"I'm confident" Confidence ≠ evidence
"Just this once" No exceptions
"Linter passed" Linter ≠ build
"Agent said success" Verify independently
"Partial check enough" Partial proves nothing

Key Patterns

Tests:

✅ [Run test] → [See: 34/34 pass] → "All tests pass"
❌ "Should pass now"

Regression (TDD):

✅ Write → Run (pass) → Revert fix → Run (MUST FAIL) → Restore → Run (pass)
❌ "Wrote regression test" (no red-green)

Requirements:

✅ Re-read plan → Checklist each item → Report gaps or completion
❌ "Tests pass, phase complete"

Agent delegation:

✅ Agent reports → Check VCS diff → Verify changes → Report actual state
❌ Trust agent report

Expand your agent's capabilities with these related and highly-rated skills.

CodingCossack/agent-skills-library

brainstorming

Collaborative design exploration that refines ideas into validated specs through iterative questioning. Use before any creative work including creating features, building components, adding functionality, or modifying behavior.

17 1
Explore
CodingCossack/agent-skills-library

test-driven-development

Red-green-refactor development methodology requiring verified test coverage. Use for feature implementation, bugfixes, refactoring, or any behavior changes where tests must prove correctness.

17 1
Explore
CodingCossack/agent-skills-library

using-superpowers

Meta-skill enforcing skill discovery and invocation discipline through mandatory workflows. Use when starting any conversation to check for relevant skills before any response, ensuring skill-first workflow before proceeding.

17 1
Explore
CodingCossack/agent-skills-library

requesting-code-review

Use when you need to request a code review for a PR/MR and want a consistent review brief (context, scope, risk areas, test instructions, acceptance criteria) before merge.

17 1
Explore
CodingCossack/agent-skills-library

writing-plans

Structured implementation planning for multi-step development tasks. Use when you have a spec or requirements and need to break work into executable steps.

17 1
Explore
CodingCossack/agent-skills-library

systematic-debugging

Root cause analysis for debugging. Use when bugs, test failures, or unexpected behavior have non-obvious causes, or after multiple fix attempts have failed.

17 1
Explore

Didn't find tool you were looking for?

Be as detailed as possible for better results