verify-issue

This is the fourth and final step in the issue workflow:

define-issue - Creates well-defined issues with acceptance criteria
refine-issue - Breaks issues into single-file-focused sub-tasks with dependencies
execute-issue - Implements sub-tasks one at a time
verify-issue (this skill) - Validates implementation against acceptance criteria

<backend_detection> FIRST: Detect the backend from mobius config before proceeding.

Read ~/.config/mobius/config.yaml or mobius.config.yaml in the project root:

yaml

backend: linear  # or 'jira'

Default: If no backend is specified, default to linear.

The detected backend determines which MCP tools to use throughout this skill. All subsequent tool references use the backend-specific tools from the <backend_context> section below. </backend_detection>

<verification_config> Read verification configuration from mobius.config.yaml:

yaml

execution:
  verification:
    coverage_threshold: 80        # Default: 80%
    require_all_tests_pass: true  # Default: true
    performance_check: true       # Default: true
    security_check: true          # Default: true
    max_rework_iterations: 3      # Default: 3

If not specified, use defaults. These settings control the multi-agent verification behavior. </verification_config>

<autonomous_actions> CRITICAL: The following actions MUST be performed AUTONOMOUSLY without asking the user:

Reopen failing sub-tasks - When verification finds issues (FAIL or NEEDS_WORK), IMMEDIATELY:
- Add feedback comments to failing sub-tasks with specific file:line references
- Transition failing sub-tasks back to "To Do" status
- Do not ask "Should I reopen these sub-tasks?" - just do it
Post verification report - Always post the review comment to the ticket without asking.
Mark verification sub-task Done - On PASS or PASS_WITH_NOTES, automatically mark the current verification sub-task as Done.

The verify-issue skill is designed to run end-to-end autonomously. User interaction is only needed for:

Escalation after max_rework_iterations exceeded
Ambiguous requirements that need clarification (DISCUSS status)

Note: The verification sub-task is created by refine-issue during issue breakdown. When mobius executes a "Verification Gate" sub-task, it routes to this skill instead of execute-issue. </autonomous_actions>

<backend_context> MCP Tools for Linear:

Fetch issue: mcp__plugin_linear_linear__get_issue
- Parameters: id (issue ID), includeRelations (boolean)
- Returns: Issue with status, description, acceptance criteria
List comments: mcp__plugin_linear_linear__list_comments
- Parameters: issueId
- Returns: Array of comments with implementation notes
List sub-tasks: mcp__plugin_linear_linear__list_issues
- Parameters: parentId, includeArchived
- Returns: Array of child issues with status
Add comment: mcp__plugin_linear_linear__create_comment
- Parameters: issueId, body (markdown)
- Use for: Posting verification report
Update status: mcp__plugin_linear_linear__update_issue
- Parameters: id, state (e.g., "Done"), labels (optional)
- Use for: Marking issue as Done if verified, or adding "needs-revision" label
Create follow-up issue: mcp__plugin_linear_linear__create_issue
- Parameters: team, title, description, labels, relatedTo
- Use for: Creating follow-up issues for discovered problems

Fetch issue: mcp_plugin_atlassian_jira__get_issue
- Parameters: issueIdOrKey (e.g., "PROJ-123")
- Returns: Issue with status, description, acceptance criteria
List comments: mcp_plugin_atlassian_jira__get_comments
- Parameters: issueIdOrKey
- Returns: Array of comments with implementation notes
List sub-tasks: mcp_plugin_atlassian_jira__list_issues
- Parameters: jql (e.g., "parent = PROJ-123")
- Returns: Array of child issues with status
Add comment: mcp_plugin_atlassian_jira__add_comment
- Parameters: issueIdOrKey, body (Jira wiki markup or markdown)
- Use for: Posting verification report
Update status: mcp_plugin_atlassian_jira__transition_issue
- Parameters: issueIdOrKey, transitionId or transitionName
- Use for: Transitioning to Done if verified
Create follow-up issue: mcp_plugin_atlassian_jira__create_issue
- Parameters: projectKey, summary, description, issueType, labels
- Use for: Creating follow-up issues for discovered problems </backend_context>

The review adds a structured comment to the ticket documenting findings, making the verification visible to the team.

<quick_start> Pass the issue identifier:

/verify PROJ-123

Or invoke programmatically:

Skill: verify-issue
Args: PROJ-123

<parent_story_mode> When verify-issue is called on a verification sub-task (via mobius loop), operate in parent story mode:

Identify parent issue - Get the parent issue ID from the verification sub-task
Collect all sibling sub-tasks - Fetch using backend list tool with parentId filter
Separate implementation vs verification sub-tasks - Filter out the current verification sub-task from the list
Verify all implementation sub-tasks "Done" - If any implementation sub-task is not complete, output STATUS: ALL_BLOCKED and exit
Aggregate context:
- Acceptance criteria from parent + each implementation sub-task
- Implementation notes from all sub-task comments
- Files modified across all sub-tasks
- Coverage data from all test runs

Note: The verification sub-task is created by refine-issue during issue breakdown. This skill focuses on EXECUTING verification, not creating the sub-task.

Context aggregation:

markdown

# Aggregated Verification Context

## Parent Issue: {ID} - {Title}
{Parent description and acceptance criteria}

## Sub-Tasks Summary
| ID | Title | Status | Files Modified | Key Changes |
|----|-------|--------|----------------|-------------|
| ... | ... | Done | file1.ts, file2.ts | {summary} |

## Combined Acceptance Criteria
From parent:
- [ ] Parent criterion 1
- [ ] Parent criterion 2

From sub-tasks:
- [ ] {Sub-task 1 ID}: Criterion from sub-task
- [ ] {Sub-task 2 ID}: Criterion from sub-task

## All Modified Files
{Deduplicated list of all files changed across sub-tasks}

## Implementation Notes (aggregated from comments)
{Key decisions, constraints, and context from all sub-task comments}

</parent_story_mode>

<verification_subtask_context> Note: The verification sub-task is created by the refine-issue skill during issue breakdown, NOT by verify-issue.

When mobius loop encounters a "Verification Gate" sub-task (detected by title pattern), it routes execution to /verify-issue instead of /execute-issue.

Expected sub-task format (created by refine-issue):

Title: [{parent-id}] Verification Gate (MUST contain "Verification Gate")
Blocked by: All implementation sub-tasks
Labels: ["verification"] (optional)

Execution flow:

refine-issue creates: implementation sub-tasks + Verification Gate sub-task
mobius loop executes implementation sub-tasks via /execute-issue
When all implementation sub-tasks are Done, Verification Gate becomes unblocked
mobius detects "Verification Gate" in title → routes to /verify-issue
verify-issue runs multi-agent review on the parent issue
On FAIL: reopens failing implementation sub-tasks → mobius loop continues
On PASS: marks Verification Gate Done → parent issue can be completed </verification_subtask_context>

<issue_context_phase> <fetch_issue> First, retrieve full issue details using the backend-appropriate fetch tool.

Extract:

Title and description: What was supposed to be built
Acceptance criteria: Checklist of requirements (look for checkbox patterns)
Labels: Bug/Feature/Improvement for context
Priority: Urgency level
Related issues: Context from connected work </fetch_issue>

<fetch_comments> Get implementation context from comments using the backend-appropriate list comments tool.

Look for:

Implementation notes from execute-issue
Design decisions or constraints
Questions or clarifications
Commit references </fetch_comments>

<fetch_subtasks> If issue has sub-tasks, get their status using the backend-appropriate list sub-tasks tool.

Verify:

All sub-tasks are in "Done" or "In Progress" (ready for review) state
No sub-tasks are still blocked or in Backlog
Each sub-task has completion comments </fetch_subtasks>

<context_summary> Build verification context:

markdown

# Verification Context

## Issue: {ID} - {Title}
**Type**: {Bug/Feature/Improvement}
**Priority**: {level}

## Description
{Full description}

## Acceptance Criteria
- [ ] Criterion 1
- [ ] Criterion 2
- [ ] Criterion 3

## Sub-tasks
| ID | Title | Status | Files Modified |
|----|-------|--------|----------------|
| ... | ... | ... | ... |

## Implementation Notes (from comments)
{Key decisions, constraints, commit references}

</context_summary> </issue_context_phase>

<implementation_analysis_phase> <git_analysis> Analyze recent commits related to the issue:

bash

# Find commits referencing the issue
git log --oneline --all --grep="{issue-id}" | head -20

# Get the branch if working on feature branch
git branch --contains | head -5

# Show files changed in recent commits
git log --oneline --name-only -10

Extract:

Commit messages and hashes
Files created or modified
Commit authors and dates </git_analysis>

<code_review> For each modified file, perform code review:

Read the file to understand what was implemented
Check for patterns: Does it follow codebase conventions?
Verify completeness: Does the code address the acceptance criteria?
Identify concerns: Any potential bugs, edge cases, or issues?

Focus areas:

Error handling
Input validation
Edge cases
Type safety
Test coverage
Documentation </code_review>

<test_file_review> Review corresponding test files:

Do tests exist for new functionality?
Do tests cover edge cases mentioned in acceptance criteria?
Are tests meaningful (not just coverage padding)?
Do test names describe behavior clearly? </test_file_review> </implementation_analysis_phase>

<verification_checks_phase> <run_tests> Execute the test suite:

bash

# Run all tests
just test

# Or run tests for specific files
just test-file {pattern}

Capture:

Pass/fail count
Any failures with error messages
Coverage information if available </run_tests>

<run_typecheck> Verify type safety:

bash

just typecheck

Capture any type errors or warnings. </run_typecheck>

<run_lint> Check code quality:

bash

just lint
# or
bun run lint

Note any linting issues. </run_lint>

<check_cicd_status> Verify CI/CD pipeline status before approving:

bash

# Check if there's an open PR for the current branch
gh pr view --json number,state,statusCheckRollup 2>/dev/null

# If no PR, check the latest workflow runs for the branch
gh run list --branch $(git branch --show-current) --limit 5

# Get detailed status of the most recent run
gh run view --json status,conclusion,jobs

CI/CD Check Logic:

If PR exists: Use statusCheckRollup to get all check statuses
- All checks PASS: CI status = PASS
- Any check PENDING: CI status = PENDING (wait or note in review)
- Any check FAILURE: CI status = FAIL
If no PR: Check latest workflow run on branch
- conclusion: success: CI status = PASS
- conclusion: failure: CI status = FAIL
- status: in_progress: CI status = PENDING
If no CI configured: Note this in review (CI status = N/A)

Important: A failing CI/CD status should block PASS recommendation. The implementation may be correct, but if CI is failing, it's not ready to merge.

bash

# Example: Parse PR check status
gh pr view --json statusCheckRollup --jq '.statusCheckRollup[] | "\(.name): \(.conclusion // .status)"'

# Example: Get workflow run conclusion
gh run list --branch $(git branch --show-current) --limit 1 --json conclusion,status --jq '.[0]'

</check_cicd_status>

<verification_summary> Compile verification results:

markdown

## Verification Checks

| Check | Status | Details |
|-------|--------|---------|
| Tests | PASS/FAIL | X passed, Y failed |
| Typecheck | PASS/FAIL | {error count if any} |
| Lint | PASS/FAIL | {warning count if any} |
| CI/CD | PASS/FAIL/PENDING/N/A | {workflow status, failed jobs if any} |

CI/CD blocking logic: If CI/CD status is FAIL, the overall verification status cannot be PASS, even if all other checks pass. A failing pipeline indicates the code is not ready for merge. </verification_summary> </verification_checks_phase>

<criteria_comparison_phase> <criterion_by_criterion> For each acceptance criterion, evaluate:

Is it addressed? - Code exists that implements this requirement
Is it complete? - All aspects of the criterion are handled
Is it testable? - There are tests verifying this behavior
Is it correct? - The implementation matches the intent

Mark each criterion:

PASS: Fully implemented, tested, and working
PARTIAL: Implemented but incomplete or missing tests
FAIL: Not implemented or broken
UNCLEAR: Cannot determine from code review alone </criterion_by_criterion>

<criteria_matrix> Build a criteria evaluation matrix:

markdown

## Acceptance Criteria Evaluation

| # | Criterion | Status | Evidence | Notes |
|---|-----------|--------|----------|-------|
| 1 | {criterion text} | PASS | {file:line or test name} | {any notes} |
| 2 | {criterion text} | PARTIAL | {what's missing} | {recommendations} |
| 3 | {criterion text} | FAIL | {what's wrong} | {fix needed} |

</criteria_matrix> </criteria_comparison_phase>

<multi_agent_review> Spawn four specialized review agents IN PARALLEL using Task tool:

Agent 1: Bug & Logic Detection

Task tool:
  subagent_type: feature-dev:code-reviewer
  prompt: |
    Analyze implementation for bugs and logic errors.

    Context:
    - Issue: {parent_id} - {title}
    - Acceptance Criteria: {all_criteria}
    - Files: {all_modified_files}

    Focus:
    - Logic errors, off-by-one bugs
    - Business logic vs requirements
    - Edge case handling
    - Error handling completeness

    Output (structured):
    CRITICAL: [list with file:line]
    IMPORTANT: [list]
    EDGE_CASES_MISSING: [list]
    PASS: true/false

Agent 2: Code Structure & Best Practices

Task tool:
  subagent_type: feature-dev:code-reviewer
  prompt: |
    Review code structure and codebase patterns.

    Files: {all_modified_files}

    Focus:
    - Codebase convention adherence
    - Code smells and anti-patterns
    - Readability and maintainability
    - Appropriate abstractions

    Output (structured):
    CODE_SMELLS: [list with file:line]
    PATTERN_VIOLATIONS: [list]
    ARCHITECTURE_CONCERNS: [list]
    PASS: true/false

Agent 3: Performance & Security

Task tool:
  subagent_type: feature-dev:code-reviewer
  prompt: |
    Analyze performance and security.

    Files: {all_modified_files}

    Focus:
    - N+1 queries, unnecessary loops
    - Memory leaks, resource cleanup
    - Input validation
    - Authorization checks
    - Sensitive data handling

    Output (structured):
    PERFORMANCE_ISSUES: [list with severity]
    SECURITY_VULNERABILITIES: [list with severity]
    PASS: true/false

Agent 4: Test Quality & Coverage

Task tool:
  subagent_type: feature-dev:code-reviewer
  prompt: |
    Evaluate test quality and coverage.

    Run: just test --coverage (or equivalent)
    Threshold: {coverage_threshold}% (default 80%)

    Test Files: {test_files}
    Source Files: {source_files}

    Focus:
    - Coverage percentage vs threshold
    - Test meaningfulness (not coverage padding)
    - Edge case test presence
    - Mock appropriateness

    Output (structured):
    COVERAGE_PERCENT: number
    THRESHOLD_MET: true/false
    MISSING_TESTS: [list]
    TEST_QUALITY_ISSUES: [list]
    PASS: true/false

Multi-Grader Aggregation

After all agents complete, aggregate using this logic:

if any(agent.CRITICAL or agent.SECURITY_VULNERABILITIES with severity=high):
    overall = FAIL
elif any(agent.IMPORTANT) or not test_agent.THRESHOLD_MET:
    overall = NEEDS_WORK
elif all(agent.PASS):
    overall = PASS
else:
    overall = PASS_WITH_NOTES

Aggregation Report Format:

markdown

## Multi-Agent Verification Results

### Agent 1: Bug & Logic Detection
Status: {PASS/FAIL}
{findings summary}

### Agent 2: Code Structure
Status: {PASS/FAIL}
{findings summary}

### Agent 3: Performance & Security
Status: {PASS/FAIL}
{findings summary}

### Agent 4: Test Quality
Status: {PASS/FAIL}
Coverage: {X}% (threshold: {Y}%)
{findings summary}

### Overall Status: {PASS/PASS_WITH_NOTES/NEEDS_WORK/FAIL}

</multi_agent_review>

<identify_improvements> Categorize findings from all agents:

Critical Issues (must fix):

Bugs that break functionality
Missing critical acceptance criteria
Security vulnerabilities (high severity)
Logic errors identified by Agent 1

Important Issues (should fix):

Missing edge case handling
Incomplete test coverage (below threshold)
Code quality concerns from Agent 2
Performance issues from Agent 3

Suggestions (nice to have):

Refactoring opportunities
Performance optimizations
Documentation improvements

Questions (need clarification):

Ambiguous requirements
Design decisions to verify
Edge cases not specified </identify_improvements>

<rework_loop> AUTONOMOUS ACTION: On FAIL or NEEDS_WORK, IMMEDIATELY implement the rework loop without asking the user. Reopening failing sub-tasks with feedback is a required part of the verification workflow.

On FAIL or NEEDS_WORK, implement the rework loop:

1. Map Findings to Sub-Tasks

Match each finding's file to the sub-task that modified it:

For each finding with file:line reference:
  1. Check git blame or sub-task comments for file ownership
  2. Map finding to the responsible sub-task
  3. Group all findings by sub-task ID

2. Add Feedback Comment to Each Failing Sub-Task

Use backend-appropriate add comment tool:

markdown

## Verification Feedback: NEEDS_REWORK

### Issues Found

**Critical** (must fix):
- {issue with file:line reference}

**Important** (should fix):
- {issue description}

### Recommended Fixes
{Specific guidance from review agents}

### Re-verification
After addressing these issues, the verification gate will automatically re-run when all implementation sub-tasks are complete again.

---
*Feedback from verify-issue multi-agent review*

3. Move Sub-Task Back to "To Do"

Linear:

mcp__plugin_linear_linear__update_issue:
  id: {sub_task_id}
  state: "Backlog"  # or "To Do" depending on workflow
  labels: ["needs-rework"]

Jira:

mcp_plugin_atlassian_jira__transition_issue:
  issueIdOrKey: {sub_task_key}
  transitionName: "To Do"

4. Update Verification Sub-Task

The verification sub-task remains blocked (its blockers moved back to non-Done). Add a comment documenting this rework iteration:

markdown

## Rework Iteration {N}

**Date**: {timestamp}
**Status**: Sub-tasks returned for rework

### Sub-Tasks Reopened
- {sub_task_1}: {reason summary}
- {sub_task_2}: {reason summary}

### Awaiting
All implementation sub-tasks to reach "Done" status before re-verification.

Loop Continuation

The loop continues naturally:

Loop polls for ready tasks
Picks up reopened sub-tasks
Executes them via execute-issue
When all implementation sub-tasks Done, verification sub-task unblocks
Verification runs again

Max Iterations: After max_rework_iterations (default 3) rework cycles, escalate:

Add "escalation-needed" label
Comment with full history
Do NOT block indefinitely

On PASS or PASS_WITH_NOTES

Mark verification sub-task Done:
- Linear: update_issue(state: "Done")
- Jira: transition_issue(transitionName: "Done")
Post verification report to parent issue as comment
Parent can now be completed (no longer blocked by verification sub-task) </rework_loop>

<review_report_phase> <report_structure> Generate a structured review report:

markdown

## Verification Report: {Issue ID}

### Summary
**Overall Status**: PASS / PASS_WITH_NOTES / NEEDS_WORK / FAIL
**Criteria Met**: X of Y
**Tests**: PASS / FAIL
**Typecheck**: PASS / FAIL

### Acceptance Criteria Evaluation
| # | Criterion | Status | Notes |
|---|-----------|--------|-------|
| 1 | ... | PASS | ... |
| 2 | ... | PARTIAL | ... |

### Verification Checks
- Tests: X passed, Y failed
- Typecheck: {status}
- Lint: {status}

### Implementation Review

**What was done well**:
- {positive observation 1}
- {positive observation 2}

**Critical Issues** (must fix before closing):
- {issue 1}
- {issue 2}

**Important Issues** (should address):
- {issue 1}
- {issue 2}

**Suggestions** (consider for future):
- {suggestion 1}
- {suggestion 2}

### Files Reviewed
- `{file1}` - {summary}
- `{file2}` - {summary}

### Recommendation
{APPROVE / REQUEST_CHANGES / DISCUSS}

{Closing summary with next steps}

</report_structure>

<status_definitions> Overall Status meanings:

Status	Meaning	Action
PASS	All criteria met, tests pass, no issues	Close issue
PASS_WITH_NOTES	Criteria met with minor suggestions	Close issue, optionally address suggestions
NEEDS_WORK	Some criteria not met or tests fail	Keep open, address issues
FAIL	Critical issues or many criteria not met	Keep open, major rework needed

Recommendation meanings:

Recommendation	Meaning
APPROVE	Ready to close, no blocking issues
REQUEST_CHANGES	Issues need resolution before closing
DISCUSS	Ambiguities need team input
</status_definitions>
</review_report_phase>

<ticket_update_phase> <post_review_comment> Add the review as a comment on the issue using the backend-appropriate add comment tool:

markdown

## Verification Review

**Status**: {PASS/PASS_WITH_NOTES/NEEDS_WORK/FAIL}
**Recommendation**: {APPROVE/REQUEST_CHANGES/DISCUSS}

### Acceptance Criteria
{criteria evaluation matrix}

### Checks
- Tests: {status}
- Typecheck: {status}

### Findings
{condensed findings - critical issues and important issues}

### Next Steps
{clear action items}

---
*Automated verification by verify-issue*

</post_review_comment>

<update_issue_status> Based on review outcome:

If PASS or PASS_WITH_NOTES: Use the backend-appropriate update status tool to move the issue to "Done".

If NEEDS_WORK or FAIL: Leave in current state. The comment documents what needs to be addressed.

Optionally add labels (if supported by backend):

Add "needs-revision" label for issues that need more work </update_issue_status> </ticket_update_phase>

<completion_report> <report_format> Output a summary for the user:

markdown

# Verification Complete

## Issue: {ID} - {Title}

**Status**: {PASS/PASS_WITH_NOTES/NEEDS_WORK/FAIL}
**Recommendation**: {APPROVE/REQUEST_CHANGES/DISCUSS}

### Summary
- Acceptance Criteria: {X of Y} met
- Tests: {status}
- Typecheck: {status}
- Lint: {status}

### Key Findings
{Top 3-5 findings}

### Actions Taken
- [x] Review comment posted to ticket
- [x] Issue status updated (if PASS)
- [ ] Follow-up issues created (if applicable)

### Next Steps
{Clear recommendations}

</report_format>

<follow_up_issues> If critical or important issues are found that won't be fixed immediately, use the backend-appropriate create issue tool to create follow-up issues.

Include:

Clear title describing the issue
Reference to the original issue in the description
Appropriate labels (e.g., "follow-up")
Related issue link

Link follow-up issues in the verification comment. </follow_up_issues>

<status_markers> IMPORTANT: Output a STATUS marker at the end of execution for mobius loop detection.

On PASS or PASS_WITH_NOTES:

STATUS: SUBTASK_COMPLETE

On FAIL or NEEDS_WORK (after reopening sub-tasks):

STATUS: VERIFICATION_FAILED

The mobius loop monitors agent output for these markers to determine execution results. </status_markers> </completion_report>

Issue: PROJ-100 - Add dark mode support

Findings:

All 5 acceptance criteria met
Tests pass (12 new tests added)
Typecheck clean
Code follows existing patterns

Output:

markdown

## Verification Review

**Status**: PASS
**Recommendation**: APPROVE

### Acceptance Criteria
| # | Criterion | Status |
|---|-----------|--------|
| 1 | Theme follows system preference by default | PASS |
| 2 | Settings screen has theme toggle | PASS |
| 3 | All text maintains 4.5:1 contrast ratio | PASS |
| 4 | Theme preference persists across restarts | PASS |
| 5 | No flash of wrong theme on launch | PASS |

### Checks
- Tests: 12 passed, 0 failed
- Typecheck: PASS

### What was done well
- Clean separation of theme logic into ThemeProvider
- Comprehensive test coverage for all modes
- Proper localStorage persistence

All criteria met. Ready to close.

</pass_example>

<needs_work_example> Input: /verify PROJ-200

Issue: PROJ-200 - Fix schedule deactivation error

Findings:

2 of 3 acceptance criteria met
Tests pass but missing edge case coverage
Typecheck clean
Missing error handling for concurrent deactivation

Output:

markdown

## Verification Review

**Status**: NEEDS_WORK
**Recommendation**: REQUEST_CHANGES

### Acceptance Criteria
| # | Criterion | Status | Notes |
|---|-----------|--------|-------|
| 1 | User can deactivate without error | PASS | Works for single user |
| 2 | Schedule status updates to inactive | PASS | Verified |
| 3 | Team members see status change | PARTIAL | No sync test, potential race condition |

### Checks
- Tests: 8 passed, 0 failed
- Typecheck: PASS

### Critical Issues
- No handling for concurrent deactivation attempts
- Missing PowerSync conflict resolution

### Next Steps
1. Add optimistic locking or conflict resolution
2. Add multi-user test for sync scenario
3. Re-verify after changes

</needs_work_example>

<anti_patterns> Don't skip code review:

BAD: Only run tests without reading the code
GOOD: Review implementation against each acceptance criterion

Don't be superficial:

BAD: "Tests pass, looks good"
GOOD: Thorough analysis of correctness, completeness, quality

Don't nitpick on style:

BAD: Flag every style preference as an issue
GOOD: Focus on correctness, completeness, and maintainability

Don't approve incomplete work:

BAD: "2 of 5 criteria met, but PASS"
GOOD: NEEDS_WORK until all criteria are addressed

Don't skip the ticket comment:

BAD: Tell user the results but don't post to ticket
GOOD: Always document verification on the ticket

Don't forget to check sub-tasks:

BAD: Only verify parent issue
GOOD: Verify all sub-tasks are complete before overall review </anti_patterns>

<success_criteria> A successful verification achieves:

Configuration & Setup:

Backend detected from config (linear or jira)
Verification config loaded (coverage_threshold, max_rework_iterations, etc.)
Coverage threshold configurable (default 80%)

Context Gathering (AC 1, 2):

verify-issue receives verification sub-task ID as input (from mobius loop)
Parent issue identified from the verification sub-task
Full parent issue context loaded (description, criteria, comments)
All sibling implementation sub-tasks collected and analyzed
Aggregated context from parent AND all implementation sub-tasks
All modified files identified across implementation sub-tasks

Quality Gate Dimensions (evaluated during review):

Testing (all tests pass, coverage >= threshold)
Code structure (best practices, no code smells)
Performance (no regressions identified)
Security (no vulnerabilities found)
Business logic correctness (matches requirements)
User story satisfaction (solves user's problem)

Multi-Agent Review:

All 4 specialized review agents spawned in parallel (multi-agent verification)
Agent 1: Bug & Logic Detection completed
Agent 2: Code Structure & Best Practices completed
Agent 3: Performance & Security completed
Agent 4: Test Quality & Coverage completed
Multi-grader aggregation computed overall status

Verification Checks:

Tests executed and results captured
Coverage threshold evaluated against configurable value
Typecheck and lint run
CI/CD status checked

Criteria Evaluation:

Each acceptance criterion evaluated with evidence
Findings categorized (Critical, Important, Suggestions, Questions)

Rework Loop (AC 6, 7, 9 - on FAIL/NEEDS_WORK):

Findings mapped to responsible sub-tasks
Failing sub-tasks receive detailed feedback comments with file:line references
Failing sub-tasks moved back to "To Do" status
Rework iteration count tracked (up to max_rework_iterations)
Rework iteration documented on verification sub-task
Loop continues naturally (reopened tasks picked up by normal polling)

Completion (AC 8 - on PASS/PASS_WITH_NOTES):

Verification sub-task marked Done when quality checks pass
Verification report posted to parent issue
Parent issue unblocked for completion (no longer blocked by verification sub-task)

Reporting:

Structured review report generated
Review comment posted to ticket
Clear next steps communicated to user </success_criteria>

Search AI Tools

Install this agent skill to your Project

SKILL.md

Agent 1: Bug & Logic Detection

Agent 2: Code Structure & Best Practices

Agent 3: Performance & Security

Agent 4: Test Quality & Coverage

Multi-Grader Aggregation

1. Map Findings to Sub-Tasks

2. Add Feedback Comment to Each Failing Sub-Task

3. Move Sub-Task Back to "To Do"

4. Update Verification Sub-Task

Loop Continuation

On PASS or PASS_WITH_NOTES