Agent skill
tasking-and-proof
How the orchestrator must create/manage MCP tasks (todo/doing/done), set required_gates, handle blockers/questions, and record low-context proof for completion.
Install this agent skill to your Project
npx add-skill https://github.com/frostaura/ai.toolkit.gaia/tree/main/.github/skills/tasking-and-proof
SKILL.md
Tasking & Proof (MCP Contract)
When to use
Use for every planned unit of work. This skill defines the procedural workflow for task management and proof recording.
Task model, MCP tool signatures, error codes, and canonical gate vocabulary live in AGENTS.md §7, §10, §12. This skill is the procedure only.
Orchestrator supremacy
- Only the Workload Orchestrator owns the authoritative task graph.
- Other agents may suggest tasks, but the orchestrator calls tasks_create.
- Subagents can use tasks_* tools for their own isolated task tracking when delegated complex work.
Step 1 — Create a complete task graph (planning)
Call tasks_create(project, title, requiredGates[]) for each unit of work:
- Repo drift fixes (docs↔code) (blocking if present)
- Skill drift fixes (blocking if present)
- CI fixes/additions (blocking if missing/failing)
- Dockerize HTTP API (blocking for use-case work if HTTP API)
- Docs/spec changes (use-cases, architecture, testing)
- Implementation
- Tests (unit/integration/e2e as required)
- Manual regression (as required)
- QA Gatekeeper review (always)
Keep tasks small but complete; prefer multiple tasks over one mega task.
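As a rough illustration of the planning pass above, the sketch below turns a few repository observations into an ordered list of units and issues one tasks_create call per unit. The helper names (`plan_units`, `create_graph`), the boolean inputs, and the recording lambda standing in for the MCP tool are assumptions made for the example; only the (project, title, requiredGates[]) argument order and the baseline gates come from this skill. Manual regression is omitted because it is only added "as required".

```python
from typing import Callable

BASELINE_GATES = ["lint", "build", "ci"]  # "Baseline (always)" gates from Step 2


def plan_units(repo_drift: bool, skill_drift: bool, ci_healthy: bool,
               http_api: bool, compose_present: bool) -> list[str]:
    """Return task titles in planning order; blocking fixes come first."""
    units: list[str] = []
    if repo_drift:
        units.append("Fix repo drift (docs vs code)")
    if skill_drift:
        units.append("Fix skill drift")
    if not ci_healthy:
        units.append("Fix or add CI")
    if http_api and not compose_present:
        units.append("Dockerize HTTP API")
    units += ["Docs/spec changes", "Implementation", "Tests",
              "QA Gatekeeper review"]
    return units


def create_graph(project: str, units: list[str],
                 tasks_create: Callable[[str, str, list[str]], None]) -> None:
    # One call per unit: several small tasks rather than one mega task.
    for title in units:
        tasks_create(project, title, list(BASELINE_GATES))


# Hypothetical stand-in for the MCP tool: it only records the calls it receives.
calls: list[tuple[str, str, list[str]]] = []
create_graph("demo", plan_units(True, False, False, True, False),
             lambda project, title, gates: calls.append((project, title, gates)))
```

In a real session the lambda would be replaced by whatever client call reaches the MCP server; the ordering logic is the part being illustrated.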
Step 2 — Set required_gates[] explicitly (no ambiguity)
Use gate labels from AGENTS.md §10 Canonical gate vocabulary:
- Baseline (always): lint, build, ci
- Use-case change: add integration / e2e / manual-regression as applicable
- Docker-first: if HTTP API and compose missing → create dockerize task and gate use-case tasks on it
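The gate rules above can be written down as a small pure function, which makes the "no ambiguity" requirement easy to check. This is a minimal sketch: the function name and its boolean parameters are hypothetical, while the gate labels are the ones listed in this step.

```python
def required_gates(use_case_change: bool = False,
                   needs_integration: bool = False,
                   needs_e2e: bool = False,
                   needs_manual_regression: bool = False) -> list[str]:
    """Build an explicit gate list: baseline always, extras only for use-case changes."""
    gates = ["lint", "build", "ci"]  # baseline gates apply to every task
    if use_case_change:
        if needs_integration:
            gates.append("integration")
        if needs_e2e:
            gates.append("e2e")
        if needs_manual_regression:
            gates.append("manual-regression")
    return gates
```

For example, `required_gates(use_case_change=True, needs_integration=True)` yields the baseline gates plus `integration`, and the result can be passed straight through as requiredGates[].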
Step 3 — In-flight task creation (mandatory)
When you discover TODOs, missing foundations, new scope requirements, or risky unknowns:
- Call tasks_create immediately or add to blockers[] via tasks_update.
- “No TODO left behind”: do not leave TODO comments without a corresponding MCP task or blocker.
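One way to check the "No TODO left behind" rule mechanically is to scan source text for TODO comments that carry no task reference. The `TODO(task:<id>)` marker used below is a hypothetical convention invented for this sketch; this skill does not prescribe how a TODO should point at its MCP task.

```python
import re

# Matches a bare TODO that is NOT immediately followed by "(task:<id>)".
# The "(task:<id>)" marker is an assumed convention, not part of the skill.
UNTRACKED_TODO = re.compile(r"\bTODO\b(?!\(task:[\w-]+\))")


def untracked_todos(source: str) -> list[tuple[int, str]]:
    """Return (line number, stripped line) for every TODO without a task marker."""
    return [
        (number, line.strip())
        for number, line in enumerate(source.splitlines(), start=1)
        if UNTRACKED_TODO.search(line)
    ]


sample = "x = 1  # TODO(task:T-12) tidy\ny = 2  # TODO handle None\n"
findings = untracked_todos(sample)
```

Each finding would then become either a new tasks_create call or an entry in blockers[] via tasks_update.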
Step 4 — Blockers + “needs input” mode
Use tasks_flag_needs_input(project, id, questions[]) when:
- secrets/credentials missing
- environment cannot run
- unclear requirements (must ask user)
Rules:
- A task with blockers cannot be marked done.
- Continue parallelizable work while waiting on input.
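The two rules above amount to a simple state check: a task with open blockers or unanswered questions cannot complete, while unaffected tasks stay workable. The `Task` dataclass and helper below are an illustrative in-memory model, not the MCP server's actual data structure (see AGENTS.md §7 for the real task model).

```python
from dataclasses import dataclass, field


@dataclass
class Task:
    id: str
    blockers: list[str] = field(default_factory=list)
    questions: list[str] = field(default_factory=list)  # pending needs-input questions

    def can_mark_done(self) -> bool:
        # Open blockers or unanswered questions both prevent completion.
        return not self.blockers and not self.questions


def workable(tasks: list[Task]) -> list[str]:
    """IDs of tasks that can proceed while others wait on input."""
    return [task.id for task in tasks if task.can_mark_done()]


tasks = [
    Task("t1", questions=["Which API key should the staging env use?"]),
    Task("t2"),
    Task("t3", blockers=["docker compose file missing"]),
]
```

Here only `t2` is workable; `t1` waits on the user and `t3` waits on its blocker.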
Step 5 — Completion proof (MCP args only)
Call tasks_mark_done(project, id, changedFiles[], testsAdded[], manualRegressionLabels[]):
- changedFiles[]: all files modified for this task (paths only)
- testsAdded[]: new/updated test files (paths only)
- manualRegressionLabels[]: labels performed for this task (e.g. curl, playwright-mcp)
Rule: proof is link-only (paths/labels). Do NOT paste logs.
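A small guard can help keep proof link-only before tasks_mark_done is called. The sketch below assembles the three arrays and rejects entries that look like pasted output. The function name and the rejection heuristics (blank entries, embedded newlines, a 200-character cap) are assumptions for illustration; the skill itself only says paths and labels, no logs.

```python
def build_proof(changed_files: list[str], tests_added: list[str],
                manual_regression_labels: list[str]) -> dict[str, list[str]]:
    """Assemble link-only proof arguments; raise if an entry looks like a log."""
    for entry in [*changed_files, *tests_added, *manual_regression_labels]:
        # Heuristic (assumed): real paths and labels are short and single-line.
        if not entry.strip() or "\n" in entry or len(entry) > 200:
            raise ValueError(f"proof entries must be paths or labels: {entry[:40]!r}")
    return {
        "changedFiles": sorted(set(changed_files)),  # de-duplicated paths
        "testsAdded": sorted(set(tests_added)),
        "manualRegressionLabels": sorted(set(manual_regression_labels)),
    }


proof = build_proof(["src/api.py", "src/api.py"], ["tests/test_api.py"], ["curl"])
```

The resulting dictionary keys mirror the tasks_mark_done argument names, so the values can be passed through unchanged.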
Step 6 — Enforced failures (expected MCP behavior)
tasks_mark_done refuses with these error codes (see AGENTS.md §7):
- GAIA_TASKS_ERR_BLOCKERS_UNRESOLVED: blockers exist
- GAIA_TASKS_ERR_NEEDS_INPUT_UNRESOLVED: human input pending
- GAIA_TASKS_ERR_GATES_UNSATISFIED: required gates not met
- GAIA_TASKS_ERR_MISSING_PROOF_ARGS: proof arrays empty
Agents must treat these errors as instructions for next actions.
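Treating errors as instructions can be as simple as a lookup from error code to follow-up step. The error codes below are the ones listed in this step; the wording of each next action and the `next_action` helper are an interpretation offered as a sketch, not behavior defined by AGENTS.md.

```python
# Error codes come from this skill; the action text is an assumed interpretation.
NEXT_ACTION = {
    "GAIA_TASKS_ERR_BLOCKERS_UNRESOLVED":
        "resolve the task's blockers, then retry",
    "GAIA_TASKS_ERR_NEEDS_INPUT_UNRESOLVED":
        "wait for the user's answers; continue parallelizable work",
    "GAIA_TASKS_ERR_GATES_UNSATISFIED":
        "run the unmet required gates, then retry",
    "GAIA_TASKS_ERR_MISSING_PROOF_ARGS":
        "supply changedFiles, testsAdded, and manualRegressionLabels",
}


def next_action(error_code: str) -> str:
    """Map a tasks_mark_done refusal to the follow-up step it implies."""
    return NEXT_ACTION.get(error_code, "unrecognized error: consult AGENTS.md §7")
```

An orchestrator loop could call `next_action` on each refusal and create or update tasks accordingly instead of retrying blindly.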
Step 7 — QA Gatekeeper coupling
Before final completion:
- QA Gatekeeper reviews tasks for gate satisfaction + proof consistency.
- If vetoed: create/fix tasks until approved.
References
- AGENTS.md (§7 Task model, §10 Gate vocabulary, §12 Memory/self-improvement)
- .github/skills/gaia-process/SKILL.md
- .github/agents/gaia-quality-gatekeeper.md