Agent skill
manual-regression-web
Perform manual web regression using Playwright MCP tools (interactive). Required for web use-case changes. Keep evidence low-context (labels + paths only).
Install this agent skill to your Project
npx add-skill https://github.com/frostaura/ai.toolkit.gaia/tree/main/.github/skills/manual-regression-web
SKILL.md
Manual Regression (Web via Playwright MCP)
When to use
Use when:
- Orchestrator flags use-case change affecting web UI, OR
- QA Gatekeeper requires manual regression, OR
- Automated E2E cannot be run in CI yet but validation is still required.
Manual regression is required for web use-case changes.
Inputs
- UC docs:
/docs/use-cases/UC-###-*.md - Running environment (local dev server and/or docker-compose)
- Playwright MCP tool access
Outputs
- Manual validation of UC acceptance criteria using Playwright MCP tools
- MCP proof label recorded:
playwright-mcp - Any added/updated Playwright specs recorded under
tests_added[](if created)
Rules
- Do not paste tool logs or long transcripts.
- Validate observable outcomes aligned to UC acceptance criteria.
- If the environment cannot be run (missing secrets/ports/credentials), create blockers via MCP and keep completion gated.
Step 1 — Ensure system is running
Preferred:
make up(if docker-compose stack)- or
make dev/make run(repo convention) Confirm: - Web app reachable
- Backend reachable if required for the flow
If not reachable:
- fix runtime/compose issues first (blocking)
Step 2 — Create a UC checklist (compact)
For each affected UC:
- list 3–8 checkpoints max:
- navigation start
- key input(s)
- key output(s)
- one critical error/edge path (if relevant)
Step 3 — Execute manual regression with Playwright MCP
Using Playwright MCP tools:
- Navigate the flow
- Capture failures precisely (what step, what expected, what happened)
- If failure found:
- create or update automated Playwright spec if feasible
- otherwise create a task/blocker describing the failure and required fix
Keep interaction tight:
- Prefer direct steps; avoid exploratory wandering
- Repeat only the minimal flow necessary to confirm fix
Step 4 — Record outcomes (low-context)
Do NOT store logs. Do:
- note which UC IDs were manually validated (in task notes if needed)
- record proof label
playwright-mcpin MCPmanual_regression[]
If the manual regression depended on specific local setup:
- add a short note in
/docs/testing/how-to-run.md(only if it’s stable and reusable)
Step 5 — Completion proof (MCP)
When marking task done:
manual_regression[]must include:playwright-mcptests_added[]includes any new/updated spec file pathschanged_files[]includes any config/doc paths changed
Suggested required_gates[]
- For web use-case change:
manual-regression(and typicallye2e) plus baselinelint/build/ci
Recommended Agent Skills
Expand your agent's capabilities with these related and highly-rated skills.
stack-default-dotnet-api
Opinionated baseline for .NET HTTP APIs: analyzers/formatting, unit+integration strategy, docker-compose, Makefile targets, and CI wiring.
tasking-and-proof
How the orchestrator must create/manage MCP tasks (todo/doing/done), set required_gates, handle blockers/questions, and record low-context proof for completion.
integration-testing-http
Run curl-style integration checks against the docker-compose stack for HTTP APIs. Required for use-case changes.
spec-consistency
Prevent and repair drift between `/docs` (source of truth), code, tests, CI, and runtime artifacts. Use before marking work done.
gaia-process
End-to-end Gaia SDLC workflow (Repo Explorer → drift/CI fixes → task graph → gated delivery → QA veto → MCP proof). Use for any work in a repo.
stack-default-web-ts
Opinionated baseline for JS/TS web repos: lint/format, tests, Playwright, docker (if HTTP API), Makefile targets, and CI wiring.
Didn't find tool you were looking for?