implementation-postmortem

Conduct structured implementation postmortems to gather feedback on architecture conformance, library friction, and tooling effectiveness. Use when reviewing completed implementations, PRs, or development phases to surface design gaps, boundary violations, and improvement opportunities. Triggers on requests for code review feedback, implementation retrospectives, architecture audits, or library/tooling evaluations.

Install this agent skill in your project:

npx add-skill https://github.com/leynos/agent-helper-scripts/tree/main/skills/implementation-postmortem

SKILL.md

Implementation Postmortem Agent

This skill guides structured postmortem analysis of completed implementations. The goal is adversarial review: surface friction, identify architectural drift, challenge assumptions. Implementers can handle honest critique.

Workflow

Phase 1: Context Gathering

Before conducting a postmortem, gather sufficient context. Never assume—ask.

1.1 Obtain PR/Implementation Summary

If reviewing a PR, fetch the summary:

```bash
# Get PR details including description and changed files
gh pr view <PR_NUMBER> --json title,body,files,commits,additions,deletions

# Get the diff for detailed analysis
gh pr diff <PR_NUMBER>

# List files changed
gh pr view <PR_NUMBER> --json files --jq '.files[].path'
```
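Once the file list is available, it can help to see where the churn is concentrated before reading the diff. The following is a minimal sketch, assuming the JSON has a top-level `files` array whose entries carry `path`, `additions`, and `deletions` fields; the function name `summarise_changed_files` and the sample data are hypothetical.

```python
import json
from collections import Counter


def summarise_changed_files(gh_json: str) -> Counter:
    """Sum changed lines per top-level directory.

    Assumes JSON shaped like {"files": [{"path", "additions", "deletions"}, ...]}.
    """
    data = json.loads(gh_json)
    churn: Counter = Counter()
    for entry in data.get("files", []):
        path = entry["path"]
        # Files in the repository root are grouped under "."
        top = path.split("/", 1)[0] if "/" in path else "."
        churn[top] += entry.get("additions", 0) + entry.get("deletions", 0)
    return churn


# Hypothetical sample standing in for real command output.
sample = json.dumps({
    "files": [
        {"path": "src/domain/order.rs", "additions": 40, "deletions": 5},
        {"path": "src/adapters/db.rs", "additions": 10, "deletions": 0},
        {"path": "README.md", "additions": 2, "deletions": 1},
    ]
})
print(summarise_changed_files(sample).most_common())
```

Directories with unexpectedly high churn are a reasonable place to start looking for boundary violations.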

For non-PR work, request:

The implementation scope (what was built)
Entry points and key files
Any design documents or ADRs referenced

1.2 Establish Architecture Context

Ask these questions if the architecture is not already known:

Structural questions:

What architectural pattern does this codebase follow? (hexagonal/ports-adapters, MVC, layered, event-driven, actor-based, codec pipeline, etc.)
What are the primary module/crate boundaries?
What invariants must the architecture preserve?

Implementation questions:

What in-house libraries were used? What are they meant to do?
What tooling was used during development? (test frameworks, code analysis, documentation tools)
Were there design documents or specifications? Where do they live?

Scope questions:

What was the goal of this implementation phase?
What constraints or deadlines applied?
Were any shortcuts intentionally taken (and documented)?

Phase 2: Select Assessment Framework

Based on the architecture, load the appropriate reference template:

| Architecture Pattern | Reference File |
| --- | --- |
| Hexagonal (ports/adapters) | `references/hexagonal-template.md` |
| MVC / Action-Command pipeline | `references/mvc-action-template.md` |
| Codec / Protocol pipeline | `references/codec-template.md` |
| Other / Custom | Use core dimensions below, adapt as needed |

If the architecture doesn't match a template, use the Core Postmortem Dimensions (Section 3) and adapt terminology.
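The selection step amounts to a lookup with a fallback. A minimal sketch, assuming simple keyword matching on the pattern name is good enough; the function `select_template` and the keyword choices are illustrative and not part of the skill itself.

```python
from typing import Optional

# Keyword -> reference file, taken from the mapping above.
TEMPLATES = {
    "hexagonal": "references/hexagonal-template.md",
    "mvc": "references/mvc-action-template.md",
    "codec": "references/codec-template.md",
}


def select_template(pattern: str) -> Optional[str]:
    """Return the reference file for a named pattern, or None.

    None means: fall back to the core dimensions and adapt terminology.
    """
    key = pattern.strip().lower()
    for keyword, path in TEMPLATES.items():
        if keyword in key:
            return path
    return None


print(select_template("Hexagonal (ports/adapters)"))
print(select_template("event-driven"))
```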

Phase 3: Conduct Assessment

Work through each dimension systematically. For each finding:

Cite evidence — file:line references, specific code patterns, measurable data
Classify severity — architectural violation (fix now) vs technical debt (track and schedule)
Distinguish symptom from cause — "slow" is a symptom; "O(n²) loop in hot path" is a cause
Note spec ambiguity — where design docs failed to answer a question the implementation faced
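One way to keep findings consistent is to record each one in a fixed shape that forces evidence, severity, and the symptom/cause split to be filled in. The sketch below is an assumption about how that could look; `Finding`, `Severity`, and the example file reference are hypothetical names, not part of the skill.

```python
from dataclasses import dataclass
from enum import Enum


class Severity(Enum):
    VIOLATION = "architectural violation (fix now)"
    DEBT = "technical debt (track and schedule)"


@dataclass(frozen=True)
class Finding:
    evidence: str             # file:line reference or measurable data
    severity: Severity
    symptom: str              # what was observed
    cause: str                # why it happens
    spec_ambiguity: str = ""  # question the design docs failed to answer

    def render(self) -> str:
        line = f"[{self.severity.name}] {self.evidence}: {self.cause} (symptom: {self.symptom})"
        if self.spec_ambiguity:
            line += f" | spec gap: {self.spec_ambiguity}"
        return line


finding = Finding(
    evidence="src/report.py:88",  # hypothetical location
    severity=Severity.DEBT,
    symptom="slow",
    cause="O(n²) loop in hot path",
)
print(finding.render())
```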

Core Postmortem Dimensions

These dimensions apply regardless of architecture. Architecture-specific templates extend them.

3.1 Specification Fidelity

Divergences between spec and implementation (intentional vs accidental)
Ambiguities in spec that caused implementation friction
Missing requirements discovered during implementation
Requirements that proved unnecessary or misguided

Key question: Where did the spec lie by omission?

3.2 Boundary Integrity

Every architecture defines boundaries. Assess:

Are boundaries enforced by the module/crate system?
What crosses boundaries that shouldn't?
Are boundary-crossing types appropriately abstract?

Smell test: If you had to replace one component (database, UI framework, protocol), what would break that shouldn't?
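For codebases where the module system does not enforce boundaries, an import scan can produce file:line evidence quickly. This is a rough sketch for Python sources using the standard `ast` module; the forbidden package names and the rule that "domain code must not import adapters" are assumptions chosen for illustration.

```python
import ast
from typing import List

# Hypothetical rule: domain-layer modules must not import these packages.
FORBIDDEN_FOR_DOMAIN = ("adapters", "sqlalchemy")


def boundary_violations(source: str, filename: str) -> List[str]:
    """List imports in a domain module that cross a forbidden boundary."""
    violations = []
    for node in ast.walk(ast.parse(source)):
        if isinstance(node, ast.Import):
            names = [alias.name for alias in node.names]
        elif isinstance(node, ast.ImportFrom):
            names = [node.module or ""]  # module is None for "from . import x"
        else:
            continue
        for name in names:
            if name.split(".")[0] in FORBIDDEN_FOR_DOMAIN:
                violations.append(f"{filename}:{node.lineno} imports {name}")
    return violations


example = "import json\nfrom adapters.db import Session\n"
print(boundary_violations(example, "domain/order.py"))
```

A scan like this only catches static imports; types that leak across a boundary through function signatures still need manual review.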

3.3 State Management

Where does authoritative state live?
Is there derived state that can drift from source?
Are state transitions explicit and auditable?
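As a reference point for what "explicit and auditable" can mean, the sketch below keeps the allowed transitions in one table and records every change. The `Order` type, its states, and the transition table are hypothetical; real implementations will differ.

```python
from enum import Enum


class OrderState(Enum):
    DRAFT = "draft"
    SUBMITTED = "submitted"
    SHIPPED = "shipped"
    CANCELLED = "cancelled"


# Every legal transition is listed in one place.
ALLOWED = {
    OrderState.DRAFT: {OrderState.SUBMITTED},
    OrderState.SUBMITTED: {OrderState.SHIPPED, OrderState.CANCELLED},
}


class Order:
    def __init__(self) -> None:
        self.state = OrderState.DRAFT
        self.audit_log = []  # (from_state, to_state) pairs, oldest first

    def transition(self, new_state: OrderState) -> None:
        if new_state not in ALLOWED.get(self.state, set()):
            raise ValueError(f"illegal transition {self.state.name} -> {new_state.name}")
        self.audit_log.append((self.state, new_state))
        self.state = new_state


order = Order()
order.transition(OrderState.SUBMITTED)
print(order.audit_log)
```

When an implementation mutates state from many call sites with no equivalent of the table or the log, that is worth recording as a finding.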

3.4 Error Handling

Error taxonomy: are different error categories (validation, I/O, business logic) distinguishable?
Recovery semantics: what errors are recoverable? How?
Observability: are errors logged with sufficient context?
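A distinguishable taxonomy usually shows up as separate error types with recovery semantics attached. The following is one possible shape, offered as an assumption rather than a prescription; all class names and the `handle` helper are hypothetical.

```python
import logging


class AppError(Exception):
    recoverable = False


class ValidationError(AppError):
    recoverable = True   # caller can correct the input and retry


class TransientIOError(AppError):
    recoverable = True   # retry may succeed once the resource is back


class BusinessRuleError(AppError):
    recoverable = False  # retrying the same request cannot succeed


def handle(err: AppError, *, context: dict) -> str:
    """Log the error with its context and decide whether to retry."""
    logging.getLogger("postmortem.example").error(
        "%s: %s | context=%s", type(err).__name__, err, context
    )
    return "retry" if err.recoverable else "abort"


print(handle(ValidationError("quantity must be positive"), context={"order_id": "A1"}))
print(handle(BusinessRuleError("credit limit exceeded"), context={"order_id": "A1"}))
```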

3.5 Testability

Can components be tested in isolation?
Are there integration tests for boundary crossings?
What's untested that should be?
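Isolation testing generally depends on logic being written against an interface that a test can substitute. A minimal sketch, assuming a repository-style port; `OrderRepository`, `InMemoryOrders`, and `is_large_order` are hypothetical names.

```python
from typing import Dict, Optional, Protocol


class OrderRepository(Protocol):
    """Port: the only thing the domain logic knows about storage."""

    def get_total(self, order_id: str) -> Optional[int]: ...


class InMemoryOrders:
    """Test double that satisfies the port without a database."""

    def __init__(self, totals: Dict[str, int]) -> None:
        self._totals = dict(totals)

    def get_total(self, order_id: str) -> Optional[int]:
        return self._totals.get(order_id)


def is_large_order(repo: OrderRepository, order_id: str, threshold: int = 1000) -> bool:
    total = repo.get_total(order_id)
    return total is not None and total >= threshold


repo = InMemoryOrders({"A1": 1500, "B2": 20})
print(is_large_order(repo, "A1"), is_large_order(repo, "B2"))
```

If the equivalent logic in the implementation can only be exercised by starting a real database, that points to a cause behind slow or missing tests.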

3.6 In-House Library Evaluation

For each in-house library used:

```markdown
## [Library Name]

### Fit for Purpose
- How well did the library's model match implementation needs?
- Impedance mismatches requiring workarounds?

### What Worked
- Specific positive example with context

### What Hurt
- Specific friction point
- Impact: [time lost / workaround complexity / bug introduced]
- Suggested fix or documentation improvement

### Documentation Gaps
- What you searched for but didn't find
- What was present but wrong/stale
```

3.7 Tooling Effectiveness

For each tool used (test frameworks, analysis tools, documentation generators, MCP servers):

| Tool | Purpose | Effectiveness | Recommendation |
| --- | --- | --- | --- |
| | | | Keep / Improve / Retire |

Questions per tool:

Did it surface useful insights or noise?
Integration friction with workflow?
False positives/negatives?
Where did it fail you?
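The report card can be generated from collected rows so that every tool gets exactly one of the three recommendations. A small sketch, assuming Markdown table output; `render_report_card` and the sample row are hypothetical.

```python
from typing import Iterable, List, Tuple

RECOMMENDATIONS = {"Keep", "Improve", "Retire"}


def render_report_card(rows: Iterable[Tuple[str, str, str, str]]) -> str:
    """Render (tool, purpose, effectiveness, recommendation) rows as a Markdown table."""
    lines: List[str] = [
        "| Tool | Purpose | Effectiveness | Recommendation |",
        "| --- | --- | --- | --- |",
    ]
    for tool, purpose, effectiveness, recommendation in rows:
        if recommendation not in RECOMMENDATIONS:
            raise ValueError(f"unknown recommendation: {recommendation!r}")
        lines.append(f"| {tool} | {purpose} | {effectiveness} | {recommendation} |")
    return "\n".join(lines)


# Hypothetical row for illustration.
card = render_report_card([("pytest", "unit tests", "high signal, fast", "Keep")])
print(card)
```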

Output Format

Structure the postmortem as:

Executive Summary (5 bullets maximum, ranked by severity)
Specification Gaps (ranked by impact)
Architecture Assessment (using appropriate template)
Boundary Violations (with file:line references where possible)
Library Feedback (per-library structured assessment)
Tooling Report Card (keep/improve/retire recommendations)
Recommendations (concrete, actionable, with effort estimates: S/M/L)
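The ranking and the five-bullet cap for the executive summary are easy to apply mechanically. The sketch below assumes a numeric severity score, which the skill does not define; `Recommendation` and `executive_summary` are hypothetical names.

```python
from typing import List, NamedTuple


class Recommendation(NamedTuple):
    severity: int  # hypothetical scale: higher means more severe
    effort: str    # "S", "M", or "L"
    text: str


def executive_summary(items: List[Recommendation], limit: int = 5) -> List[str]:
    """Return at most `limit` bullets, most severe first."""
    ranked = sorted(items, key=lambda r: r.severity, reverse=True)
    return [f"{r.text} (effort: {r.effort})" for r in ranked[:limit]]


items = [
    Recommendation(2, "S", "Document retry policy in the spec"),
    Recommendation(5, "L", "Move validation out of the adapter into the domain"),
    Recommendation(3, "M", "Add integration tests for the storage boundary"),
]
for bullet in executive_summary(items):
    print("-", bullet)
```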

Conduct Guidelines

Cite evidence. "The adapter felt bloated" → "OrderAdapter grew to 400 lines; 60% is validation logic that belongs in domain"
Distinguish symptoms from causes. "Tests are slow" is a symptom; "each test spins up a real database" is a cause.
Separate architectural violations from technical debt. Violations need immediate attention; debt can be scheduled.
Acknowledge what worked. If something worked well, say so briefly and move on—dwell on what needs attention.
Measure against the spec. The design documents are the contract. If no spec exists, note that as a finding.
Note spec ambiguity as feedback. Where the spec was unclear and implementation chose reasonably, feed that back to improve the spec.
Be direct. The implementer is reading this to improve. Hedging wastes their time.

Architecture-Specific Templates

For detailed assessment criteria, see:

references/hexagonal-template.md — Domain/ports/adapters pattern
references/mvc-action-template.md — MVC with action/command pipelines (e.g., GPUI-based apps)
references/codec-template.md — Protocol codec and framing pipelines

Load the appropriate template based on the architecture identified in Phase 1.

Maintainer

leynos Core maintainer

Source details

Full Name: leynos/agent-helper-scripts
Branch: main
Path in repo: skills/implementation-postmortem
