Agent skill

formal-verify

Continuous formal verification of architectural constraints and code quality. Use when asked to verify, audit, or validate codebase integrity. Runs automatically via hooks on every edit (structural) and pre-commit (full). Catches ownership violations, boundary crossings, state machine bugs, and code smells that grep ratchets miss. Triggers: "verify", "formal verify", "check architecture", "audit code quality", "run verification", "/verify", "/verify --bootstrap", "/verify --grade".

Install this agent skill to your Project

npx add-skill https://github.com/petekp/claude-code-setup/tree/main/skills/formal-verify

Metadata

Additional technical details for this skill

author: petekp
version: 0.1.0

SKILL.md

formal-verify

Use this skill when architectural intent matters more than "it compiles."

This skill runs a three-layer verification loop:

Layer 1: structural verification over extracted AST facts and declarative rules
Layer 2: behavioral verification over Z3Py protocol specs and TLA+/Apalache state-machine specs
Layer 3: elegance auditing over complexity, consistency, and craft heuristics

The layers are intentionally tiered:

every edit: Layer 1 only, fast enough for continuous feedback
slice checkpoint: Layers 1 and 2
pre-commit and manual /verify: all three layers
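
The tiering above amounts to a lookup from trigger to layers. A minimal sketch, assuming hypothetical names (`Trigger`, `LAYERS_BY_TRIGGER`, `layers_for`) that are not part of the skill itself:

```python
from enum import Enum


class Trigger(Enum):
    """Hypothetical names for the events that start a verification run."""
    EDIT = "edit"
    SLICE_CHECKPOINT = "slice-checkpoint"
    PRE_COMMIT = "pre-commit"
    MANUAL_VERIFY = "manual-verify"


# Layer numbers per trigger, mirroring the tiers listed above.
LAYERS_BY_TRIGGER = {
    Trigger.EDIT: (1,),
    Trigger.SLICE_CHECKPOINT: (1, 2),
    Trigger.PRE_COMMIT: (1, 2, 3),
    Trigger.MANUAL_VERIFY: (1, 2, 3),
}


def layers_for(trigger: Trigger) -> tuple:
    """Return the layers that should run for the given trigger."""
    return LAYERS_BY_TRIGGER[trigger]


print(layers_for(Trigger.EDIT))
print(layers_for(Trigger.PRE_COMMIT))
```

Keeping the mapping in one table makes the cost trade-off explicit: the cheapest layer runs most often, and each slower layer is added only at coarser checkpoints.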

Quick Start

Bootstrap a target project with:

```bash
/verify --bootstrap
```

Bootstrap runs four phases:

Install dependencies and create .verifier/
Discover architectural rules from docs and code shape
Interview the user in plain English about ambiguities
Validate the initial rules against the current codebase

Commands

/verify: runs all layers in verbose mode and prints a unified report.
/verify --bootstrap: installs dependencies, creates .verifier/, and scaffolds the first rule set.
/verify --evolve: checks for drift between architectural docs and existing verification specs.
/verify --grade: runs Layer 3 only and reports the current elegance grade.

How Verification Runs

Layer 1: Structural

The runner extracts facts from Rust and Swift source files, then checks structural.yaml rules such as:

only module X may cross boundary Y
modules matching pattern Z must implement interface W
no module may reference legacy identifiers

Structural checks are the default PostToolUse hook because they are the fastest.
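
To make the first rule shape concrete, here is a minimal sketch of checking "only module X may cross boundary Y" over already-extracted facts. The fact and rule shapes (`CrossingFact`, `OnlyMayCrossRule`) and the sample paths are assumptions for illustration. The real fact format and rule schema are defined in the references, not here.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class CrossingFact:
    """Hypothetical extracted fact: `module` references something behind `boundary`."""
    module: str
    boundary: str
    file: str
    line: int


@dataclass(frozen=True)
class OnlyMayCrossRule:
    """Hypothetical in-memory form of 'only module X may cross boundary Y'."""
    allowed_module: str
    boundary: str


def find_violations(rule, facts):
    """Return every fact that crosses the rule's boundary from a disallowed module."""
    return [
        fact
        for fact in facts
        if fact.boundary == rule.boundary and fact.module != rule.allowed_module
    ]


# Illustrative facts only; module names and paths are made up.
facts = [
    CrossingFact("bridge", "ffi", "src/bridge.rs", 12),
    CrossingFact("ui", "ffi", "Sources/UI/View.swift", 40),
    CrossingFact("ui", "storage", "Sources/UI/Cache.swift", 7),
]
rule = OnlyMayCrossRule(allowed_module="bridge", boundary="ffi")

violations = find_violations(rule, facts)
for v in violations:
    print(f"{v.file}:{v.line}: module '{v.module}' crosses boundary '{v.boundary}'")
```

A check like this is a single pass over cached facts, which is consistent with running Layer 1 on every edit.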

Layer 2: Behavioral

Behavioral verification covers state transitions and protocol contracts:

TLA+/Apalache for temporal properties, liveness, and interleavings
Z3Py spec files for contracts, invariants, and cross-boundary data guarantees

Use this layer at slice checkpoints, before risky merges, and whenever a change touches coordination logic or cross-language contracts.
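
The kind of property this layer targets can be illustrated without either tool. The sketch below is a plain-Python stand-in, not TLA+, Apalache, or Z3Py: it exhaustively explores a tiny, hypothetical state machine and returns a counterexample trace when a safety property is violated. The state names and the property are assumptions chosen for illustration.

```python
from collections import deque

# Hypothetical session lifecycle: state -> set of allowed next states.
TRANSITIONS = {
    "idle": {"connecting"},
    "connecting": {"ready", "failed"},
    "ready": {"closing"},
    "failed": {"idle"},
    "closing": {"idle"},
}


def find_counterexample(initial, transitions, is_bad_step):
    """Breadth-first search for the shortest reachable path ending in a bad step.

    Returns the path as a list of states, or None if no reachable step is bad.
    """
    queue = deque([[initial]])
    seen = {initial}
    while queue:
        path = queue.popleft()
        for nxt in sorted(transitions.get(path[-1], ())):
            if is_bad_step(path[-1], nxt):
                return path + [nxt]
            if nxt not in seen:
                seen.add(nxt)
                queue.append(path + [nxt])
    return None


def enters_ready_illegally(src, dst):
    """Safety property: 'ready' may only be entered from 'connecting'."""
    return dst == "ready" and src != "connecting"


# The intended machine satisfies the property.
ok = find_counterexample("idle", TRANSITIONS, enters_ready_illegally)

# A buggy variant that lets a failed session jump straight to ready.
buggy = {**TRANSITIONS, "failed": {"idle", "ready"}}
trace = find_counterexample("idle", buggy, enters_ready_illegally)

print(ok)
print(trace)
```

Brute-force search only works for small finite models. A model checker or SMT solver earns its place when the state space is large, unbounded, or involves liveness and interleavings.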

Layer 3: Elegance

Elegance auditing scores code for:

complexity
consistency
craft

It produces a grade and line-level deductions so the agent can clean up code, not just make it technically correct.
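
One way to picture a grade built from line-level deductions is sketched below. The starting score, point values, and letter floors are assumptions for illustration. The actual thresholds and grade policy live in elegance.yaml.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Deduction:
    """Hypothetical line-level deduction record."""
    file: str
    line: int
    family: str  # "complexity", "consistency", or "craft"
    points: int
    reason: str


# Assumed letter floors, highest first; not the skill's real policy.
GRADE_FLOORS = [(90, "A"), (80, "B"), (70, "C"), (60, "D")]


def grade(deductions, start=100):
    """Subtract deduction points from `start` and map the score to a letter."""
    score = max(0, start - sum(d.points for d in deductions))
    for floor, letter in GRADE_FLOORS:
        if score >= floor:
            return score, letter
    return score, "F"


# Illustrative deductions; paths and reasons are made up.
deductions = [
    Deduction("src/sync.rs", 88, "complexity", 8, "function nests five levels deep"),
    Deduction("Sources/UI/View.swift", 40, "consistency", 5, "naming differs from sibling types"),
]

score, letter = grade(deductions)
for d in deductions:
    print(f"{d.file}:{d.line} [{d.family}] -{d.points}: {d.reason}")
print(f"grade: {letter} ({score})")
```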

Violation Handling

When a violation is found, tailor the output to the audience:

agent output: counterexample, diagnosis, concrete fix suggestion
human output: counterexample and diagnosis only

If the agent fails to resolve the same violation three times, stop the fix loop and escalate with:

the original rule
the counterexample
the three attempted fixes
what still appears to block a correct repair
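
The three-strike policy above can be sketched as a bounded loop that either resolves the violation or returns an escalation record with the four items listed. All names here (`run_fix_loop`, `Escalation`, and the callback parameters) are hypothetical. The real hook integration is described in the agent feedback loop reference.

```python
from dataclasses import dataclass
from typing import Optional

MAX_ATTEMPTS = 3  # the three-attempt limit described above


@dataclass(frozen=True)
class Escalation:
    """Hypothetical escalation payload carrying the four items listed above."""
    rule: str
    counterexample: str
    attempted_fixes: tuple
    blocker: str


def run_fix_loop(rule, counterexample, propose_fix, still_violates,
                 diagnose_blocker) -> Optional[Escalation]:
    """Try up to MAX_ATTEMPTS fixes; return None on success, else an Escalation."""
    attempted = []
    for _ in range(MAX_ATTEMPTS):
        fix = propose_fix(attempted)
        attempted.append(fix)
        if not still_violates(fix):
            return None  # violation resolved, nothing to escalate
    return Escalation(rule, counterexample, tuple(attempted),
                      diagnose_blocker(attempted))


# Stub callbacks stand in for the agent and the verifier.
common = dict(
    rule="only module bridge may cross boundary ffi",
    counterexample="Sources/UI/View.swift:40 references ffi",
    propose_fix=lambda attempted: f"attempt {len(attempted) + 1}",
    diagnose_blocker=lambda attempted: "ui needs an ffi call with no bridge equivalent",
)

escalated = run_fix_loop(still_violates=lambda fix: True, **common)
resolved = run_fix_loop(still_violates=lambda fix: fix != "attempt 2", **common)

print(escalated)
print(resolved)
```

Passing the earlier attempts to `propose_fix` lets each new attempt avoid repeating a fix that has already failed.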

Project Structure Created In The Target Repo

Bootstrap creates and maintains:

```text
.verifier/
├── structural.yaml
├── elegance.yaml
├── specs/
├── facts/
└── reports/
```

structural.yaml: declarative Layer 1 rules
elegance.yaml: thresholds and grade policy
specs/: Z3Py and TLA+ behavioral specs
facts/: cached AST facts
reports/: the most recent verification outputs

facts/ and reports/ should be gitignored in the target project.

Operating Guidance

Run /verify before claiming a migration is complete.
Run /verify --grade when the code is correct but still feels rough.
Prefer updating rules and specs over weakening them when the architecture evolves intentionally.
Keep SKILL.md focused on orchestration; pull detailed mechanics from the references below.

References

@references/layer1-structural.md: fact extraction, Z3 encoding, reachability, and incremental invalidation.
@references/layer2-behavioral.md: when to use TLA+/Apalache versus Z3Py, plus spec execution contracts.
@references/layer3-elegance.md: metric families, grading, thresholds, and the Layer 3 sub-module layout.
@references/constraint-yaml-spec.md: structural rule schema, selectors, assertions, and fact pattern operators.
@references/bootstrap-process.md: the install, discover, interview, validate bootstrap workflow.
@references/agent-feedback-loop.md: hook integration, violation injection, retries, and escalation policy.
@references/spec-authoring-guide.md: translating plain-English architectural intent into formal specs.

Maintainer

petekp Core maintainer

Source details

Full Name: petekp/claude-code-setup
Branch: main
Path in repo: skills/formal-verify
License: MIT License
Topics: ai anthropic claude skills agents claudecode llm
