smithers
Build multi-phase AI development pipelines with the Smithers workflow engine (v0.8.2). Use when: (1) initializing a new Smithers project in a target directory (use the init CLI); (2) adding phases or steps to existing workflows; (3) implementing review loops, pass tracking, or phase gating; (4) debugging workflow orchestration issues (Ralph loops, ctx.output, data threading); (5) writing instruction MDX files for project configs; (6) configuring agents or per-role overrides.
Install this agent skill to your Project
npx add-skill https://github.com/enitrat/skill-issue/tree/main/plugins/personal-skills/skills/smithers
SKILL.md
Smithers Workflow Engine
TypeScript framework for deterministic, resumable AI workflows using JSX. Runtime: Bun >= 1.3. State: SQLite via Drizzle ORM. Validation: Zod schemas. Version: 0.8.2.
Init CLI
uv run /path/to/skills/smithers/scripts/skill_smithers.py <target-dir>
uv run scripts/skill_smithers.py ./scripts/my-workflow --name my-workflow --no-install
Copies the full template into <target-dir>, runs bun install, and initializes git + jj (skips each if already present). Requires jj on PATH: brew install jj.
Project Config Setup
The workflow engine (<target-dir>/) is project-agnostic — never edit it per project. All domain-specific content lives in a separate config directory:
projects/<my-project>/smithers-config/
  config.ts          # ProjectConfig (name, phases, cwd, instructions, agents)
  instructions/      # MDX files injected into component prompts
    research.mdx, plan.mdx, implement.mdx, test.mdx,
    code-review.mdx, prd-review.mdx, review-fix.mdx,
    final-review.mdx, update-progress.mdx
  output/
    context/         # agents write context docs here (create empty)
    plans/           # agents write plan docs here (create empty)
  run.sh             # launcher
config.ts must satisfy the ProjectConfig interface (see <target-dir>/types/project.ts). Key fields:
- `phases`: ordered list of `{ id, name, description, metadata? }`; the outer loop iterates over these
- `cwd`: absolute path to the repo root where agents run
- `instructions`: map of step name → imported MDX function (e.g. `research: ResearchInstructions`)
- `agents.systemPromptContent`: project context injected into every agent's system prompt
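As a rough illustration of the fields listed above, here is a minimal sketch of what a `config.ts` might contain. The `ProjectConfig`, `Phase`, and `InstructionFn` types below are assumptions reconstructed from the field descriptions, not the real interface in `<target-dir>/types/project.ts`, and the phase names, paths, and prompt text are made up. How the engine expects the config to be exported is not shown here.

```typescript
// Assumed shape only: the authoritative interface lives in <target-dir>/types/project.ts.
interface Phase {
  id: string;
  name: string;
  description: string;
  metadata?: Record<string, unknown>;
}

// Hypothetical stand-in for the type of an imported MDX function.
type InstructionFn = (props: Record<string, unknown>) => string;

interface ProjectConfig {
  name: string;
  phases: Phase[];
  cwd: string;
  instructions: Record<string, InstructionFn>;
  agents: { systemPromptContent: string };
}

// In a real config this would be imported from ./instructions/research.mdx.
const ResearchInstructions: InstructionFn = () =>
  "Read the spec before proposing changes.";

const config: ProjectConfig = {
  name: "my-project",
  // Ordered: the outer loop walks these phases in sequence.
  phases: [
    {
      id: "parser",
      name: "Parser",
      description: "Port the parser",
      metadata: { specPath: "docs/parser.md" },
    },
    { id: "codegen", name: "Codegen", description: "Port the code generator" },
  ],
  // Absolute path to the repo root where agents run (placeholder value).
  cwd: "/abs/path/to/repo",
  instructions: { research: ResearchInstructions },
  agents: { systemPromptContent: "This repo is a compiler written in TypeScript." },
};
```

Keeping per-phase details such as `specPath` in `metadata` lets the instruction files stay generic across phases.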
instructions/*.mdx are additive — they fill the {props.projectInstructions} slot in each generic component prompt. Write only what's domain-specific (spec references, file paths, rules). Phase metadata fields are available as {props.fieldName}.
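To make the "additive slot" idea concrete, the sketch below models it with plain strings. This is only an analogy: real instruction files are MDX that compile to components, and `fillSlots`, `phaseName`, and `specPath` are hypothetical names chosen for illustration.

```typescript
// Replaces each {props.name} placeholder with the matching value.
// Placeholders with no matching prop are left untouched.
function fillSlots(template: string, props: Record<string, string>): string {
  return template.replace(/\{props\.(\w+)\}/g, (match: string, name: string) =>
    Object.prototype.hasOwnProperty.call(props, name) ? (props[name] as string) : match,
  );
}

// Generic component prompt owned by the engine (wording is made up).
const genericPrompt =
  "Research the task for phase {props.phaseName}.\n{props.projectInstructions}";

// Project-specific instructions only add domain detail, using phase metadata.
const projectInstructions = fillSlots("Follow the spec at {props.specPath}.", {
  specPath: "docs/parser.md",
});

const prompt = fillSlots(genericPrompt, { phaseName: "Parser", projectInstructions });
```

The generic prompt stays untouched; the project only supplies the text that lands in the `projectInstructions` slot.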
run.sh sets SMITHERS_PROJECT and invokes the engine:
#!/usr/bin/env bash
SCRIPT_DIR="$(cd "$(dirname "${BASH_SOURCE[0]}")" && pwd)"
cd "<target-dir>"
export SMITHERS_PROJECT="$SCRIPT_DIR/config.js"
bunx smithers run workflow.tsx
Run with: ./projects/<my-project>/smithers-config/run.sh
Core Execution Model
Render-schedule-execute loop:
- Render — JSX tree → available tasks
- Schedule — determine runnable tasks (dependencies + concurrency)
- Execute — agent runs, output validates against Zod, persists to SQLite
- Re-render — tree updates with new outputs, unblocking dependents
- Repeat until done/failed
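The loop above can be modelled in a few lines of plain TypeScript. This is a simplified sketch, not Smithers code: tasks are plain objects instead of JSX, an in-memory `Map` stands in for SQLite, Zod validation is omitted, and every name (`TaskNode`, `render`, `schedule`, `runWorkflow`) is hypothetical.

```typescript
interface TaskNode {
  id: string;
  dependsOn: string[];
  // Stand-in for an agent call; returns the task's output.
  run: (inputs: Record<string, unknown>) => unknown;
}

// Hypothetical in-memory stand-in for the persisted output store.
type Store = Map<string, unknown>;

// Render: derive the tasks that still need to run from the persisted outputs.
function render(tasks: TaskNode[], store: Store): TaskNode[] {
  return tasks.filter((t) => !store.has(t.id));
}

// Schedule: keep tasks whose dependencies are met, up to the concurrency cap.
function schedule(pending: TaskNode[], store: Store, maxConcurrency: number): TaskNode[] {
  return pending
    .filter((t) => t.dependsOn.every((d) => store.has(d)))
    .slice(0, maxConcurrency);
}

function runWorkflow(tasks: TaskNode[], maxConcurrency = 2): { order: string[]; store: Store } {
  const store: Store = new Map();
  const order: string[] = [];
  for (;;) {
    const pending = render(tasks, store);
    if (pending.length === 0) break; // nothing left to run: done
    const runnable = schedule(pending, store, maxConcurrency);
    if (runnable.length === 0) throw new Error("deadlock: unmet dependencies");
    for (const task of runnable) {
      const inputs: Record<string, unknown> = {};
      for (const dep of task.dependsOn) inputs[dep] = store.get(dep);
      store.set(task.id, task.run(inputs)); // Execute, then persist the output
      order.push(task.id);
    }
    // Looping back re-renders with the new outputs, unblocking dependents.
  }
  return { order, store };
}

const result = runWorkflow([
  { id: "plan", dependsOn: ["research"], run: (i) => `plan from ${String(i["research"])}` },
  { id: "research", dependsOn: [], run: () => "notes" },
  { id: "implement", dependsOn: ["plan"], run: (i) => `code for ${String(i["plan"])}` },
]);
```

Because each pass recomputes the pending set from stored outputs, restarting the loop with a pre-populated store would skip finished tasks, which is the intuition behind resumability.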
JSX Primitives
| Component | Purpose |
|---|---|
| `<Workflow>` | Root container |
| `<Task>` | Single agent work unit with schema validation |
| `<Sequence>` | Sequential execution |
| `<Parallel>` | Concurrent execution with `maxConcurrency` |
| `<Ralph>` | Loop with `until` condition + `maxIterations` |
| `<Branch>` | Conditional routing |
| `<Worktree>` | Run tasks in an isolated git/jj worktree (auto-created if missing). Optional `branch` prop creates/resets a named branch in the worktree, making restarts idempotent. |
Task Props
| Prop | Type | Purpose |
|---|---|---|
| `id` | `string` | Unique node ID (phase-prefix for multi-phase loops) |
| `output` | `ZodObject` | Required. Pass `outputs.xxx` from `createSmithers`. Enables schema validation, auto-retry, and JSON instructions. Never pass a string; that API was removed in v0.7.1. |
| `agent` | `AgentLike \| AgentLike[]` | Agent or array of agents `[primary, fallback1, fallback2, ...]`. Tries in order: attempt 1 uses `agents[0]`, attempt 2 uses `agents[1]`, etc. (capped at last). Replaces the old `fallbackAgent` prop. |
| `retries` | `number` | Retry budget on failure |
| `timeoutMs` | `number` | Per-attempt timeout in milliseconds |
| `skipIf` | `boolean` | Skip execution when true |
| `continueOnFail` | `boolean` | Don't block siblings when this task fails |
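The agent-array rule ("attempt 1 uses `agents[0]`, attempt 2 uses `agents[1]`, capped at last") amounts to a clamped index. The helper below is a hypothetical sketch of that selection rule, not the engine's actual code, and the agent names are placeholder strings.

```typescript
// Returns the agent to use for a 1-based attempt number.
// Attempts beyond the end of the list keep using the last agent.
function pickAgent<T>(agents: readonly T[], attempt: number): T {
  if (agents.length === 0) throw new Error("at least one agent is required");
  const index = Math.min(Math.max(attempt, 1) - 1, agents.length - 1);
  return agents[index] as T;
}

const agents = ["primary", "fallback"] as const;
const chosen = [1, 2, 3, 4].map((attempt) => pickAgent(agents, attempt));
// chosen is ["primary", "fallback", "fallback", "fallback"]
```

With two agents, only the first attempt goes to the primary; every retry after that stays on the fallback.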
Available Agents
| Agent | Import | CLI Required | Notes |
|---|---|---|---|
| `ClaudeCodeAgent` | `smithers-orchestrator` | `claude` | Default reviewer/researcher |
| `CodexAgent` | `smithers-orchestrator` | `codex` | Default implementer |
| `GeminiAgent` | `smithers-orchestrator` | `gemini` | Google Gemini CLI; `json` output format by default (v0.8.2, changed from `text`) |
| `KimiAgent` | `smithers-orchestrator` | `kimi` | Kimi CLI; `thinking=true` and `text` output by default (v0.8.2, reverted from `stream-json`); `--final-message-only` auto-enabled; parallel runs use isolated share dirs; supports `agent: "okabe"`, MCP configs |
| `AmpAgent` | `smithers-orchestrator` | `amp` | Amp CLI; supports `thread` (continue existing thread), `visibility`, `mcpConfig`, `dangerouslyAllowAll` |
| `PiAgent` | `smithers-orchestrator` | `pi` | Pi CLI |
Key Rules
- Always pass `output={outputs.xxx}` to `<Task>`: this enables schema validation, auto-retry, and auto-appended JSON output instructions. Pass the ZodObject from `createSmithers`'s `outputs` return; string keys were removed in v0.7.1.
- Use `ctx.outputMaybe()`, not `ctx.output()`: it gracefully handles missing outputs during first render.
- Use `ctx.latest()` for cross-iteration decisions: `outputMaybe` is scoped to the current Ralph iteration; use `ctx.latest(table, nodeId)` for `skipIf`, loop `until`, and `allPhasesComplete` checks.
- Use `.nullable()`, never `.optional()`, in Zod schemas: OpenAI structured outputs rejects `.optional()` fields.
- Phase-prefix nodeIds when multiple phases share a loop: `${phaseId}:step-name`.
- Set `continueOnFail` on review Tasks: one reviewer failing shouldn't block the other.
- Use agent arrays for rate-limit resilience: set `agent={[primary, fallback]}` on heavy tasks so retry attempts switch to a different model automatically. The old `fallbackAgent` prop was removed in v0.8.0.
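The difference between an iteration-scoped lookup and a cross-iteration lookup, combined with phase-prefixed node IDs, can be modelled with a flat list of output rows. This is a sketch under assumptions: `OutputRow`, `lookupInIteration`, and `lookupLatest` are hypothetical helpers that only mimic the scoping described above, not the signatures of `ctx.outputMaybe` or `ctx.latest`.

```typescript
interface OutputRow<T> {
  nodeId: string;
  iteration: number;
  value: T;
}

// Phase-prefixed node ID, following the `${phaseId}:step-name` convention.
const nodeId = (phaseId: string, step: string): string => `${phaseId}:${step}`;

// Iteration-scoped lookup: only sees rows written in the given iteration.
function lookupInIteration<T>(rows: OutputRow<T>[], id: string, iteration: number): T | undefined {
  return rows.find((r) => r.nodeId === id && r.iteration === iteration)?.value;
}

// Cross-iteration lookup: returns the value from the highest iteration on record.
function lookupLatest<T>(rows: OutputRow<T>[], id: string): T | undefined {
  let best: OutputRow<T> | undefined;
  for (const r of rows) {
    if (r.nodeId === id && (best === undefined || r.iteration > best.iteration)) best = r;
  }
  return best?.value;
}

const rows: OutputRow<{ passed: boolean }>[] = [
  { nodeId: nodeId("parser", "final-review"), iteration: 0, value: { passed: false } },
  { nodeId: nodeId("parser", "final-review"), iteration: 1, value: { passed: true } },
  { nodeId: nodeId("codegen", "final-review"), iteration: 1, value: { passed: false } },
];

// In iteration 2 nothing has been written yet, so the scoped lookup finds nothing...
const current = lookupInIteration(rows, nodeId("parser", "final-review"), 2);
// ...while the latest recorded result still says the parser phase passed.
const skipParser = lookupLatest(rows, nodeId("parser", "final-review"))?.passed ?? false;
const allPhasesComplete = ["parser", "codegen"].every(
  (p) => lookupLatest(rows, nodeId(p, "final-review"))?.passed === true,
);
```

Without the phase prefix, both phases would write to the same `final-review` ID and the parser's pass would be indistinguishable from the codegen failure.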
Workflow Philosophy
Atomic unit pipeline: run the full pipeline (implement → test → review → fix → final-review) for each logically complete unit of work. Do NOT implement everything then review once.
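The per-unit pipeline can be sketched as a nested loop: each unit goes through every step before the next unit starts, and an outer loop revisits units that have not yet cleared final review. All names here (`Unit`, `runOuterLoop`, the `finalReview` callback) are hypothetical; the real workflow expresses this with JSX components rather than plain loops.

```typescript
type Step = "implement" | "test" | "review" | "fix" | "final-review";
const PIPELINE: Step[] = ["implement", "test", "review", "fix", "final-review"];

interface Unit {
  id: string;
  passed: boolean;
}

// finalReview stands in for the gate decision an agent would return.
function runOuterLoop(
  units: Unit[],
  finalReview: (unit: Unit, iteration: number) => boolean,
  maxIterations: number,
): string[] {
  const log: string[] = [];
  for (let iteration = 0; iteration < maxIterations; iteration++) {
    for (const unit of units) {
      if (unit.passed) continue; // already cleared the gate, skip it
      // Run the whole pipeline for this unit before touching the next one.
      for (const step of PIPELINE) log.push(`${iteration}:${unit.id}:${step}`);
      unit.passed = finalReview(unit, iteration);
    }
    if (units.every((u) => u.passed)) break;
  }
  return log;
}

const units: Unit[] = [
  { id: "operators", passed: false },
  { id: "tests", passed: false },
];
// Made-up gate: "operators" passes immediately, "tests" needs a second pass.
const log = runOuterLoop(units, (u, i) => u.id === "operators" || i >= 1, 5);
```

In this run the log holds 15 entries: both units in iteration 0, then only `tests` again in iteration 1.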
"Atomic" = one logical concern, not smallest diff: 20 similar operators = ONE unit. Porting 14 similar test cases = ONE unit. The outer Ralph loop iterates until all phases pass the FinalReview gate.
Resources
- references/atomic-workflow.md — logical unit pipeline, nextLogicalUnit chaining, batching heuristics
- references/patterns.md — outer Ralph, gating, data threading, pass tracking, dual-model review
- references/troubleshooting.md — dirty git, OpenAI schema errors, stale runs, SQLite debugging