Agent skill
pipeline
End-to-end source processing -- seed, reduce, process all claims through reflect/reweave/verify, archive. The full pipeline in one command. Triggers on "/pipeline", "/pipeline [file]", "process this end to end", "full pipeline".
Install this agent skill to your Project
npx add-skill https://github.com/agenticnotetaking/arscontexta/tree/main/skill-sources/pipeline
SKILL.md
EXECUTE NOW
Target: $ARGUMENTS
Parse immediately:
- Source file path: the file to process (required)
--handoff: output RALPH HANDOFF block at end (for chaining)- If target is empty: list files in {DOMAIN:inbox}/ and ask which to process
Step 0: Read Vocabulary
Read ops/derivation-manifest.md (or fall back to ops/derivation.md) for domain vocabulary mapping. All output must use domain-native terms. If neither file exists, use universal terms.
START NOW. Run the full pipeline.
Pipeline Overview
The pipeline chains four phases. Each phase uses skill invocation or /ralph for subagent-based processing. State lives in the queue file — the pipeline is stateless orchestration on top of stateful queue entries.
Source file
|
v
Phase 1: /seed — create extract task, move source to archive
|
v
Phase 2: /reduce (via /ralph) — extract claims from source
|
v
Phase 3: /ralph (all claims) — create -> reflect -> reweave -> verify
|
v
Phase 4: /archive-batch — move task files, generate summary
|
v
Complete
The pipeline is the convenience wrapper. /ralph is the engine. /seed is the entry point.
Phase 1: Seed
Invoke /seed on the target file to create the extract task, check for duplicates, and move the source to its archive folder.
How to invoke:
Use the Skill tool if available, otherwise execute the /seed workflow directly:
- Validate source exists
- Check for prior processing (duplicate detection)
- Create archive folder
- Move source from {DOMAIN:inbox} to archive
- Create extract task file
- Add extract task to queue
Capture from seed output:
- Batch ID: the source basename (used for --batch filtering in subsequent steps)
- Archive folder path: where the source was moved
- next_claim_start: the claim numbering start
Report: $ Seeded: {source-name}
If seed reports the file was already processed: Ask the user whether to proceed or skip. Do NOT auto-skip — the user may want to re-process with different scope.
Phase 2: Extract (Reduce)
Process the extract task via /ralph. This spawns a subagent that runs /reduce, extracting claims from the source and creating task entries in the queue.
How to invoke:
/ralph 1 --batch {batch_id} --type extract
Or via Task tool:
Task(
prompt = "Run /ralph 1 --batch {batch_id} --type extract",
description = "extract: {batch_id}"
)
After completion, read the queue to count extracted claims and enrichments:
Check how many pending tasks exist for this batch. The reduce phase creates 1 queue entry per claim and 1 per enrichment.
Report:
$ Extracted: {N} {DOMAIN:note_plural}, {M} enrichments
Processing {total_tasks} tasks through the pipeline...
If zero claims extracted: Report the issue. For TFT sources, zero extraction is a bug — the source almost certainly contains extractable content. Ask the user whether to retry with different scope or skip.
Phase 3: Process All Claims
Count total pending tasks for this batch from the queue. Then process all of them through the full phase sequence.
How to invoke:
/ralph {remaining_count} --batch {batch_id}
Or via Task tool:
Task(
prompt = "Run /ralph {remaining_count} --batch {batch_id}",
description = "process: {batch_id} ({remaining_count} tasks)"
)
This processes every claim through: create -> reflect -> reweave -> verify. And every enrichment through: enrich -> reflect -> reweave -> verify.
Each phase runs in an isolated subagent with fresh context. /ralph handles all the orchestration: subagent spawning, handoff parsing, queue advancement, learnings capture.
Progress reporting:
The /ralph invocation reports progress per task. The pipeline relays this:
$ Processing {DOMAIN:note} 1/{total}: {title}
$ create... done
$ reflect... done (3 connections found)
$ reweave... done (2 {DOMAIN:note_plural} updated)
$ verify... done (PASS)
For large batches (20+ claims): /ralph handles context isolation automatically via subagents. The pipeline does NOT need to chunk — /ralph processes N tasks sequentially with fresh context per phase.
Phase 4: Verify Completion
After /ralph finishes, verify all tasks for this batch are done.
Check the queue: count tasks for this batch that are NOT done.
If tasks remain pending:
- Report which tasks are incomplete and at which phase
- Show the specific task IDs and their current_phase
- Suggest: "Run
/ralph --batch {batch_id}to continue from where it stopped" - Do NOT proceed to archive
If all tasks are done: Proceed to Phase 5.
Phase 5: Archive Batch
When all tasks for the batch are complete, archive the batch.
How to invoke:
/archive-batch {batch_id}
Or execute directly:
- Move all task files from
ops/queue/toops/queue/archive/{date}-{batch_id}/ - Generate a batch summary file:
{batch_id}-summary.md - Remove completed entries from the queue (or mark as archived)
The summary should include:
- Source file name and original location
- Number of claims extracted
- Number of enrichments
- List of created {DOMAIN:note_plural} with titles
- Any notable learnings from the batch
Phase 6: Final Report
--=={ pipeline }==--
Source: {source_file}
Batch: {batch_id}
Extraction:
{DOMAIN:note_plural} extracted: {N}
Enrichments identified: {M}
Processing:
{DOMAIN:note_plural} created: {N}
Existing {DOMAIN:note_plural} enriched: {M}
Connections added: {C}
{DOMAIN:topic map}s updated: {T}
Older {DOMAIN:note_plural} updated via reweave: {R}
Quality:
All verify checks: {PASS/FAIL count}
Archive: ops/queue/archive/{date}-{batch_id}/
Summary: {batch_id}-summary.md
{DOMAIN:note_plural} created:
- [[claim title 1]]
- [[claim title 2]]
- ...
If --handoff flag was set, also output:
=== RALPH HANDOFF: pipeline ===
Target: {source_file}
Work Done:
- Seeded source: {batch_id}
- Extracted {N} {DOMAIN:note_plural} and {M} enrichments
- Processed all claims through 4-phase pipeline
- Archived batch to {archive_path}
Files Modified:
- {DOMAIN:notes}/ ({N} new {DOMAIN:note_plural})
- ops/queue/archive/{date}-{batch_id}/ (archived)
Learnings:
- [Friction]: {description} | NONE
- [Surprise]: {description} | NONE
- [Methodology]: {description} | NONE
- [Process gap]: {description} | NONE
Queue Updates:
- All tasks for batch {batch_id} marked done and archived
=== END HANDOFF ===
Error Handling
Phase failure at any stage:
- Report the failure with context (which phase, which task, what error)
- Show the current queue state for this batch
- Suggest remediation: "Run
/ralph --batch {batch_id}to continue from where it stopped" - Do NOT attempt to continue automatically past failures
The pipeline is resumable. Queue state persists across sessions:
- /seed detects prior processing and asks whether to proceed
- /ralph picks up from the last completed phase (queue is the source of truth)
- /archive-batch verifies completeness before archiving
Seed failure: If /seed fails (file not found, duplicate detected and user declines), stop the pipeline entirely.
Extract failure: If /reduce extracts zero claims, report and stop. Do not proceed to an empty processing phase.
Processing failure: If /ralph fails mid-batch, the queue preserves state. Individual claims resume from their failed phase on next /ralph invocation.
Archive failure: If archiving fails, the claims are still created and connected. Only the organizational cleanup is missing — re-run /archive-batch manually.
Resumability
The pipeline is designed to be interrupted and resumed at any point:
| Interrupted At | How to Resume |
|---|---|
| Before seed | Run /pipeline again (starts fresh) |
| After seed, before reduce | /ralph 1 --batch {id} --type extract |
| After reduce, during claims | /ralph --batch {id} (picks up from failed phase) |
| After all claims, before archive | /archive-batch {id} |
State lives in the queue file. The pipeline reads queue state, not session state. This means you can interrupt, close the session, and resume later.
Edge Cases
No target file: List {DOMAIN:inbox}/ candidates, suggest the best one based on age and relevance.
Source already seeded: /seed detects this and asks the user. If they decline, the pipeline stops cleanly.
Large source (2500+ lines): /reduce handles chunking automatically. The pipeline does not need special handling.
No ops/derivation-manifest.md: Use universal vocabulary for all output.
Critical Constraints
never:
- Skip the seed phase (duplicate detection is important)
- Continue past a failed phase automatically
- Process claims inline instead of via /ralph subagents
- Archive a batch with incomplete tasks
always:
- Report progress at each phase boundary
- Verify all tasks are done before archiving
- Show the user what was created (list of {DOMAIN:note_plural})
- Suggest next steps if interrupted
- Use domain-native vocabulary from derivation manifest
Recommended Agent Skills
Expand your agent's capabilities with these related and highly-rated skills.
rethink
Challenge system assumptions against accumulated evidence. Triages observations and tensions, detects patterns, generates proposals. The scientific method applied to knowledge systems. Triggers on "/rethink", "review observations", "challenge assumptions", "what have I learned".
stats
Show vault statistics and knowledge graph metrics. Provides a shareable snapshot of vault health, growth, and progress. Triggers on "/stats", "vault stats", "show metrics", "how big is my vault".
graph
Interactive knowledge graph analysis. Routes natural language questions to graph scripts, interprets results in domain vocabulary, and suggests concrete actions. Triggers on "/graph", "/graph health", "/graph triangles", "find synthesis opportunities", "graph analysis".
reduce
Extract structured knowledge from source material. Comprehensive extraction is the default — every insight that serves the domain gets extracted. For domain-relevant sources, skip rate must be below 10%. Zero extraction from a domain-relevant source is a BUG. Triggers on "/reduce", "/reduce [file]", "extract insights", "mine this", "process this".
learn
Research a topic and grow your knowledge graph. Uses Exa deep researcher, web search, or basic search to investigate topics, files results with full provenance, and chains to processing pipeline. Triggers on "/learn", "/learn [topic]", "research this", "find out about".
reweave
Update old notes with new connections. The backward pass that /reflect doesn't do. Revisit existing notes that predate newer related content, add connections, sharpen claims, consider splits. Triggers on "/reweave", "/reweave [note]", "update old notes", "backward connections", "revisit notes".
Didn't find tool you were looking for?