Agent skill

rlm-batch

Stars 107

Forks 15

Install this agent skill to your Project

npx add-skill https://github.com/jmagly/aiwg/tree/main/agentic/code/addons/rlm/skills/rlm-batch

SKILL.md

RLM Batch Processing

You are the RLM Batch Orchestrator - executing parallel fan-out processing where multiple sub-agents work on separate chunks of context simultaneously.

Core Philosophy

"Divide and conquer at scale" - when a task requires processing many similar items (files, modules, documents), spawn parallel sub-agents rather than sequentially processing in a single context window.

Your Role

You manage parallel batch execution:

Parse glob pattern and sub-prompt
Match files against pattern
Estimate cost and prompt for confirmation
Spawn sub-agents in parallel (respecting max-parallel limit)
Collect results from all sub-agents
Aggregate results according to strategy
Report final aggregated output

Natural Language Triggers

Users may say:

"batch process all files in src/ with: [sub-prompt]"
"run [sub-prompt] on every file in [pattern]"
"parallel process [pattern] to [sub-prompt]"
"fan out [sub-prompt] across [pattern]"
"rlm batch [pattern] [prompt]"

Parameters

Glob Pattern (required)

The file selection pattern. Uses standard glob syntax.

Examples:

src/**/*.ts - All TypeScript files in src/
test/unit/**/*.test.js - All unit tests
.aiwg/requirements/**/*.md - All requirement docs
**/*.{js,ts} - All JS and TS files recursively

Sub-Prompt (required)

The prompt applied to each matched file independently.

Best practices:

Keep prompts focused and single-purpose
Reference the file with {file} placeholder
Specify exact output format
Make output deterministic (no random creativity)

Good examples:

"Extract all exported function names from {file}"
"Count TODO comments in {file} and return as JSON: {count: N}"
"Check if {file} has JSDoc comments for all exports. Return: yes/no"

Poor examples (avoid these):

"Analyze {file}" (too vague)
"Improve {file}" (subjective, non-deterministic)
"Write a comprehensive report about {file}" (unbounded output)

--model (default: sonnet)

Which model to use for sub-agents.

Options:

opus - Most capable, highest cost (use for complex analysis)
sonnet - Balanced performance and cost (default)
haiku - Fast and cheap (use for simple extraction tasks)

Cost considerations:

haiku:  ~$0.25 per 1M input tokens
sonnet: ~$3.00 per 1M input tokens
opus:   ~$15.00 per 1M input tokens

For 100 files @ 1k tokens each:

haiku: ~$0.025
sonnet: ~$0.30
opus: ~$1.50

--output-dir (default: .aiwg/rlm/batch-{timestamp}/)

Where to save individual sub-agent results.

Each sub-agent creates a file named after its input file:

.aiwg/rlm/batch-2026-02-09-1030/
├── src-auth-login.ts.result.md
├── src-auth-logout.ts.result.md
├── src-auth-refresh.ts.result.md
└── aggregate.md

--aggregate (default: concat)

How to combine sub-agent results.

Strategies:

concat (default)

Concatenate all results in order.

Use when: Results are independent and order matters (e.g., list of findings).

Output format:

markdown

# Batch Results

## File: src/auth/login.ts
{result from sub-agent 1}

## File: src/auth/logout.ts
{result from sub-agent 2}

...

merge

Deduplicate and merge structured results.

Use when: Results contain lists or key-value data with potential duplicates.

Output format:

markdown

# Merged Results

Unique items across all sub-agents:
- {item1}
- {item2}
- {item3}

(Duplicates removed, sorted alphabetically)

Requirements:

Sub-prompt MUST produce structured output (JSON, YAML, or Markdown lists)
Deduplication based on exact string match

summarize

Use a final summarization agent to condense all results.

Use when: Individual results are verbose and need high-level synthesis.

Process:

Collect all sub-agent results

Spawn summarization agent with prompt:

Summarize the following batch processing results into a concise report:

{all results}

Focus on:
- Key patterns across files
- Common issues or findings
- Quantitative summary (counts, percentages)
- Actionable recommendations

Return summarized report

Cost note: Adds one additional LLM call with full context of all results.

--max-parallel (default: 10)

Maximum number of sub-agents running concurrently.

Guidelines:

Start with 10 for typical batches (<100 files)
Increase to 20-50 for large batches (100-1000 files)
Keep under 100 to avoid rate limiting

Rate limits:

Claude API: 50 requests/minute
OpenAI API: 60 requests/minute

System resource limits:

Each sub-agent uses ~100MB RAM
50 parallel = ~5GB RAM usage
Adjust based on available system memory

Execution Flow

Phase 1: Initialization

Parse glob pattern and sub-prompt
Resolve glob pattern to file list:
bash
```
find . -path "{pattern}" -type f
```
Count matched files

Estimate cost:

Estimated tokens per file: {avg_file_size}
Total files: {count}
Model: {model}

Estimated cost: ${cost}
Input tokens: {count * avg_size}
Output tokens: {estimated based on prompt}

Prompt for confirmation (if cost > $1.00)
Create output directory
Log batch initialization

Communicate:

RLM Batch Initialized
Pattern: {pattern}
Files matched: {count}
Sub-prompt: {prompt}
Model: {model}
Max parallel: {max}
Aggregate: {strategy}

Estimated cost: ${cost}

Proceed? (y/n)

Phase 2: Spawn Sub-Agents

Initialize work queue with all matched files

Spawn initial batch of sub-agents (up to max-parallel):

For each file in work queue (limit: max-parallel):
  - Create sub-agent with:
    - System prompt: "You are processing {file}. Apply this prompt: {sub-prompt}"
    - Context: File contents
    - Output file: {output-dir}/{sanitized-filename}.result.md
  - Track sub-agent in active set

As sub-agents complete:
- Remove from active set
- Add to completed set
- If work queue not empty, spawn next sub-agent
Continue until all files processed

Progress tracking:

─────────────────────────────────────────
Batch Processing: {completed}/{total}
─────────────────────────────────────────

Active ({active_count}/{max_parallel}):
  - src/auth/login.ts (processing...)
  - src/auth/logout.ts (processing...)

Completed: {completed_count}
Remaining: {remaining_count}

Estimated time remaining: {estimate}

Phase 3: Collect Results

Wait for all sub-agents to complete
Check for errors:
- If any sub-agent failed, log error and continue
- Failed files are noted in final report
Collect all result files from output directory
Validate results:
- Check each result file exists and is non-empty
- Flag any anomalies (empty results, errors, truncated output)

Phase 4: Aggregate Results

Apply aggregation strategy:

For concat strategy:

bash

# Concatenate all results with file headers
cat > aggregate.md <<EOF
# Batch Processing Results

Pattern: {pattern}
Files processed: {count}
Timestamp: {timestamp}

---

EOF

for result in results/*.result.md; do
  file=$(basename "$result" .result.md)
  echo "## File: $file" >> aggregate.md
  echo "" >> aggregate.md
  cat "$result" >> aggregate.md
  echo "" >> aggregate.md
  echo "---" >> aggregate.md
  echo "" >> aggregate.md
done

For merge strategy:

Parse each result file as structured data
Extract unique items across all files
Sort and deduplicate
Format as aggregated list

For summarize strategy:

Concatenate all results
Spawn summarization agent with full context
Apply summarization prompt
Save summary as aggregate.md

Phase 5: Completion Report

Generate final report:

markdown

# RLM Batch Completion Report

**Pattern**: {glob pattern}
**Sub-Prompt**: {prompt}
**Status**: {SUCCESS | PARTIAL | FAILED}
**Files Processed**: {count}
**Duration**: {time}
**Model**: {model}
**Aggregate Strategy**: {strategy}

## Summary

{High-level summary of what was accomplished}

## Statistics

- Total files matched: {total}
- Successfully processed: {success_count}
- Failed: {failed_count}
- Total tokens used: {total_tokens}
- Total cost: ${total_cost}

## Failed Files

{List any files that failed processing with error reasons}

## Output Location

Results: {output-dir}/
Aggregate: {output-dir}/aggregate.md

## Next Steps

{Suggested follow-up actions based on results}

Save to: .aiwg/rlm/batch-{timestamp}-report.md

Error Handling

No Files Matched

RLM Batch: No files matched pattern

Pattern: {pattern}

Please check:
1. Pattern syntax is correct
2. Files exist in expected location
3. Working directory is correct

Examples:
  - src/**/*.ts (all TypeScript files)
  - test/**/*.test.js (all test files)
  - **/*.{js,ts} (all JS and TS files)

Sub-Agent Failure

Sub-agent failed processing {file}

Error: {error message}

This file will be skipped. Batch will continue with remaining files.

Failed files are noted in the completion report.

Rate Limit Exceeded

Rate limit exceeded. Pausing batch processing...

Completed: {completed}/{total}
Waiting 60 seconds before resuming...

Out of Memory

System memory limit reached. Reducing parallelism...

Original max-parallel: {max}
Adjusted max-parallel: {new_max}

Continuing with reduced parallelism...

Cost Limit Exceeded

Estimated cost (${estimate}) exceeds safety threshold (${limit})

Options:
1. Proceed anyway: /rlm-batch {args} --force
2. Reduce scope: Use more specific glob pattern
3. Use cheaper model: --model haiku
4. Cancel: Ctrl+C

User Communication

At start:

Starting RLM Batch Processing

Pattern: {pattern}
Files: {count}
Sub-prompt: {prompt}
Model: {model}
Max parallel: {max}
Aggregate: {strategy}

Estimated cost: ${cost}
Estimated time: {time}

Beginning processing...

During processing:

─────────────────────────────────────────
Batch Progress: {completed}/{total}
─────────────────────────────────────────

Completed: {list of recently completed files}
Active: {count} sub-agents running
Remaining: {count} files in queue

ETA: {time}

On completion:

═══════════════════════════════════════════
RLM Batch: SUCCESS
═══════════════════════════════════════════

Pattern: {pattern}
Files processed: {count}
Duration: {time}
Total cost: ${cost}

Results: {output-dir}/aggregate.md
Report: .aiwg/rlm/batch-{timestamp}-report.md

Summary:
{High-level summary of findings}
═══════════════════════════════════════════

Success Criteria for This Command

This orchestration succeeds when:

All matched files processed (or failures documented)
Results saved to output directory
Results aggregated according to strategy
Completion report generated
User informed of outcome and cost

Examples

Example 1: Simple Extraction (Haiku)

Task: Extract all exported function names from TypeScript files

bash

/rlm-batch "src/**/*.ts" "List all exported function names in {file}. Return as JSON: {\"functions\": [\"name1\", \"name2\"]}" --model haiku --aggregate merge

Expected behavior:

Matches all .ts files in src/
Uses haiku for speed and low cost
Each sub-agent extracts function names from one file
Merge strategy deduplicates function names across all files
Final output: Combined list of unique function names

Cost estimate: ~$0.025 for 100 files

Output:

markdown

# Merged Results

Unique exported functions across all files:
- authenticateUser
- calculateTotal
- fetchData
- formatDate
- generateToken
- hashPassword
- parseInput
- validateEmail
- validatePassword

(62 total functions, 9 unique after deduplication)

Example 2: Moderate Complexity (Sonnet)

Task: Analyze each module for potential security issues

bash

/rlm-batch "src/**/*.ts" "Analyze {file} for these security concerns: 1) SQL injection risks 2) XSS vulnerabilities 3) Authentication bypass 4) Sensitive data exposure. Return findings as Markdown list with severity (critical/high/medium/low)." --model sonnet --aggregate concat

Expected behavior:

Matches all TypeScript files
Uses sonnet for better analysis capability
Each sub-agent performs security analysis on one file
Concat strategy preserves per-file findings
Final output: Security report for each file

Cost estimate: ~$0.30 for 100 files

Output:

markdown

# Batch Processing Results

## File: src/auth/login.ts

### Security Findings

- **[HIGH]** SQL injection risk at line 42: User input concatenated into query
- **[MEDIUM]** Password comparison not using constant-time algorithm (line 58)

## File: src/auth/register.ts

### Security Findings

- **[CRITICAL]** Password stored in plaintext in logs (line 89)
- **[HIGH]** Email validation regex vulnerable to ReDoS attack (line 34)

## File: src/utils/sanitize.ts

### Security Findings

No issues found.

---

Total files analyzed: 100
Critical issues: 1
High issues: 15
Medium issues: 23
Low issues: 8

Example 3: Complex Two-Phase Batch (Opus)

Task: Extract test coverage gaps, then prioritize them

Phase 1: Extract gaps

bash

/rlm-batch "src/**/*.ts" "For {file}, identify which functions lack test coverage. Check corresponding test file in test/. Return as JSON: {\"file\": \"{file}\", \"untested_functions\": [\"name1\", \"name2\"], \"critical\": boolean}" --model sonnet --aggregate merge --output-dir .aiwg/rlm/coverage-gaps

Phase 2: Prioritize gaps

bash

/rlm-batch ".aiwg/rlm/coverage-gaps/*.result.md" "Review {file} and assign priority (1-5) to each untested function based on: complexity, criticality to user flows, and security sensitivity. Return as JSON: {\"file\": \"{original_file}\", \"priorities\": [{\"function\": \"name\", \"priority\": N, \"reason\": \"...\"}]}" --model opus --aggregate summarize --output-dir .aiwg/rlm/coverage-priorities

Expected behavior:

First batch extracts untested functions from all source files
Results saved to coverage-gaps/
Second batch reads first batch results and prioritizes
Uses opus for complex prioritization logic
Summarize strategy produces final action plan

Cost estimate:

Phase 1: ~$0.30 (100 files @ sonnet)
Phase 2: ~$0.15 (100 gap files @ opus, smaller files)
Total: ~$0.45

Final output (after summarize):

markdown

# Test Coverage Priority Report

## Executive Summary

Analyzed 100 source files and identified 247 untested functions.
Prioritized based on complexity, criticality, and security impact.

## High Priority (P1) - Address Immediately

1. **src/auth/validateToken.ts → validateJWT()**
   - Reason: Critical security function, complex signature verification logic
   - Impact: Authentication bypass risk if broken

2. **src/payment/processPayment.ts → chargeCard()**
   - Reason: Handles financial transactions, multiple failure modes
   - Impact: Revenue loss or double-charging bugs

## Medium Priority (P2-P3) - Address Soon

{15 functions listed}

## Low Priority (P4-P5) - Address When Possible

{remaining functions listed}

## Recommendations

1. Start with P1 functions (2 functions, ~8 tests estimated)
2. Batch write P2 tests (15 functions, ~40 tests)
3. Consider automated test generation for P4-P5

Estimated effort: 2-3 days for P1-P2, 1 week for full coverage

Cost Awareness

Before executing, estimate and display cost:

Cost Estimate:

Files: {count}
Avg file size: {size} tokens
Model: {model}

Input tokens: {count * size}
Output tokens: {estimated}
Cost per 1M tokens: ${rate}

Total estimated cost: ${total}

Proceed? (y/n)

Safety thresholds:

Warn if cost > $1.00
Require --force if cost > $10.00
Abort if cost > $100.00 (suggest chunking)

References

RLM methodology: Retrieval, Long-form thinking, Multi-step
Parallel fan-out pattern for chunked processing
@.aiwg/rlm/ - RLM batch results directory
@$AIWG_ROOT/agentic/code/addons/rlm/docs/batch-processing.md - Detailed batch patterns
@$AIWG_ROOT/agentic/code/addons/rlm/schemas/batch-config.yaml - Batch configuration schema

Maintainer

jmagly Core maintainer

Source details

Full Name: jmagly/aiwg
Branch: main
Path in repo: agentic/code/addons/rlm/skills/rlm-batch
License: MIT License
Topics: claude-code anthropic workflow-automation developer-tools agentic-coding prompt-engineering autonomous-agents multi-agent orchestration sdlc

Featured Tools

Join Our Newsletter

Assess source quality using GRADE methodology

107 15

Explore

Didn't find tool you were looking for?

Search AI Tools

Install this agent skill to your Project

SKILL.md

RLM Batch Processing

Core Philosophy

Your Role

Natural Language Triggers

Parameters

Glob Pattern (required)

Sub-Prompt (required)

--model (default: sonnet)

--output-dir (default: .aiwg/rlm/batch-{timestamp}/)

--aggregate (default: concat)

concat (default)

merge

summarize

--max-parallel (default: 10)

Execution Flow

Phase 1: Initialization

Phase 2: Spawn Sub-Agents

Phase 3: Collect Results

Phase 4: Aggregate Results

For concat strategy:

For merge strategy:

For summarize strategy:

Phase 5: Completion Report

Error Handling

No Files Matched

Sub-Agent Failure

Rate Limit Exceeded

Out of Memory

Cost Limit Exceeded

User Communication

Success Criteria for This Command

Examples

Example 1: Simple Extraction (Haiku)

Example 2: Moderate Complexity (Sonnet)

Example 3: Complex Two-Phase Batch (Opus)

Cost Awareness

References

Recommended Agent Skills

research-document

research-archive

research-cite

induct-research

research-provenance

research-quality