Agent skill
sub-agent-delegation
Delegate complex tasks to sub-agents for parallel autonomous work. Use when GPU kernel optimization, numerical correctness verification, performance profiling, or long-running validation would benefit from focused independent execution.
Install this agent skill to your Project
npx add-skill https://github.com/Infatoshi/CLAUDE.md/tree/main/skills/sub-agent-delegation
SKILL.md
Sub-Agent Delegation
Permissions
- NEVER spawn a sub-agent without explicit permission from the user
- ASK first: "I've identified [TASK] for sub-agent delegation. Should I spawn one?"
- Explain WHY delegation would help before requesting permission
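The permission rules above can be sketched as a small gate. This is only an illustration: `build_permission_request` and `may_spawn` are hypothetical helper names, not part of the skill, and the rule that only an explicit "yes" counts as permission is an assumption.

```python
def build_permission_request(task: str, reason: str) -> str:
    """Return the message to show the user before spawning a sub-agent."""
    if not task.strip() or not reason.strip():
        raise ValueError("both a task and a reason are required")
    # Give the reason first, then ask, mirroring the rules above.
    return (
        f"{reason.strip()} "
        f"I've identified {task.strip()} for sub-agent delegation. "
        "Should I spawn one?"
    )


def may_spawn(user_reply: str) -> bool:
    """Assumption: only an explicit yes counts as permission."""
    return user_reply.strip().lower() in {"y", "yes"}


request = build_permission_request(
    task="the softmax kernel benchmark sweep",
    reason="The sweep needs many independent benchmark iterations.",
)
```

Anything other than an explicit yes, including silence or an ambiguous reply, is treated as a refusal in this sketch.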
When to Delegate
- GPU kernel optimization with iterative benchmarking
- Numerical correctness verification across test cases
- Performance profiling and analysis
- Parallel investigation of independent code paths
- Long-running validation suites
Patterns
- Parallel: Optimize independent kernels simultaneously (e.g., attention kernel to sub-agent A, MLP kernel to sub-agent B)
- Correctness First: Make tests pass before optimizing for performance
- Incremental: Iterate until the target speedup is reached; otherwise report blockers
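A minimal sketch of the Parallel pattern, assuming each sub-agent can be driven through a blocking call. `run_sub_agent` is a hypothetical placeholder that returns a canned report; a real version would launch the sub-agent and wait for its result.

```python
from concurrent.futures import ThreadPoolExecutor


def run_sub_agent(name: str, task: str) -> dict:
    """Hypothetical stand-in for launching a sub-agent and awaiting its report."""
    return {"agent": name, "task": task, "status": "done"}


def delegate_in_parallel(assignments: dict) -> list:
    """Run one sub-agent per independent task; return reports in input order."""
    workers = max(1, len(assignments))
    with ThreadPoolExecutor(max_workers=workers) as pool:
        futures = [
            pool.submit(run_sub_agent, name, task)
            for name, task in assignments.items()
        ]
        # result() blocks until that sub-agent has finished.
        return [future.result() for future in futures]


reports = delegate_in_parallel(
    {"A": "optimize attention kernel", "B": "optimize MLP kernel"}
)
```

This only fits tasks that are truly independent; kernels that share files or state would need to be delegated one after another.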
Kernel Optimization Template
Optimize [OPERATION] in [FILE].
Context: [current impl], [bottleneck source], [target HW: 3090/H100], [use case: train/inference]
Requirements:
1. Implement with Triton/CUDA
2. Verify: torch.allclose(output, reference, atol=1e-5, rtol=1e-5); gradients match autograd
3. Benchmark: warmup=10, bench=100 iterations; report min/max/mean/std in microseconds (us)
4. Scales: (1,128), (8,512), (32,2048)
Report: correctness status, perf table (scale, baseline_us, opt_us, speedup), memory
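The perf table requested in the report could be assembled as follows. This is a sketch: `perf_table` is a hypothetical helper, speedup is assumed to mean `baseline_us / opt_us`, and the numbers in the example are placeholders, not measurements.

```python
def perf_table(rows):
    """Format (scale, baseline_us, opt_us) rows into a plain-text perf table."""
    lines = ["scale | baseline_us | opt_us | speedup"]
    for scale, baseline_us, opt_us in rows:
        # Assumed definition: speedup > 1 means the optimized kernel is faster.
        speedup = baseline_us / opt_us
        lines.append(f"{scale} | {baseline_us:.1f} | {opt_us:.1f} | {speedup:.2f}x")
    return "\n".join(lines)


# Placeholder values for illustration only; a real report uses measured timings.
table = perf_table([("(1,128)", 100.0, 50.0), ("(8,512)", 400.0, 500.0)])
```

A speedup below 1.00x, as in the second placeholder row, marks a regression at that scale.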
Workflow
Setup -> Develop -> Verify -> Benchmark -> Report
Requirements
- Report measured numbers, never estimates
- Include methodology (warmup count, iteration count, device synchronization)
- Flag regressions immediately
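One way to meet these reporting requirements is a timing harness that returns its methodology alongside the numbers. The sketch below uses only the standard library; `benchmark` and `is_regression` are hypothetical names, and the `sync` hook is an assumption about how device synchronization would be plugged in (on a CUDA device one would typically pass `torch.cuda.synchronize`).

```python
import statistics
import time


def benchmark(fn, warmup=10, iters=100, sync=None):
    """Time fn; return min/max/mean/std in microseconds plus the methodology."""
    if iters < 2:
        raise ValueError("need at least 2 iterations to compute a std")
    wait = sync if sync is not None else (lambda: None)
    for _ in range(warmup):
        fn()
    wait()  # drain warmup work before timing starts
    samples_us = []
    for _ in range(iters):
        start = time.perf_counter()
        fn()
        wait()  # include queued device work in the measured interval
        samples_us.append((time.perf_counter() - start) * 1e6)
    return {
        "min_us": min(samples_us),
        "max_us": max(samples_us),
        "mean_us": statistics.fmean(samples_us),
        "std_us": statistics.stdev(samples_us),
        "warmup": warmup,
        "iters": iters,
        "synced": sync is not None,
    }


def is_regression(baseline_us, opt_us):
    """Flag any case where the optimized version is slower than the baseline."""
    return opt_us > baseline_us


stats = benchmark(lambda: sum(range(1000)), warmup=10, iters=100)
```

Returning `warmup`, `iters`, and `synced` with the timings keeps the methodology attached to every reported number.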