Agent skill
nanoresearch-experiment
Generate a Python code skeleton from an experiment blueprint
Install this agent skill to your Project
npx add-skill https://github.com/OpenRaiser/NanoResearch/tree/main/skills/nanoresearch-experiment
SKILL.md
Experiment Skill
Purpose
Take the experiment blueprint and produce a runnable Python code skeleton that implements the proposed method, baselines, training loops, evaluation harness, and ablation configurations.
Tools Required
None. This skill operates entirely through LLM code generation based on the experiment blueprint.
Input
experiment_blueprint: Path topapers/experiment_blueprint.jsonproduced by the planning skill
Process
- Parse the experiment blueprint for datasets, baselines, metrics, and ablation groups
- Generate the project directory structure (data loaders, models, training, evaluation, configs)
- Produce data loading and preprocessing code for each specified dataset
- Implement model architecture stubs for the proposed method and each baseline
- Generate training loop with logging, checkpointing, and early stopping
- Implement the evaluation harness computing all specified metrics
- Create configuration files for each ablation group
- Add a main entry point that accepts a config and runs the full train-evaluate pipeline
Output
Produces experiments/ directory containing:
data/: Data loading and preprocessing modulesmodels/: Model architecture implementations (proposed method and baselines)training/: Training loop and optimization utilitiesevaluation/: Metric computation and result aggregationconfigs/: YAML configuration files for each experiment and ablation variantrun.py: Main entry point for launching experimentsrequirements.txt: Python dependencies
Recommended Agent Skills
Expand your agent's capabilities with these related and highly-rated skills.
nanoresearch-planning
Produce an experiment blueprint from a research hypothesis
nanoresearch-ideation
Search academic literature and generate research hypotheses
skypilot-multi-cloud-orchestration
Multi-cloud orchestration for ML workloads with automatic cost optimization. Use when you need to run training or batch jobs across multiple clouds, leverage spot instances with auto-recovery, or optimize GPU costs across providers.
peft-fine-tuning
Parameter-efficient fine-tuning for LLMs using LoRA, QLoRA, and 25+ methods. Use when fine-tuning large models (7B-70B) with limited GPU memory, when you need to train <1% of parameters with minimal accuracy loss, or for multi-adapter serving. HuggingFace's official library integrated with transformers ecosystem.
huggingface-accelerate
Simplest distributed training API. 4 lines to add distributed support to any PyTorch script. Unified API for DeepSpeed/FSDP/Megatron/DDP. Automatic device placement, mixed precision (FP16/BF16/FP8). Interactive config, single launch command. HuggingFace ecosystem standard.
ml-training-recipes
Battle-tested PyTorch training recipes for all domains — LLMs, vision, diffusion, medical imaging, protein/drug discovery, spatial omics, genomics. Covers training loops, optimizer selection (AdamW, Muon), LR scheduling, mixed precision, debugging, and systematic experimentation. Use when training or fine-tuning neural networks, debugging loss spikes or OOM, choosing architectures, or optimizing GPU throughput.
Didn't find tool you were looking for?