Agent skills
distributed-training

Agent skill

distributed-training

Multi-GPU and distributed training patterns with PyTorch DDP. Use when scaling training across GPUs.

Stars 11,027

Forks 1,262

Install this agent skill to your Project

npx add-skill https://github.com/aiming-lab/AutoResearchClaw/tree/main/researchclaw/skills/builtin/tooling/distributed-training

Metadata

Additional technical details for this skill

author: researchclaw
version: 1.0
category: tooling
priority: 7
references: PyTorch DDP Tutorial, pytorch.org; Goyal et al., Accurate Large Minibatch SGD, 2017
trigger keywords: distributed,multi-gpu,parallel,ddp,scale
applicable stages: 10,12

SKILL.md

Distributed Training Best Practice

Use DistributedDataParallel (DDP) over DataParallel for multi-GPU
Initialize process group: dist.init_process_group(backend='nccl')
Use DistributedSampler for data sharding
Synchronize batch norm: nn.SyncBatchNorm.convert_sync_batchnorm()
Only save checkpoint on rank 0
Scale learning rate linearly with world size
Use gradient accumulation for effectively larger batch sizes

Maintainer

aiming-lab Core maintainer

Source details

Full Name: aiming-lab/AutoResearchClaw
Branch: main
Path in repo: researchclaw/skills/builtin/tooling/distributed-training
License: MIT License
Topics: openclaw llm-agents self-evolving autonomous-research metaclaw paper-generation scientific-discovery citation-verification multi-agent-debate

Featured Tools

Join Our Newsletter

Stay updated with the latest AI tools, news, and offers by subscribing to our weekly newsletter.

Recommended Agent Skills

Expand your agent's capabilities with these related and highly-rated skills.

aiming-lab/AutoResearchClaw

scientific-visualization

Publication-ready scientific figure design with matplotlib and seaborn. Use when creating journal submission figures with proper formatting, accessibility, and statistical annotations.

11,027 1,262

Explore

aiming-lab/AutoResearchClaw

hypothesis-formulation

Structured scientific hypothesis generation from observations. Use when formulating testable hypotheses, competing explanations, or experimental predictions.

11,027 1,262

Explore

aiming-lab/AutoResearchClaw

scientific-writing

Academic manuscript writing with IMRAD structure, citation formatting, and reporting guidelines. Use when drafting or revising research papers.

11,027 1,262

Explore

aiming-lab/AutoResearchClaw

a-evolve

Apply A-Evolve's agentic evolution methodology to improve AI agent performance across runs. Use when the user wants to diagnose agent failures, generate targeted skills from error patterns, evolve system prompts, or accumulate episodic knowledge. Works standalone or inside AutoResearchClaw pipelines. Triggers on: "evolve", "self-improve", "diagnose failures", "generate skills from errors", "what went wrong and how to fix it", or any mention of A-Evolve.

11,027 1,262

Explore

aiming-lab/AutoResearchClaw

chemistry-rdkit

Computational chemistry with RDKit for molecular analysis, descriptors, fingerprints, and substructure search. Use when working with SMILES, drug discovery, or cheminformatics tasks.

11,027 1,262

Explore

aiming-lab/AutoResearchClaw

literature-search

Systematic literature review methodology including search strategy, screening, and synthesis. Use when conducting literature reviews or writing background sections.

11,027 1,262

Explore

Didn't find tool you were looking for?

Search AI Tools

Install this agent skill to your Project

Metadata

SKILL.md

Distributed Training Best Practice

Recommended Agent Skills

scientific-visualization

hypothesis-formulation

scientific-writing

a-evolve

chemistry-rdkit

literature-search