Agent skill

alphafold

Validate protein designs using AlphaFold2 structure prediction. Use this skill when: (1) Validating designed sequences fold correctly, (2) Predicting binder-target complex structures, (3) Calculating confidence metrics (pLDDT, pTM, ipTM), (4) Self-consistency validation of designs, (5) Multi-chain complex prediction with AlphaFold-Multimer. For faster single-chain prediction, use esm. For QC thresholds, use protein-qc.

View SKILL.md on GitHub Repository

Stars 125

Forks 14

Install this agent skill to your Project

npx add-skill https://github.com/adaptyvbio/protein-design-skills/tree/main/skills/alphafold

SKILL.md

AlphaFold2 Structure Validation

Prerequisites

Requirement	Minimum	Recommended
Python	3.8+	3.10
CUDA	11.0+	12.0+
GPU VRAM	32GB	40GB (A100)
RAM	32GB	64GB
Disk	100GB	500GB (for databases)

How to run

First time? See Installation Guide to set up Modal and biomodals.

Option 1: ColabFold (recommended for multimer)

bash

cd biomodals
modal run modal_colabfold.py \
  --input-faa sequences.fasta \
  --out-dir output/

GPU: A100 (40GB) | Timeout: 3600s default

Option 2: Local installation

bash

git clone https://github.com/deepmind/alphafold.git
cd alphafold

python run_alphafold.py \
  --fasta_paths=query.fasta \
  --output_dir=output/ \
  --model_preset=monomer \
  --max_template_date=2026-01-01

Option 3: ESMFold (fast single-chain)

bash

modal run modal_esmfold.py \
  --sequence "MKTAYIAKQRQISFVK..."

Key parameters

Parameter	Default	Options	Description
`--model_preset`	monomer	monomer/multimer	Model type
`--num_recycle`	3	1-20	Recycling iterations
`--max_template_date`	-	YYYY-MM-DD	Template cutoff
`--use_templates`	True	True/False	Use template search

Output format

output/
├── ranked_0.pdb           # Best model
├── ranked_1.pdb           # Second best
├── ranking_debug.json     # Confidence scores
├── result_model_1.pkl     # Full results
├── msas/                  # MSA files
└── features.pkl           # Input features

Extracting metrics

python

import pickle

with open('result_model_1.pkl', 'rb') as f:
    result = pickle.load(f)

plddt = result['plddt']
ptm = result['ptm']
iptm = result.get('iptm', None)  # Multimer only
pae = result['predicted_aligned_error']

Sample output

Successful run

$ python run_alphafold.py --fasta_paths complex.fasta --model_preset multimer
[INFO] Running MSA search...
[INFO] Running model 1/5...
[INFO] Running model 5/5...
[INFO] Relaxing structures...

Results:
  ranked_0.pdb:
    pLDDT: 87.3 (mean)
    pTM: 0.78
    ipTM: 0.62
    PAE (interface): 8.5

Saved to output/

What good output looks like:

pLDDT: > 85 (mean, on 0-100 scale) or > 0.85 (normalized)
pTM: > 0.70
ipTM: > 0.50 for complexes
PAE_interface: < 10

Decision tree

Should I use AlphaFold?
│
├─ What are you predicting?
│  ├─ Single protein → ESMFold (faster)
│  ├─ Protein-protein complex → AlphaFold/ColabFold ✓
│  ├─ Protein + ligand → Chai or Boltz
│  └─ Batch of sequences → ColabFold ✓
│
├─ What do you need?
│  ├─ Highest accuracy → AlphaFold/ColabFold ✓
│  ├─ Fast screening → ESMFold
│  └─ MSA-free prediction → Chai or ESMFold
│
└─ Which AF2 option?
   ├─ Local installation → Full control, slow setup
   ├─ ColabFold → Easier, MSA server
   └─ Modal → Recommended for batch

Typical performance

Campaign Size	Time (A100)	Cost (Modal)	Notes
100 complexes	1-2h	~$8	With MSA server
500 complexes	5-10h	~$40	Standard campaign
1000 complexes	10-20h	~$80	Large campaign

Per-complex: ~30-60s with MSA server.

Verify

bash

find output -name "ranked_0.pdb" | wc -l  # Should match input count

Troubleshooting

Low pLDDT regions: May indicate disorder or poor design Low ipTM: Interface not confident, check hotspots High PAE off-diagonal: Chains may not interact OOM errors: Use ColabFold with MSA server instead

Error interpretation

Error	Cause	Fix
`RuntimeError: CUDA out of memory`	Sequence too long	Use A100 or split prediction
`KeyError: 'iptm'`	Running monomer on complex	Use multimer preset
`FileNotFoundError: database`	Missing MSA databases	Use ColabFold MSA server
`TimeoutError`	MSA search slow	Reduce num_recycles

Next: protein-qc for filtering and ranking.

Maintainer

adaptyvbio Core maintainer

Source details

Full Name: adaptyvbio/protein-design-skills
Branch: main
Path in repo: skills/alphafold
License: MIT License
Topics: claude-code agent-skills protein-design protein-engineering

Featured Tools

Join Our Newsletter

Stay updated with the latest AI tools, news, and offers by subscribing to our weekly newsletter.

Recommended Agent Skills

Expand your agent's capabilities with these related and highly-rated skills.

adaptyvbio/protein-design-skills

proteinmpnn

Design protein sequences using ProteinMPNN inverse folding. Use this skill when: (1) Designing sequences for RFdiffusion backbones, (2) Redesigning existing protein sequences, (3) Fixing specific residues while designing others, (4) Optimizing sequences for expression or stability, (5) Multi-state or negative design. For backbone generation, use rfdiffusion or bindcraft. For ligand-aware design, use ligandmpnn. For solubility optimization, use solublempnn.

125 14

Explore

adaptyvbio/protein-design-skills

campaign-manager

Goal-oriented binder design campaign planning and health assessment. Use this skill when: (1) Planning a complete binder design campaign, (2) Converting high-level goals into runnable pipelines, (3) Assessing campaign health and pass rates, (4) Diagnosing why designs are failing QC, (5) Estimating time, cost, and expected yields, (6) Selecting between design tools for a specific target. This skill orchestrates the other protein design tools. For individual tool parameters, use the specific tool skills.

125 14

Explore

adaptyvbio/protein-design-skills

esm

ESM2 protein language model for embeddings and sequence scoring. Use this skill when: (1) Computing pseudo-log-likelihood (PLL) scores, (2) Getting protein embeddings for clustering, (3) Filtering designs by sequence plausibility, (4) Zero-shot variant effect prediction, (5) Analyzing sequence-function relationships. For structure prediction, use chai or boltz. For QC thresholds, use protein-qc.

125 14

Explore

adaptyvbio/protein-design-skills

binding-characterization

Guidance for SPR and BLI binding characterization experiments. Use when: (1) Planning binding kinetics experiments, (2) Troubleshooting poor/no binding signal, (3) Interpreting kinetic data artifacts, (4) Choosing between SPR vs BLI platforms.

125 14

Explore

adaptyvbio/protein-design-skills

cell-free-expression

Guidance for cell-free protein synthesis (CFPS) optimization. Use when: (1) Planning CFPS experiments, (2) Troubleshooting low yield or aggregation, (3) Optimizing DNA template design for CFPS, (4) Expressing difficult proteins (disulfide-rich, toxic, membrane).

125 14

Explore

adaptyvbio/protein-design-skills

ligandmpnn

Ligand-aware protein sequence design using LigandMPNN. Use this skill when: (1) Designing sequences around small molecules, (2) Enzyme active site design, (3) Ligand binding pocket optimization, (4) Metal coordination site design, (5) Cofactor binding proteins. For standard protein design, use proteinmpnn. For solubility optimization, use solublempnn.

125 14

Explore

Didn't find tool you were looking for?

Search AI Tools

Install this agent skill to your Project

SKILL.md

AlphaFold2 Structure Validation

Prerequisites

How to run

Option 1: ColabFold (recommended for multimer)

Option 2: Local installation

Option 3: ESMFold (fast single-chain)

Key parameters

Output format

Extracting metrics

Sample output

Successful run

Decision tree

Typical performance

Verify

Troubleshooting

Error interpretation

Recommended Agent Skills

proteinmpnn

campaign-manager

esm

binding-characterization

cell-free-expression

ligandmpnn