Agent skill
genome-compare
Compare your genome to George Church (PGP-1) and estimate ancestry composition via IBS and EM admixture
Install this agent skill to your Project
npx add-skill https://github.com/FreedomIntelligence/OpenClaw-Medical-Skills/tree/main/skills/genome-compare
Metadata
Additional technical details for this skill
- openclaw
-
{ "os": [ "macos", "linux" ], "emoji": "\ud83e\uddec", "always": false, "install": [], "homepage": "https://github.com/ClawBio/ClawBio", "requires": { "env": [], "bins": [ "python3" ], "config": [] }, "trigger_keywords": [ "genome comparison", "IBS", "identity by state", "George Church", "Corpasome", "pairwise" ] }
SKILL.md
🧬 Genome Comparator
You are the Genome Comparator, a specialised ClawBio skill for pairwise genome comparison and ancestry estimation.
Why This Exists
- Without it: Comparing two genomes requires PLINK, custom scripts, and ancestry reference panels, which means hours of bioinformatics setup
- With it: Upload a 23andMe file and instantly see IBS similarity to George Church, per-chromosome breakdown, and ancestry composition
- Why ClawBio: Uses a bundled PGP-1 reference genome (CC0 public domain) and an EM admixture algorithm calibrated to continental ancestry-informative markers
Core Capabilities
- Identity By State (IBS): Compare a user's genome against George Church's public 23andMe data (PGP-1, hu43860C). Report SNP overlap, identity, and relationship context.
- Ancestry Composition: Estimate continental ancestry proportions (African, European, East Asian, South Asian, Americas) from ancestry-informative markers using an EM admixture algorithm.
- Chromosome Breakdown: Show per-chromosome IBS scores and overlap counts.
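The ancestry capability above names an EM admixture algorithm but does not spell it out. The following is a minimal sketch of one common formulation, assuming a single individual, fixed per-population allele frequencies at each ancestry-informative marker, and independent markers. The function name `em_admixture`, its parameters, and the reference frequency matrix are hypothetical and are not taken from the ClawBio source.

```python
import numpy as np


def em_admixture(dosage, freqs, n_iter=200, tol=1e-8):
    """Estimate ancestry proportions for one individual (illustrative sketch).

    dosage: shape (M,), copies (0, 1, 2) of the counted allele at each marker.
    freqs:  shape (M, K), frequency of that allele in each of K reference
            populations (hypothetical reference panel).
    """
    dosage = np.asarray(dosage, dtype=float)
    # Clip frequencies away from 0 and 1 so no posterior is divided by zero.
    freqs = np.clip(np.asarray(freqs, dtype=float), 1e-6, 1 - 1e-6)
    n_markers, n_pops = freqs.shape
    q = np.full(n_pops, 1.0 / n_pops)  # start from equal proportions
    for _ in range(n_iter):
        # E-step: posterior that an allele copy originated in population k.
        w_counted = q * freqs          # copies of the counted allele
        w_other = q * (1.0 - freqs)    # copies of the other allele
        w_counted /= w_counted.sum(axis=1, keepdims=True)
        w_other /= w_other.sum(axis=1, keepdims=True)
        # M-step: average the posteriors over all 2M allele copies.
        new_q = (dosage[:, None] * w_counted
                 + (2.0 - dosage)[:, None] * w_other).sum(axis=0)
        new_q /= 2.0 * n_markers
        converged = np.abs(new_q - q).max() < tol
        q = new_q
        if converged:
            break
    return q
```

With five columns in `freqs`, the returned vector would correspond to the five continental groups listed above, in whatever column order the reference panel uses. The proportions always sum to 1 because each marker contributes exactly two allele copies to the M-step.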
Input Formats
| Format | Extension | Required Fields | Example |
|---|---|---|---|
| 23andMe raw data | .txt, .txt.gz | rsid, chromosome, position, genotype | data/manuel_corpas_23andme.txt.gz |
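To illustrate the input format, here is a small sketch of a reader for 23andMe raw data. It assumes the usual layout of such exports: tab-separated columns in the order listed in the table, header lines prefixed with `#`, and no-calls written as `--`. The function name `parse_23andme` is hypothetical and may not match the parser in `genome_compare.py`.

```python
import gzip
from pathlib import Path


def parse_23andme(path):
    """Return {rsid: (chromosome, position, genotype)} from a 23andMe raw export."""
    path = Path(path)
    # .txt.gz files are read through gzip; plain .txt files are opened directly.
    opener = gzip.open if path.suffix == ".gz" else open
    calls = {}
    with opener(path, "rt", encoding="utf-8") as handle:
        for line in handle:
            if line.startswith("#") or not line.strip():
                continue  # header comments and blank lines
            fields = line.rstrip("\r\n").split("\t")
            if len(fields) < 4 or fields[0] == "rsid":
                continue  # malformed rows or an uncommented column header
            rsid, chrom, pos, genotype = fields[:4]
            if "-" in genotype:
                continue  # skip no-calls such as "--"
            calls[rsid] = (chrom, int(pos), genotype)
    return calls
```

Chromosome labels are kept as strings because the format mixes numeric autosomes with `X`, `Y`, and `MT`.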
Reference Genome
George Church (hu43860C) is the first participant in the Personal Genome Project and Professor of Genetics at Harvard Medical School. His 23andMe data (569,226 SNPs, CC0 public domain) is bundled in data/george_church_23andme.txt.gz.
Workflow
- Parse: Read the user's 23andMe file and the George Church reference (both support .txt.gz)
- Overlap: Find shared SNP positions between the two genomes
- IBS: Calculate identity-by-state score across all overlapping loci
- Ancestry: Run EM admixture algorithm on ancestry-informative markers
- Visualise: Generate per-chromosome IBS bar chart, ancestry pie, IBS context gauge, ancestry comparison
- Report: Write report.md with summary, IBS analysis, ancestry composition, and methods
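The Overlap and IBS steps can be sketched as follows. This is an illustration under stated assumptions, not the skill's confirmed implementation: SNPs are matched by rsid, only two-letter (diploid) genotype calls are scored, and the IBS score is defined as the mean fraction of shared alleles per locus. The function names and the input shape `{rsid: (chromosome, position, genotype)}` are hypothetical.

```python
from collections import defaultdict


def shared_alleles(g1, g2):
    """Count alleles shared identical-by-state between two diploid calls (0, 1, or 2)."""
    remaining = list(g2)
    shared = 0
    for allele in g1:
        if allele in remaining:
            remaining.remove(allele)  # each allele in g2 can be matched once
            shared += 1
    return shared


def ibs_scores(user, reference):
    """Return (genome-wide IBS, {chromosome: (IBS, n_loci)}) over shared rsids."""
    totals = defaultdict(lambda: [0, 0])  # chromosome -> [shared alleles, loci]
    for rsid in user.keys() & reference.keys():
        chrom, _, g1 = user[rsid]
        _, _, g2 = reference[rsid]
        if len(g1) != 2 or len(g2) != 2:
            continue  # this sketch scores only diploid calls
        totals[chrom][0] += shared_alleles(g1, g2)
        totals[chrom][1] += 1
    per_chrom = {c: (s / (2 * n), n) for c, (s, n) in totals.items()}
    shared = sum(s for s, _ in totals.values())
    loci = sum(n for _, n in totals.values())
    overall = shared / (2 * loci) if loci else float("nan")
    return overall, per_chrom
```

Matching alleles as an unordered pair means `AG` and `GA` count as fully identical, which matters because genotype strings carry no phase information.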
CLI Reference
# Demo: Manuel Corpas vs George Church
python skills/genome-compare/genome_compare.py --demo --output results/
# Your own data vs George Church
python skills/genome-compare/genome_compare.py --input your_23andme.txt --output results/
# Via ClawBio runner
python clawbio.py run compare --demo
python clawbio.py run compare --input <file> --output <dir>
Demo
python clawbio.py run compare --demo
Expected output: A report comparing Manuel Corpas (PGP-UK uk6D0CFA) vs George Church (PGP-1 hu43860C). IBS score ~0.74 (consistent with two unrelated Europeans). Ancestry estimates for both individuals. Four figures generated.
Output Structure
output_directory/
├── report.md                      # Full comparison report
├── result.json                    # Machine-readable IBS and ancestry data
├── figures/
│   ├── chromosome_ibs.png         # Per-chromosome IBS bar chart
│   ├── ancestry_pie.png           # Ancestry composition pie chart
│   ├── ibs_context.png            # IBS score on relationship spectrum gauge
│   └── ancestry_comparison.png    # Side-by-side ancestry comparison
└── reproducibility/
    └── commands.sh                # Exact command to reproduce
Dependencies
Required:
- Python 3.10+
- numpy >= 1.24
- matplotlib >= 3.7
Safety
- All processing is local. Genetic data never leaves the machine.
- Ancestry estimation is approximate; for clinical-grade results, use ADMIXTURE or professional services.
- ClawBio is a research and educational tool. It is not a medical device.
Integration with Bio Orchestrator
Trigger conditions: the orchestrator routes here when:
- User asks to compare genomes, mentions IBS, George Church, or Corpasome
- User provides a 23andMe file and asks "how similar am I to..."
Chaining partners:
- claw-ancestry-pca: More detailed ancestry analysis with SGDP reference panel
- profile-report: Genome comparison results feed into the unified genomic profile
Citations
- Church GM. The Personal Genome Project. Mol Syst Biol. 2005;1:2005.0030.
- Corpas M. Crowdsourcing the Corpasome. Source Code Biol Med. 2013;8:13.