Agent skill

genome-compare

Compare your genome to George Church (PGP-1) and estimate ancestry composition via IBS and EM admixture

Stars 2,009
Forks 275

Install this agent skill to your Project

npx add-skill https://github.com/FreedomIntelligence/OpenClaw-Medical-Skills/tree/main/skills/genome-compare

Metadata

Additional technical details for this skill

openclaw
{
    "os": [
        "macos",
        "linux"
    ],
    "emoji": "\ud83e\uddec",
    "always": false,
    "install": [],
    "homepage": "https://github.com/ClawBio/ClawBio",
    "requires": {
        "env": [],
        "bins": [
            "python3"
        ],
        "config": []
    },
    "trigger_keywords": [
        "genome comparison",
        "IBS",
        "identity by state",
        "George Church",
        "Corpasome",
        "pairwise"
    ]
}

SKILL.md

๐Ÿงฌ Genome Comparator

You are the Genome Comparator, a specialised ClawBio skill for pairwise genome comparison and ancestry estimation.

Why This Exists

  • Without it: Comparing two genomes requires PLINK, custom scripts, and ancestry reference panels โ€” hours of bioinformatics setup
  • With it: Upload a 23andMe file and instantly see IBS similarity to George Church, per-chromosome breakdown, and ancestry composition
  • Why ClawBio: Uses a bundled PGP-1 reference genome (CC0 public domain) and an EM admixture algorithm calibrated to continental ancestry-informative markers

Core Capabilities

  1. Identity By State (IBS): Compare a user's genome against George Church's public 23andMe data (PGP-1, hu43860C). Report SNP overlap, identity, and relationship context.
  2. Ancestry Composition: Estimate continental ancestry proportions (African, European, East Asian, South Asian, Americas) from ancestry-informative markers using an EM admixture algorithm.
  3. Chromosome Breakdown: Show per-chromosome IBS scores and overlap counts.

Input Formats

Format Extension Required Fields Example
23andMe raw data .txt, .txt.gz rsid, chromosome, position, genotype data/manuel_corpas_23andme.txt.gz

Reference Genome

George Church (hu43860C) โ€” the first participant in the Personal Genome Project. Professor of Genetics at Harvard Medical School. His 23andMe data (569,226 SNPs, CC0 public domain) is bundled in data/george_church_23andme.txt.gz.

Workflow

  1. Parse: Read user's 23andMe file and George Church reference (both support .txt.gz)
  2. Overlap: Find shared SNP positions between the two genomes
  3. IBS: Calculate identity-by-state score across all overlapping loci
  4. Ancestry: Run EM admixture algorithm on ancestry-informative markers
  5. Visualise: Generate per-chromosome IBS bar chart, ancestry pie, IBS context gauge, ancestry comparison
  6. Report: Write report.md with summary, IBS analysis, ancestry composition, and methods

CLI Reference

bash
# Demo: Manuel Corpas vs George Church
python skills/genome-compare/genome_compare.py --demo --output results/

# Your own data vs George Church
python skills/genome-compare/genome_compare.py --input your_23andme.txt --output results/

# Via ClawBio runner
python clawbio.py run compare --demo
python clawbio.py run compare --input <file> --output <dir>

Demo

bash
python clawbio.py run compare --demo

Expected output: A report comparing Manuel Corpas (PGP-UK uk6D0CFA) vs George Church (PGP-1 hu43860C). IBS score ~0.74 (consistent with two unrelated Europeans). Ancestry estimates for both individuals. Four figures generated.

Output Structure

output_directory/
โ”œโ”€โ”€ report.md                       # Full comparison report
โ”œโ”€โ”€ result.json                     # Machine-readable IBS and ancestry data
โ”œโ”€โ”€ figures/
โ”‚   โ”œโ”€โ”€ chromosome_ibs.png          # Per-chromosome IBS bar chart
โ”‚   โ”œโ”€โ”€ ancestry_pie.png            # Ancestry composition pie chart
โ”‚   โ”œโ”€โ”€ ibs_context.png             # IBS score on relationship spectrum gauge
โ”‚   โ””โ”€โ”€ ancestry_comparison.png     # Side-by-side ancestry comparison
โ””โ”€โ”€ reproducibility/
    โ””โ”€โ”€ commands.sh                 # Exact command to reproduce

Dependencies

Required:

  • Python 3.10+
  • numpy >= 1.24
  • matplotlib >= 3.7

Safety

  • All processing is local. Genetic data never leaves the machine.
  • Ancestry estimation is approximate โ€” for clinical-grade results, use ADMIXTURE or professional services.
  • ClawBio is a research and educational tool. It is not a medical device.

Integration with Bio Orchestrator

Trigger conditions โ€” the orchestrator routes here when:

  • User asks to compare genomes, mentions IBS, George Church, or Corpasome
  • User provides a 23andMe file and asks "how similar am I to..."

Chaining partners:

  • claw-ancestry-pca: More detailed ancestry analysis with SGDP reference panel
  • profile-report: Genome comparison results feed into the unified genomic profile

Citations

  • Church GM. The Personal Genome Project. Mol Syst Biol. 2005;1:2005.0030.
  • Corpas M. Crowdsourcing the Corpasome. Source Code Biol Med. 2013;8:13.

Expand your agent's capabilities with these related and highly-rated skills.

FreedomIntelligence/OpenClaw-Medical-Skills

vcf-annotator

Annotate VCF variants with VEP, ClinVar, gnomAD frequencies, and ancestry-aware context. Generates prioritised variant reports.

2,009 275
Explore
FreedomIntelligence/OpenClaw-Medical-Skills

chemist-analyst

Analyzes events through chemistry lens using molecular structure, reaction mechanisms, thermodynamics, kinetics, and analytical techniques (spectroscopy, chromatography, mass spectrometry). Provides insights on chemical processes, material properties, reaction pathways, synthesis, and analytical methods. Use when: Chemical reactions, material analysis, synthesis planning, process optimization, environmental chemistry. Evaluates: Molecular structure, reaction mechanisms, yield, selectivity, safety, environmental impact.

2,009 275
Explore
FreedomIntelligence/OpenClaw-Medical-Skills

bio-alignment-io

Read, write, and convert multiple sequence alignment files using Biopython Bio.AlignIO. Supports Clustal, PHYLIP, Stockholm, FASTA, Nexus, and other alignment formats for phylogenetics and conservation analysis. Use when reading, writing, or converting alignment file formats.

2,009 275
Explore
FreedomIntelligence/OpenClaw-Medical-Skills

sleep-analyzer

ๅˆ†ๆž็ก็œ ๆ•ฐๆฎใ€่ฏ†ๅˆซ็ก็œ ๆจกๅผใ€่ฏ„ไผฐ็ก็œ ่ดจ้‡๏ผŒๅนถๆไพ›ไธชๆ€งๅŒ–็ก็œ ๆ”นๅ–„ๅปบ่ฎฎใ€‚ๆ”ฏๆŒไธŽๅ…ถไป–ๅฅๅบทๆ•ฐๆฎ็š„ๅ…ณ่”ๅˆ†ๆžใ€‚

2,009 275
Explore
FreedomIntelligence/OpenClaw-Medical-Skills

metabolomics-workbench-database

Access NIH Metabolomics Workbench via REST API (4,200+ studies). Query metabolites, RefMet nomenclature, MS/NMR data, m/z searches, study metadata, for metabolomics and biomarker discovery.

2,009 275
Explore
FreedomIntelligence/OpenClaw-Medical-Skills

bio-hi-c-analysis-matrix-operations

Balance, normalize, and transform Hi-C contact matrices using cooler and cooltools. Apply iterative correction (ICE), compute expected values, and generate observed/expected matrices. Use when normalizing or transforming Hi-C matrices.

2,009 275
Explore

Didn't find tool you were looking for?

Be as detailed as possible for better results