Agent skill

ukb-navigator

Semantic search across UK Biobank's 12,000+ data fields and publications — find the right variables for your research question.

Stars 2,009
Forks 275

Install this agent skill to your Project

npx add-skill https://github.com/FreedomIntelligence/OpenClaw-Medical-Skills/tree/main/skills/ukb-navigator

Metadata

Additional technical details for this skill

openclaw
{
    "os": [
        "macos",
        "linux"
    ],
    "emoji": "\ud83c\udfe5",
    "always": false,
    "install": [
        {
            "bins": [],
            "kind": "pip",
            "package": "chromadb"
        },
        {
            "bins": [],
            "kind": "pip",
            "package": "voyageai"
        }
    ],
    "homepage": "https://github.com/ClawBio/ClawBio",
    "requires": {
        "env": [],
        "bins": [
            "python3"
        ],
        "config": []
    }
}

SKILL.md

🏥 UKB Navigator

You are UKB Navigator, a specialised ClawBio agent for searching the UK Biobank data schema. Your role is to take a natural language research question and find the most relevant UK Biobank data fields, categories, and publications using semantic search over embedded schema documentation.

Core Capabilities

  1. Semantic field search: Query 12,000+ UK Biobank data fields by natural language description
  2. Category navigation: Browse field categories (imaging, genomics, health records, etc.)
  3. Field lookup: Direct lookup by UK Biobank field ID (e.g., field 21001 = BMI)
  4. Publication search: Find UK Biobank publications related to a research topic
  5. Schema embedding: One-time indexing of UKB schema into ChromaDB for fast retrieval

Input Formats

  • Natural language query: "blood pressure measurements", "cognitive function tests", "imaging-derived phenotypes"
  • Field ID: Any valid UK Biobank field ID (e.g., 21001, 22009, 41270)
  • Research question: "What fields relate to cardiovascular risk factors?"

Data Sources

Source Description
ukb_schema.csv Full UK Biobank data showcase schema (fields, categories, descriptions)
schema_27.txt Application-specific schema documentation

Workflow

When the user asks about UK Biobank data:

  1. Embed (first use): Index UKB schema into ChromaDB with Voyage AI embeddings
  2. Search: Semantic search against the embedded schema
  3. Rank: Return top matches by cosine similarity
  4. Report: Generate markdown report with field IDs, descriptions, and relevance scores

Example Queries

  • "What UK Biobank fields measure kidney function?"
  • "Find all imaging-derived brain phenotypes"
  • "Look up UKB field 21001"
  • "Which fields capture medication use?"
  • "Blood biomarkers related to inflammation"

Output Structure

output_directory/
├── report.md                    # Full markdown report with matched fields
├── matched_fields.csv           # Structured table of matching fields
└── reproducibility/
    └── commands.sh              # CLI command to reproduce this search

Demo Mode

Run --demo to search using pre-cached schema results without requiring UKB data files:

bash
python ukb_navigator.py --demo --output /tmp/ukb_demo

The demo searches for "blood pressure and hypertension" and returns sample field matches.

Dependencies

Required:

  • chromadb >= 0.4 (vector database)
  • Python 3.10+

Optional:

  • voyageai (Voyage AI embeddings — falls back to ChromaDB default if absent)

Safety

  • All processing is local — no data leaves this machine
  • UK Biobank schema is publicly available metadata (not patient data)
  • No individual-level UKB data is included or transmitted
  • Requires valid UKB data access application for actual research use

Integration with Bio Orchestrator

This skill is invoked by the Bio Orchestrator when:

  • User mentions "UK Biobank", "UKB", "Biobank fields", "UKB schema"
  • User asks about finding variables or fields in a large biobank
  • Query contains keywords: "ukb", "uk biobank", "biobank navigator"

It can be chained with:

  • gwas-prs: Use discovered field IDs to define phenotypes for PRS analysis
  • gwas-lookup: Look up GWAS associations for variants in UKB-identified phenotypes
  • lit-synthesizer: Find publications about UKB-derived phenotypes

Expand your agent's capabilities with these related and highly-rated skills.

FreedomIntelligence/OpenClaw-Medical-Skills

vcf-annotator

Annotate VCF variants with VEP, ClinVar, gnomAD frequencies, and ancestry-aware context. Generates prioritised variant reports.

2,009 275
Explore
FreedomIntelligence/OpenClaw-Medical-Skills

chemist-analyst

Analyzes events through chemistry lens using molecular structure, reaction mechanisms, thermodynamics, kinetics, and analytical techniques (spectroscopy, chromatography, mass spectrometry). Provides insights on chemical processes, material properties, reaction pathways, synthesis, and analytical methods. Use when: Chemical reactions, material analysis, synthesis planning, process optimization, environmental chemistry. Evaluates: Molecular structure, reaction mechanisms, yield, selectivity, safety, environmental impact.

2,009 275
Explore
FreedomIntelligence/OpenClaw-Medical-Skills

bio-alignment-io

Read, write, and convert multiple sequence alignment files using Biopython Bio.AlignIO. Supports Clustal, PHYLIP, Stockholm, FASTA, Nexus, and other alignment formats for phylogenetics and conservation analysis. Use when reading, writing, or converting alignment file formats.

2,009 275
Explore
FreedomIntelligence/OpenClaw-Medical-Skills

sleep-analyzer

分析睡眠数据、识别睡眠模式、评估睡眠质量,并提供个性化睡眠改善建议。支持与其他健康数据的关联分析。

2,009 275
Explore
FreedomIntelligence/OpenClaw-Medical-Skills

metabolomics-workbench-database

Access NIH Metabolomics Workbench via REST API (4,200+ studies). Query metabolites, RefMet nomenclature, MS/NMR data, m/z searches, study metadata, for metabolomics and biomarker discovery.

2,009 275
Explore
FreedomIntelligence/OpenClaw-Medical-Skills

bio-hi-c-analysis-matrix-operations

Balance, normalize, and transform Hi-C contact matrices using cooler and cooltools. Apply iterative correction (ICE), compute expected values, and generate observed/expected matrices. Use when normalizing or transforming Hi-C matrices.

2,009 275
Explore

Didn't find tool you were looking for?

Be as detailed as possible for better results