Agent skill

depmap

Query the Cancer Dependency Map (DepMap) for cancer cell line gene dependency scores (CRISPR Chronos), drug sensitivity data, and gene effect profiles. Use for identifying cancer-specific vulnerabilities, synthetic lethal interactions, and validating oncology drug targets.

View SKILL.md on GitHub Repository

Stars 16,890

Forks 1,841

Install this agent skill to your Project

npx add-skill https://github.com/K-Dense-AI/claude-scientific-skills/tree/main/scientific-skills/depmap

Metadata

Additional technical details for this skill

skill author: Kuan-lin Huang

SKILL.md

DepMap — Cancer Dependency Map

Overview

The Cancer Dependency Map (DepMap) project, run by the Broad Institute, systematically characterizes genetic dependencies across hundreds of cancer cell lines using genome-wide CRISPR knockout screens (DepMap CRISPR), RNA interference (RNAi), and compound sensitivity assays (PRISM). DepMap data is essential for:

Identifying which genes are essential for specific cancer types
Finding cancer-selective dependencies (therapeutic targets)
Validating oncology drug targets
Discovering synthetic lethal interactions

Key resources:

DepMap Portal: https://depmap.org/portal/
DepMap data downloads: https://depmap.org/portal/download/all/
Python package: depmap (or access via API/downloads)
API: https://depmap.org/portal/api/

When to Use This Skill

Use DepMap when:

Target validation: Is a gene essential for survival in cancer cell lines with a specific mutation (e.g., KRAS-mutant)?
Biomarker discovery: What genomic features predict sensitivity to knockout of a gene?
Synthetic lethality: Find genes that are selectively essential when another gene is mutated/deleted
Drug sensitivity: What cell line features predict response to a compound?
Pan-cancer essentiality: Is a gene broadly essential across all cancer types (bad target) or selectively essential?
Correlation analysis: Which pairs of genes have correlated dependency profiles (co-essentiality)?

Core Concepts

Dependency Scores

Score	Range	Meaning
Chronos (CRISPR)	~ -3 to 0+	More negative = more essential. Common essential threshold: −1. Pan-essential genes ~−1 to −2
RNAi DEMETER2	~ -3 to 0+	Similar scale to Chronos
Gene Effect	normalized	Normalized Chronos; −1 = median effect of common essential genes

Key thresholds:

Chronos ≤ −0.5: likely dependent
Chronos ≤ −1: strongly dependent (common essential range)

Cell Line Annotations

Each cell line has:

DepMap_ID: unique identifier (e.g., ACH-000001)
cell_line_name: human-readable name
primary_disease: cancer type
lineage: broad tissue lineage
lineage_subtype: specific subtype

Core Capabilities

1. DepMap API

python

import requests
import pandas as pd

BASE_URL = "https://depmap.org/portal/api"

def depmap_get(endpoint, params=None):
    url = f"{BASE_URL}/{endpoint}"
    response = requests.get(url, params=params)
    response.raise_for_status()
    return response.json()

2. Gene Dependency Scores

python

def get_gene_dependency(gene_symbol, dataset="Chronos_Combined"):
    """Get CRISPR dependency scores for a gene across all cell lines."""
    url = f"{BASE_URL}/gene"
    params = {
        "gene_id": gene_symbol,
        "dataset": dataset
    }
    response = requests.get(url, params=params)
    return response.json()

# Alternatively, use the /data endpoint:
def get_dependencies_slice(gene_symbol, dataset_name="CRISPRGeneEffect"):
    """Get a gene's dependency slice from a dataset."""
    url = f"{BASE_URL}/data/gene_dependency"
    params = {"gene_name": gene_symbol, "dataset_name": dataset_name}
    response = requests.get(url, params=params)
    data = response.json()
    return data

3. Download-Based Analysis (Recommended for Large Queries)

For large-scale analysis, download DepMap data files and analyze locally:

python

import pandas as pd
import requests, os

def download_depmap_data(url, output_path):
    """Download a DepMap data file."""
    response = requests.get(url, stream=True)
    with open(output_path, 'wb') as f:
        for chunk in response.iter_content(chunk_size=8192):
            f.write(chunk)

# DepMap 24Q4 data files (update version as needed)
FILES = {
    "crispr_gene_effect": "https://figshare.com/ndownloader/files/...",
    # OR download from: https://depmap.org/portal/download/all/
    # Files available:
    # CRISPRGeneEffect.csv - Chronos gene effect scores
    # OmicsExpressionProteinCodingGenesTPMLogp1.csv - mRNA expression
    # OmicsSomaticMutationsMatrixDamaging.csv - mutation binary matrix
    # OmicsCNGene.csv - copy number
    # sample_info.csv - cell line metadata
}

def load_depmap_gene_effect(filepath="CRISPRGeneEffect.csv"):
    """
    Load DepMap CRISPR gene effect matrix.
    Rows = cell lines (DepMap_ID), Columns = genes (Symbol (EntrezID))
    """
    df = pd.read_csv(filepath, index_col=0)
    # Rename columns to gene symbols only
    df.columns = [col.split(" ")[0] for col in df.columns]
    return df

def load_cell_line_info(filepath="sample_info.csv"):
    """Load cell line metadata."""
    return pd.read_csv(filepath)

4. Identifying Selective Dependencies

python

import numpy as np
import pandas as pd

def find_selective_dependencies(gene_effect_df, cell_line_info, target_gene,
                                 cancer_type=None, threshold=-0.5):
    """Find cell lines selectively dependent on a gene."""

    # Get scores for target gene
    if target_gene not in gene_effect_df.columns:
        return None

    scores = gene_effect_df[target_gene].dropna()
    dependent = scores[scores <= threshold]

    # Add cell line info
    result = pd.DataFrame({
        "DepMap_ID": dependent.index,
        "gene_effect": dependent.values
    }).merge(cell_line_info[["DepMap_ID", "cell_line_name", "primary_disease", "lineage"]])

    if cancer_type:
        result = result[result["primary_disease"].str.contains(cancer_type, case=False, na=False)]

    return result.sort_values("gene_effect")

# Example usage (after loading data)
# df_effect = load_depmap_gene_effect("CRISPRGeneEffect.csv")
# cell_info = load_cell_line_info("sample_info.csv")
# deps = find_selective_dependencies(df_effect, cell_info, "KRAS", cancer_type="Lung")

5. Biomarker Analysis (Gene Effect vs. Mutation)

python

import pandas as pd
from scipy import stats

def biomarker_analysis(gene_effect_df, mutation_df, target_gene, biomarker_gene):
    """
    Test if mutation in biomarker_gene predicts dependency on target_gene.

    Args:
        gene_effect_df: CRISPR gene effect DataFrame
        mutation_df: Binary mutation DataFrame (1 = mutated)
        target_gene: Gene to assess dependency of
        biomarker_gene: Gene whose mutation may predict dependency
    """
    if target_gene not in gene_effect_df.columns or biomarker_gene not in mutation_df.columns:
        return None

    # Align cell lines
    common_lines = gene_effect_df.index.intersection(mutation_df.index)
    scores = gene_effect_df.loc[common_lines, target_gene].dropna()
    mutations = mutation_df.loc[scores.index, biomarker_gene]

    mutated = scores[mutations == 1]
    wt = scores[mutations == 0]

    stat, pval = stats.mannwhitneyu(mutated, wt, alternative='less')

    return {
        "target_gene": target_gene,
        "biomarker_gene": biomarker_gene,
        "n_mutated": len(mutated),
        "n_wt": len(wt),
        "mean_effect_mutated": mutated.mean(),
        "mean_effect_wt": wt.mean(),
        "pval": pval,
        "significant": pval < 0.05
    }

6. Co-Essentiality Analysis

python

import pandas as pd

def co_essentiality(gene_effect_df, target_gene, top_n=20):
    """Find genes with most correlated dependency profiles (co-essential partners)."""
    if target_gene not in gene_effect_df.columns:
        return None

    target_scores = gene_effect_df[target_gene].dropna()

    correlations = {}
    for gene in gene_effect_df.columns:
        if gene == target_gene:
            continue
        other_scores = gene_effect_df[gene].dropna()
        common = target_scores.index.intersection(other_scores.index)
        if len(common) < 50:
            continue
        r = target_scores[common].corr(other_scores[common])
        if not pd.isna(r):
            correlations[gene] = r

    corr_series = pd.Series(correlations).sort_values(ascending=False)
    return corr_series.head(top_n)

# Co-essential genes often share biological complexes or pathways

Query Workflows

Workflow 1: Target Validation for a Cancer Type

Download CRISPRGeneEffect.csv and sample_info.csv
Filter cell lines by cancer type
Compute mean gene effect for target gene in cancer vs. all others
Calculate selectivity: how specific is the dependency to your cancer type?
Cross-reference with mutation, expression, or CNA data as biomarkers

Workflow 2: Synthetic Lethality Screen

Identify cell lines with mutation/deletion in gene of interest (e.g., BRCA1-mutant)
Compute gene effect scores for all genes in mutant vs. WT lines
Identify genes significantly more essential in mutant lines (synthetic lethal partners)
Filter by selectivity and effect size

Workflow 3: Compound Sensitivity Analysis

Download PRISM compound sensitivity data (primary-screen-replicate-treatment-info.csv)
Correlate compound AUC/log2(fold-change) with genomic features
Identify predictive biomarkers for compound sensitivity

DepMap Data Files Reference

File	Description
`CRISPRGeneEffect.csv`	CRISPR Chronos gene effect (primary dependency data)
`CRISPRGeneEffectUnscaled.csv`	Unscaled CRISPR scores
`RNAi_merged.csv`	DEMETER2 RNAi dependency
`sample_info.csv`	Cell line metadata (lineage, disease, etc.)
`OmicsExpressionProteinCodingGenesTPMLogp1.csv`	mRNA expression
`OmicsSomaticMutationsMatrixDamaging.csv`	Damaging somatic mutations (binary)
`OmicsCNGene.csv`	Copy number per gene
`PRISM_Repurposing_Primary_Screens_Data.csv`	Drug sensitivity (repurposing library)

Download all files from: https://depmap.org/portal/download/all/

Best Practices

Use Chronos scores (not DEMETER2) for current CRISPR analyses — better controlled for cutting efficiency
Distinguish pan-essential from cancer-selective: Target genes with low variance (essential in all lines) are poor drug targets
Validate with expression data: A gene not expressed in a cell line will score as non-essential regardless of actual function
Use DepMap ID for cell line identification — cell_line_name can be ambiguous
Account for copy number: Amplified genes may appear essential due to copy number effect (junk DNA hypothesis)
Multiple testing correction: When computing biomarker associations genome-wide, apply FDR correction

Additional Resources

DepMap Portal: https://depmap.org/portal/
Data downloads: https://depmap.org/portal/download/all/
DepMap paper: Behan FM et al. (2019) Nature. PMID: 30971826
Chronos paper: Dempster JM et al. (2021) Nature Methods. PMID: 34349281
GitHub: https://github.com/broadinstitute/depmap-portal
Figshare: https://figshare.com/articles/dataset/DepMap_24Q4_Public/27993966

Maintainer

K-Dense-AI Core maintainer

Source details

Full Name: K-Dense-AI/claude-scientific-skills
Branch: main
Path in repo: scientific-skills/depmap
License: MIT License
Topics: claude agent-skills claudecode claude-skills bioinformatics ai-scientist chemoinformatics clinical-research computational-biology data-analysis drug-discovery genomics materials-science metabolomics proteomics scientific-computing scientific-visualization

Featured Tools

Join Our Newsletter

Stay updated with the latest AI tools, news, and offers by subscribing to our weekly newsletter.

Recommended Agent Skills

Expand your agent's capabilities with these related and highly-rated skills.

K-Dense-AI/claude-scientific-skills

pufferlib

High-performance reinforcement learning framework optimized for speed and scale. Use when you need fast parallel training, vectorized environments, multi-agent systems, or integration with game environments (Atari, Procgen, NetHack). Achieves 2-10x speedups over standard implementations. For quick prototyping or standard algorithm implementations with extensive documentation, use stable-baselines3 instead.

16,890 1,841

Explore

K-Dense-AI/claude-scientific-skills

fluidsim

Framework for computational fluid dynamics simulations using Python. Use when running fluid dynamics simulations including Navier-Stokes equations (2D/3D), shallow water equations, stratified flows, or when analyzing turbulence, vortex dynamics, or geophysical flows. Provides pseudospectral methods with FFT, HPC support, and comprehensive output analysis.

16,890 1,841

Explore

K-Dense-AI/claude-scientific-skills

geniml

This skill should be used when working with genomic interval data (BED files) for machine learning tasks. Use for training region embeddings (Region2Vec, BEDspace), single-cell ATAC-seq analysis (scEmbed), building consensus peaks (universes), or any ML-based analysis of genomic regions. Applies to BED file collections, scATAC-seq data, chromatin accessibility datasets, and region-based genomic feature learning.

16,890 1,841

Explore

K-Dense-AI/claude-scientific-skills

astropy

Comprehensive Python library for astronomy and astrophysics. This skill should be used when working with astronomical data including celestial coordinates, physical units, FITS files, cosmological calculations, time systems, tables, world coordinate systems (WCS), and astronomical data analysis. Use when tasks involve coordinate transformations, unit conversions, FITS file manipulation, cosmological distance calculations, time scale conversions, or astronomical data processing.

16,890 1,841

Explore

K-Dense-AI/claude-scientific-skills

pyhealth

Comprehensive healthcare AI toolkit for developing, testing, and deploying machine learning models with clinical data. This skill should be used when working with electronic health records (EHR), clinical prediction tasks (mortality, readmission, drug recommendation), medical coding systems (ICD, NDC, ATC), physiological signals (EEG, ECG), healthcare datasets (MIMIC-III/IV, eICU, OMOP), or implementing deep learning models for healthcare applications (RETAIN, SafeDrug, Transformer, GNN).

16,890 1,841

Explore

K-Dense-AI/claude-scientific-skills

research-lookup

Look up current research information using the Parallel Chat API (primary) or Perplexity sonar-pro-search (academic paper searches). Automatically routes queries to the best backend. Use for finding papers, gathering research data, and verifying scientific information.

16,890 1,841

Explore

Didn't find tool you were looking for?

Search AI Tools

Install this agent skill to your Project

Metadata

SKILL.md

DepMap — Cancer Dependency Map

Overview

When to Use This Skill

Core Concepts

Dependency Scores

Cell Line Annotations

Core Capabilities

1. DepMap API

2. Gene Dependency Scores

3. Download-Based Analysis (Recommended for Large Queries)

4. Identifying Selective Dependencies

5. Biomarker Analysis (Gene Effect vs. Mutation)

6. Co-Essentiality Analysis

Query Workflows

Workflow 1: Target Validation for a Cancer Type

Workflow 2: Synthetic Lethality Screen

Workflow 3: Compound Sensitivity Analysis

DepMap Data Files Reference

Best Practices

Additional Resources

Recommended Agent Skills

pufferlib

fluidsim

geniml

astropy

pyhealth

research-lookup