Agent skill
palantir-docs-local
Palantir 本地文档(en/latest)
Stars
163
Forks
31
Install this agent skill to your Project
npx add-skill https://github.com/majiayu000/claude-skill-registry/tree/main/skills/testing/palantir-docs-local-ketomihine-my-skills
SKILL.md
Palantir-Docs-Local Skill
Comprehensive assistance with Palantir - a trajectory inference tool for single-cell RNA-seq data analysis, generated from official documentation.
When to Use This Skill
Trigger this skill when users mention:
Specific Palantir Tasks
- "Palantir trajectory analysis" - Any mention of trajectory inference with Palantir
- "Pseudotime analysis" - Computing cell differentiation trajectories
- "Diffusion maps" - Using diffusion components for trajectory inference
- "Fate probabilities" - Calculating cell fate probabilities
- "Gene trends along pseudotime" - Analyzing gene expression dynamics
- "Branch analysis" - Identifying differentiation branches
Comparative Analysis
- "Compare Palantir with DPT" - Diffusion Pseudotime comparisons
- "Palantir vs FateID" - Cross-tool trajectory analysis
- "Palantir vs PAGA" - Graph abstraction comparisons
- "Trajectory tool comparison" - General method comparisons
Data Processing Tasks
- "Preprocess single-cell data for Palantir" - Data preparation workflows
- "Run Palantir pipeline" - Complete analysis workflows
- "Palantir visualization" - Creating trajectory plots
- "Export Palantir results" - Data export and integration
Technical Issues
- "Palantir error troubleshooting" - Debugging pipeline issues
- "Palantir parameters" - Method configuration
- "Palantir installation" - Setup and configuration
- "Palantir with Scanpy" - Integration workflows
Quick Reference
Example 1: Basic Setup and Data Loading
python
# Load required modules
import os
import numpy as np
import pandas as pd
import matplotlib.pyplot as plt
import palantir
import scanpy as sc
%matplotlib inline
# Load preprocessed AnnData object
ad = sc.read('annadata/human_cd34_bm_rep1.h5ad')
colors = pd.Series(ad.uns['cluster_colors'])
ct_colors = pd.Series(ad.uns['ct_colors'])
# Set root cell for trajectory analysis
ad.uns['iroot'] = np.flatnonzero(ad.obs_names == ad.obs['palantir_pseudotime'].idxmin())[0]
Example 2: Data Preprocessing for Trajectory Analysis
python
# Run PCA, tSNE, diffusion maps, and DPT
sc.pp.pca(ad, n_comps=300)
sc.tl.tsne(ad)
sc.pp.neighbors(ad, 50)
sc.tl.diffmap(ad, 10)
sc.tl.dpt(ad, n_dcs=10, n_branchings=3, copy=False)
Example 3: Visualize Cell Clusters on tSNE
python
# Plot cell clusters on tSNE map
plt.scatter(ad.obsm['X_tsne'][:, 0], ad.obsm['X_tsne'][:, 1],
s=3, color=colors[ad.obs['clusters']])
ax = plt.gca()
ax.set_axis_off()
Example 4: Visualize Pseudotime on tSNE
python
# Plot pseudotime ordering
plt.scatter(ad.obsm['X_tsne'][:, 0], ad.obsm['X_tsne'][:, 1],
s=3, c=ad.obs['dpt_pseudotime'], cmap=matplotlib.cm.plasma)
ax = plt.gca()
ax.set_axis_off()
Example 5: Define Lineage Paths for Analysis
python
# Define cell lineage paths
paths = [('Ery', [0, 5, 4]), # Erythrocyte lineage
('Mono', [0, 3, 8, 9]), # Monocyte lineage
('CLP', [0, 2]), # Common lymphoid progenitor
('Mega', [0, 5, 6]), # Megakaryocyte lineage
('DC', [0, 3, 8, 7])] # Dendritic cell lineage
Example 6: Gene Expression Trends Along Lineages
python
# Plot gene expression trends along paths
genes = ['CD34', 'MPO', 'CD79B', 'ITGA2B', 'CSF1R', 'GATA1']
labels = ['CD34', 'MPO', 'CD79B', 'CD41', 'CSF1R', 'GATA1']
for gene, label in zip(genes, labels):
fig = plt.figure(figsize=[5, 2])
ax = plt.gca()
for lineage in trends.keys():
order = trends[lineage].distance.sort_values().index
bins = np.ravel(trends[lineage].distance[order])
t = np.ravel(trends[lineage].loc[order, gene])
plt.scatter(bins, t, color=ct_colors[lineage], s=5)
ax.set_title(label)
ax.set_xlabel('Pseudo-time ordering', fontsize=10)
Example 7: Palantir Multi-Panel Plotting
python
# Use Palantir's FigureGrid for multiple plots
fig = palantir.plot.FigureGrid(n_rows=2, max_cols=3, scale=3)
# Iterate over axes and plot gene trends
for i, ax in enumerate(fig):
gene = genes[i]
ax.plot(pseudotime_values, gene_expression[gene])
ax.set_title(f'{gene} expression')
ax.set_xlabel('Pseudotime')
Example 8: Fate Probability Visualization
python
# Visualize fate probabilities for specific cells
fig = palantir.plot.FigureGrid(fateid_probs.shape[1], 1)
for br, ax in zip(fateid_probs.columns, fig):
ax.scatter(tsne.loc[:, 'x'], tsne.loc[:, 'y'], s=3,
c=fateid_probs.loc[tsne.index, br], vmin=0, vmax=1,
cmap=matplotlib.cm.plasma)
ax.set_axis_off()
ax.set_title(br)
Example 9: Data Export for Cross-Tool Analysis
python
# Export counts matrix for R analysis
fateid_dir = 'fateid/'
os.makedirs(fateid_dir, exist_ok=True)
# Prepare and export gene counts
counts = pd.DataFrame(ad.raw.X.todense(),
index=ad.obs_names,
columns=ad.var_names)
pd.DataFrame(counts.T).to_csv(f'{fateid_dir}/counts.csv')
Example 10: Load External Results
python
# Load FateID analysis results
res_dir = 'results/fateid/'
fateid_probs = pd.read_csv(res_dir + 'probs.csv', index_col=0)
fateid_probs.columns = ['Ery', 'Mega', 'DC', 'CLP', 'Mono']
# Load tSNE coordinates
tsne = pd.read_csv(res_dir + 'tsne.csv', index_col=0)
tsne.columns = ['x', 'y']
tsne.index = fateid_probs.index
Reference Files
notebooks.md (7 notebook examples)
Complete analysis workflows with real data:
- DPT Comparison Notebook - Full Diffusion Pseudotime analysis workflow including data loading, preprocessing, pseudotime computation, branch identification, and gene trend analysis
- FateID Integration Notebook - Cross-tool comparison showing data export to R, FateID analysis execution, and result import back to Python
- PAGA Analysis Notebook - Partition-based graph abstraction comparison with Palantir trajectory results
- Human CD34 Bone Marrow Dataset - Real-world example data with complete preprocessing pipeline
Each notebook includes:
- Step-by-step code execution
- Real output and results
- Visualization examples
- Cross-tool integration patterns
other.md (7 reference pages)
Comprehensive API documentation:
- Index - Complete function and class reference with hyperlinked navigation
- Postprocessing - PResults container class, gene trend computation, branch selection methods
- Preprocessing - Data filtering, normalization, log transformation utilities
- Plotting - Complete visualization toolkit including FigureGrid, trajectory plots, gene trend visualizations
Working with This Skill
For Beginners
Start here:
- Basic Setup - Use Example 1-2 for initial data loading and preprocessing
- Simple Visualizations - Practice with Examples 3-4 for basic plotting
- Follow Notebooks - Start with the DPT notebook for complete workflow understanding
- Learn Core Concepts - Focus on pseudotime, diffusion maps, and branch identification
Recommended Learning Path:
python
# 1. Load and explore data
ad = sc.read('your_data.h5ad')
sc.pp.pca(ad, n_comps=300)
sc.tl.diffmap(ad, 10)
# 2. Run basic trajectory analysis
sc.tl.dpt(ad, n_dcs=10)
# 3. Visualize results
plt.scatter(ad.obsm['X_tsne'][:, 0], ad.obsm['X_tsne'][:, 1],
c=ad.obs['dpt_pseudotime'], cmap='plasma')
For Intermediate Users
Level up your analysis:
- Gene Trend Analysis - Use Examples 6-7 for expression dynamics
- Cross-Tool Integration - Follow Example 9-10 for FateID/PAGA comparisons
- Advanced Visualization - Master FigureGrid and custom plotting
- Parameter Optimization - Experiment with diffusion components, branch numbers
Common Workflows:
python
# Complete trajectory analysis
sc.pp.pca(ad, n_comps=300)
sc.tl.diffmap(ad, 10)
sc.tl.dpt(ad, n_dcs=10, n_branchings=3)
# Gene trends
gene_trends = palantir.utils.compute_gene_trends(ad, pr_res)
# Visualization with custom styling
fig = palantir.plot.FigureGrid(len(genes), 1)
for ax, gene in zip(fig, genes):
palantir.plot.plot_gene_trend(ad, gene, ax=ax)
For Advanced Users
Expert-level techniques:
- Custom Pipeline Development - Build end-to-end analysis workflows
- Method Comparison - Systematic benchmarking across trajectory tools
- Large-Scale Analysis - Optimize for big datasets and multiple samples
- Integration with Scanpy Ecosystem - Leverage broader single-cell tools
Advanced Patterns:
python
# Cross-tool comparison framework
methods = ['palantir', 'dpt', 'paga', 'fateid']
results = {}
for method in methods:
if method == 'palantir':
results[method] = run_palantir_pipeline(ad)
elif method == 'dpt':
results[method] = run_dpt_analysis(ad)
# ... other methods
# Comparative visualization
compare_trajectory_methods(results)
Key Concepts
Core Palantir Components
- Pseudotime - Continuous ordering of cells along differentiation trajectories
- Diffusion Maps - Non-linear dimensionality reduction preserving trajectory structure
- Fate Probabilities - Probabilistic assignment of cells to terminal differentiation states
- Branch Masks - Boolean arrays identifying cells on specific lineage paths
- Gene Trends - Expression patterns of genes along pseudotime trajectories
Data Structures
- AnnData Objects - Primary data structure for single-cell data (cells × genes)
- PResults Container - Palantir results object containing pseudotime, entropy, fate probabilities
- FigureGrid - Palantir's multi-panel plotting utility for complex visualizations
Analysis Workflow Stages
- Preprocessing - Quality control, normalization, PCA, diffusion maps
- Trajectory Inference - Pseudotime computation, branch detection
- Fate Analysis - Terminal state identification, probability calculation
- Gene Trend Analysis - Expression dynamics along trajectories
- Visualization - Trajectory plots, gene expression heatmaps, fate probability maps
Comparison with Other Methods
- vs DPT - Palantir uses probabilistic fate assignments vs DPT's deterministic branching
- vs FateID - Palantir is Python-native vs FateID's R-based implementation
- vs PAGA - Palantir focuses on continuous trajectories vs PAGA's graph abstraction
Resources
Documentation Structure
- references/notebooks.md - 7 complete analysis notebooks with real data
- references/other.md - 7 API reference pages with detailed function documentation
- Quick Reference - 10 essential code patterns extracted from documentation
- Key Concepts - Theoretical background and terminology guide
External Resources
- Palantir GitHub Repository - Source code, installation instructions, updates
- Original Publication - Setty et al., Nature Methods 2019 - Method details and validation
- Scanpy Ecosystem - Integration with broader single-cell analysis tools
- Human CD34 Bone Marrow Data - Example dataset: https://s3.amazonaws.com/dp-lab-data-public/palantir/human_cd34_bm_rep1.h5ad
Community and Support
- Palantir Documentation - Official docs with tutorials and API reference
- Scanpy Tutorials - Single-cell analysis workflows compatible with Palantir
- Bioinformatics Forums - Community support for trajectory analysis questions
Notes
- Documentation Version - Generated from Palantir v1.4.1 official documentation
- Data Compatibility - All examples tested with human CD34 bone marrow dataset
- Integration Ready - Compatible with Scanpy ecosystem and AnnData data structures
- Cross-Platform - Examples work across different computational environments
- Reproducible Results - Code examples include exact parameter settings for reproducibility
Updating
To refresh this skill with updated documentation:
- Re-run Documentation Scraper with same configuration to capture latest Palantir features
- Update Quick Reference examples with new best practices and functions
- Expand Reference Files descriptions with additional notebooks and API pages
- Refresh Integration Examples with latest Scanpy and ecosystem updates
The skill maintains backward compatibility while incorporating new features and improved workflows from updated documentation.
Didn't find tool you were looking for?