llm_post_training

View SKILL.md on GitHub Repository

Stars 10

Forks 2

Install this agent skill to your Project

npx add-skill https://github.com/Leeroo-AI/leeroopedia-mcp/tree/main/examples/llm_post_training

SKILL.md

Leeroopedia Knowledge Base Tools

You have access to the Leeroopedia MCP tools. Use them throughout this pipeline to make informed decisions. Specifically:

search_knowledge: Search the knowledge base before writing any code. Look up TRL SFTTrainer usage, DPO training patterns, LoRA configuration, vLLM serving config, and lm-evaluation-harness setup. Search from multiple angles in parallel.
build_plan: Before starting each task, request a structured implementation plan from the knowledge base. Get the correct sequence of steps, key specs, and validation criteria grounded in real framework documentation.
review_plan: After drafting your approach for each task, submit it for review against documented best practices. Catch incorrect assumptions and known pitfalls before writing code.
propose_hypothesis: When you are unsure how to proceed or face a design decision (e.g. choosing between training strategies or serving configurations), use this tool to get ranked approaches backed by documented framework patterns.
query_hyperparameter_priors: Query recommended values and tuning ranges for all key hyperparameters — LoRA rank/alpha, learning rates, batch sizes, DPO beta, vLLM memory utilization, tensor parallel size, and any other parameters. Get context-specific suggestions based on model size and hardware.
verify_code_math: After writing critical code sections (training loops, serving config), verify correctness against knowledge base documentation and reference implementations.
diagnose_failure: If any step fails or produces unexpected results, use this tool to diagnose the root cause against known failure patterns and common misconfigurations.
get_page: When any tool response cites a [PageID], retrieve the full page for deeper detail.

Use these tools proactively — do not rely solely on your training knowledge. The knowledge base contains up-to-date, verified documentation that will help you avoid common mistakes and choose optimal configurations.

Maintainer

Leeroo-AI Core maintainer

Source details

Full Name: Leeroo-AI/leeroopedia-mcp
Branch: main
Path in repo: examples/llm_post_training
License: MIT License
Topics: ai-agents llm knowledge-base machine-learning knowledge-graph dataengineering

Featured Tools

Join Our Newsletter

Stay updated with the latest AI tools, news, and offers by subscribing to our weekly newsletter.

Recommended Agent Skills

Expand your agent's capabilities with these related and highly-rated skills.

Leeroo-AI/leeroopedia-mcp

using-superml

Use when starting any conversation involving ML/AI — establishes how to use Leeroopedia KB tools and workflow skills

165 16

Explore

Leeroo-AI/superml

ml-research

Use when the user wants to understand an ML/AI topic, compare approaches, or survey framework capabilities — "how does X work?", "compare X vs Y"

165 16

Explore

Leeroo-AI/superml

ml-iterate

Use when the user is stuck, needs ranked next steps, or wants alternatives after initial experiments — "I tried X and got Y, what next?"

165 16

Explore

Didn't find tool you were looking for?

Search AI Tools

Install this agent skill to your Project

SKILL.md

Leeroopedia Knowledge Base Tools

Recommended Agent Skills

ml_inference_optimization

customer_support_agent

self_evolve_rag

using-superml

ml-research

ml-iterate