Agent skill
llm_post_training
Install this agent skill to your Project
npx add-skill https://github.com/Leeroo-AI/leeroopedia-mcp/tree/main/examples/llm_post_training
SKILL.md
Leeroopedia Knowledge Base Tools
You have access to the Leeroopedia MCP tools. Use them throughout this pipeline to make informed decisions. Specifically:
-
search_knowledge: Search the knowledge base before writing any code. Look up TRL SFTTrainer usage, DPO training patterns, LoRA configuration, vLLM serving config, and lm-evaluation-harness setup. Search from multiple angles in parallel. -
build_plan: Before starting each task, request a structured implementation plan from the knowledge base. Get the correct sequence of steps, key specs, and validation criteria grounded in real framework documentation. -
review_plan: After drafting your approach for each task, submit it for review against documented best practices. Catch incorrect assumptions and known pitfalls before writing code. -
propose_hypothesis: When you are unsure how to proceed or face a design decision (e.g. choosing between training strategies or serving configurations), use this tool to get ranked approaches backed by documented framework patterns. -
query_hyperparameter_priors: Query recommended values and tuning ranges for all key hyperparameters — LoRA rank/alpha, learning rates, batch sizes, DPO beta, vLLM memory utilization, tensor parallel size, and any other parameters. Get context-specific suggestions based on model size and hardware. -
verify_code_math: After writing critical code sections (training loops, serving config), verify correctness against knowledge base documentation and reference implementations. -
diagnose_failure: If any step fails or produces unexpected results, use this tool to diagnose the root cause against known failure patterns and common misconfigurations. -
get_page: When any tool response cites a[PageID], retrieve the full page for deeper detail.
Use these tools proactively — do not rely solely on your training knowledge. The knowledge base contains up-to-date, verified documentation that will help you avoid common mistakes and choose optimal configurations.
Recommended Agent Skills
Expand your agent's capabilities with these related and highly-rated skills.
ml_inference_optimization
customer_support_agent
self_evolve_rag
using-superml
Use when starting any conversation involving ML/AI — establishes how to use Leeroopedia KB tools and workflow skills
ml-research
Use when the user wants to understand an ML/AI topic, compare approaches, or survey framework capabilities — "how does X work?", "compare X vs Y"
ml-iterate
Use when the user is stuck, needs ranked next steps, or wants alternatives after initial experiments — "I tried X and got Y, what next?"
Didn't find tool you were looking for?