Agent skill

ml-expert

Install this agent skill to your Project

npx add-skill https://github.com/aiskillstore/marketplace/tree/main/skills/dnyoussef/ml-expert

SKILL.md

ML Expert - Machine Learning Implementation Specialist

Version: 1.0.0
Type: Agent-based skill with SDK implementation
Domain: Machine learning model implementation, training, and optimization

Description

Implement machine learning solutions including model architectures, training pipelines, optimization strategies, and performance improvements. This skill spawns a specialist ML implementation agent with deep expertise in PyTorch, deep learning architectures, training techniques, and production ML systems.

Use this skill when implementing new ML models, fixing training issues, optimizing performance, implementing research papers, or building production ML pipelines.

Triggers

This skill activates when users request:

"Implement this ML architecture"
"Fix the training code"
"Optimize model performance"
"Implement [paper/technique]"
"Build a training pipeline for..."
"Add [feature] to the model"
"Improve inference speed"

Skill Architecture

Skill Layer (Lightweight)

The skill handles:

Detection: Identify ML implementation requests
Context Gathering: Collect requirements, existing code, constraints
Agent Spawning: Invoke ML expert specialist with context
Result Processing: Validate and format implementation
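
The Detection step could be approximated with a few case-insensitive patterns built from the trigger phrases listed earlier. This is only a sketch: the pattern list and the `is_ml_request` name are assumptions for illustration, not part of the skill's published code.

```python
import re

# Hypothetical patterns, loosely derived from the trigger phrases in this document.
TRIGGER_PATTERNS = [
    r"\bimplement\b.*\b(architecture|model|paper|technique)\b",
    r"\bfix\b.*\btraining\b",
    r"\boptimi[sz]e\b.*\b(model|performance|inference)\b",
    r"\bbuild\b.*\btraining pipeline\b",
    r"\badd\b.*\bto the model\b",
    r"\b(improve|speed up)\b.*\binference\b",
]
_COMPILED = [re.compile(p, re.IGNORECASE) for p in TRIGGER_PATTERNS]


def is_ml_request(message: str) -> bool:
    """Return True if the message matches any of the illustrative trigger patterns."""
    return any(pattern.search(message) for pattern in _COMPILED)
```

A real detector would likely need more context than keyword matching, but a filter like this keeps the skill layer lightweight.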

Agent Layer (Specialist)

The ML expert agent handles:

Architecture Design: Create model structures following best practices
Implementation: Write production-quality PyTorch code
Optimization: Apply performance improvements and best practices
Validation: Ensure correctness through testing

Communication Protocol

Skill → Agent Context Package

json

{
  "task": "Implement TRM × Titans-MAG architecture",
  "requirements": {
    "model_type": "transformer",
    "parameters": "25M target",
    "features": ["sliding_window_attention", "long_term_memory", "ACT"],
    "framework": "pytorch",
    "constraints": {
      "vram": "6GB",
      "inference_speed": "real-time"
    }
  },
  "existing_code": {
    "files": ["model.py", "config.py"],
    "status": "partial_implementation"
  },
  "reference_materials": {
    "papers": ["TRM.pdf", "Titans-MAG.pdf"],
    "implementations": ["reference_model.py"]
  }
}
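
Before spawning the agent, the skill layer might check that a context package has the expected shape. The sketch below assumes which keys are mandatory (the document does not say), and `validate_context_package` is a hypothetical helper name.

```python
from typing import Any

# Assumption: these keys are treated as mandatory; the source does not specify this.
REQUIRED_TOP_LEVEL = ("task", "requirements", "existing_code", "reference_materials")
REQUIRED_REQUIREMENTS = ("model_type", "framework")


def validate_context_package(package: dict[str, Any]) -> list[str]:
    """Return a list of problems; an empty list means the package looks usable."""
    problems = []
    for key in REQUIRED_TOP_LEVEL:
        if key not in package:
            problems.append(f"missing top-level key: {key}")
    requirements = package.get("requirements", {})
    for key in REQUIRED_REQUIREMENTS:
        if key not in requirements:
            problems.append(f"missing requirements.{key}")
    return problems
```

Returning a list of problems rather than raising on the first one lets the skill ask the user for everything that is missing in a single round trip.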

Agent → Skill Results

json

{
  "status": "implementation_complete",
  "deliverables": {
    "code_files": [
      {
        "path": "src/model/titans_mag.py",
        "description": "Titans-MAG backbone implementation",
        "loc": 350,
        "tested": true
      }
    ],
    "tests": [
      {
        "path": "tests/test_model.py",
        "coverage": 95,
        "all_passing": true
      }
    ],
    "documentation": {
      "architecture_diagram": "docs/architecture.md",
      "usage_examples": "examples/train.py"
    }
  },
  "performance_metrics": {
    "parameter_count": "25.6M",
    "inference_time": "45ms (GPU)",
    "vram_usage": "5.2GB"
  },
  "validation": {
    "unit_tests": "48/48 passing",
    "integration_tests": "12/12 passing",
    "manual_verification": "Forward/backward pass successful"
  }
}
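
The Result Processing step could compare a results payload like the one above against the original constraints. The following is a minimal sketch; the acceptance rule (all tests passing and VRAM under the limit) and the helper names are assumptions, not the skill's documented behavior.

```python
import re
from typing import Any


def parse_pass_ratio(summary: str) -> tuple[int, int]:
    """Parse strings such as '48/48 passing' into (passed, total)."""
    match = re.match(r"\s*(\d+)\s*/\s*(\d+)", summary)
    if match is None:
        raise ValueError(f"unrecognized test summary: {summary!r}")
    return int(match.group(1)), int(match.group(2))


def parse_gb(value: str) -> float:
    """Parse strings such as '5.2GB' into a float number of gigabytes."""
    match = re.match(r"\s*(\d+(?:\.\d+)?)\s*GB", value, re.IGNORECASE)
    if match is None:
        raise ValueError(f"unrecognized memory value: {value!r}")
    return float(match.group(1))


def accept_results(results: dict[str, Any], vram_limit: str) -> bool:
    """Hypothetical acceptance rule: complete, all tests passing, VRAM within limit."""
    if results.get("status") != "implementation_complete":
        return False
    validation = results.get("validation", {})
    for key in ("unit_tests", "integration_tests"):
        summary = validation.get(key)
        if summary is None:
            return False
        passed, total = parse_pass_ratio(summary)
        if passed != total:
            return False
    used = parse_gb(results["performance_metrics"]["vram_usage"])
    return used <= parse_gb(vram_limit)
```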

Agent Spawning Logic

python

import asyncio

from claude_agent_sdk import (
    AssistantMessage,
    ClaudeAgentOptions,
    ClaudeSDKClient,
    TextBlock,
)

async def execute_ml_expert(context: dict):
    """Spawn ML implementation specialist agent."""

    # Load specialist agent prompt
    with open('agents/ml-expert-specialist.prompt', 'r', encoding='utf-8') as f:
        specialist_prompt = f.read()

    # Configure agent with write-capable tools; plan mode makes it show intent first
    options = ClaudeAgentOptions(
        model='claude-sonnet-4-5',
        system_prompt=specialist_prompt,
        permission_mode='plan',  # Show intent before editing
        allowed_tools=['Read', 'Write', 'Edit', 'Bash', 'Grep'],
        setting_sources=['project']
    )

    client = ClaudeSDKClient(options=options)

    try:
        await client.connect()

        # Format task for agent
        task = f"""Implement ML solution:

Requirements: {context['requirements']}

Existing code: {context['existing_code']}

Reference materials: {context['reference_materials']}

Deliver production-quality implementation with tests and documentation."""

        await client.query(task)

        # Collect text from assistant messages. receive_response() stops after
        # the final result message, so this loop terminates on its own.
        results = []
        async for message in client.receive_response():
            if isinstance(message, AssistantMessage):
                for block in message.content:
                    if isinstance(block, TextBlock):
                        results.append(block.text)

        # parse_implementation() is a helper defined outside this snippet
        return parse_implementation(results)

    finally:
        await client.disconnect()
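
The spawning code calls `parse_implementation`, which this document does not define. One possible shape for it, assuming `results` is a list of text strings and that the agent ends its reply with a JSON summary like the "Agent → Skill Results" payload, is sketched below. Both assumptions are illustrative; the real helper may work differently.

```python
import json
from typing import Any, Optional


def parse_implementation(results: list[str]) -> dict[str, Any]:
    """Hypothetical parser: return the last top-level JSON object found in the text."""
    text = "\n".join(results)
    decoder = json.JSONDecoder()
    found: Optional[dict[str, Any]] = None
    pos = 0
    while True:
        start = text.find("{", pos)
        if start == -1:
            break
        try:
            # raw_decode parses one JSON value starting at `start` and
            # returns it together with the index where it ended.
            obj, end = decoder.raw_decode(text, start)
        except json.JSONDecodeError:
            pos = start + 1  # not valid JSON here; keep scanning
            continue
        if isinstance(obj, dict):
            found = obj
        pos = end
    if found is None:
        # Fall back to the raw text so the caller can still inspect it.
        return {"status": "unparsed", "raw_text": text}
    return found
```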

Resources

Scripts

scripts/init_model_template.py - Generate model boilerplate
scripts/test_model.py - Model testing utilities
scripts/profile_performance.py - Performance profiling
scripts/validate_architecture.py - Architecture validation

References

references/pytorch-best-practices.md - PyTorch coding standards
references/architecture-patterns.md - Common ML architecture patterns
references/optimization-techniques.md - Training optimization guide
references/testing-guide.md - ML testing best practices

Templates

templates/model_template.py - Base model class template
templates/trainer_template.py - Training loop template
templates/config_template.py - Configuration dataclass template
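
The contents of `templates/config_template.py` are not shown here. As an illustration of what a configuration dataclass template might look like, the sketch below uses hypothetical field names and defaults, plus a rough parameter estimate that assumes a plain transformer block (attention plus a 4x MLP), not the TRM × Titans-MAG design.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ModelConfig:
    """Illustrative config; field names and defaults are assumptions."""

    vocab_size: int = 16_000
    d_model: int = 512
    n_layers: int = 6
    n_heads: int = 8
    window_size: int = 256  # span for sliding-window attention

    def __post_init__(self) -> None:
        # Fail early on settings that cannot produce a valid model.
        if self.d_model % self.n_heads != 0:
            raise ValueError("d_model must be divisible by n_heads")
        if min(self.vocab_size, self.d_model, self.n_layers, self.window_size) <= 0:
            raise ValueError("all sizes must be positive")

    def approx_param_count(self) -> int:
        """Rough estimate: 12 * d_model^2 per layer plus the embedding table."""
        per_layer = 12 * self.d_model ** 2  # 4*d^2 attention + 8*d^2 MLP, no biases or norms
        return self.n_layers * per_layer + self.vocab_size * self.d_model
```

An estimate like this is only useful for checking a target such as "25M parameters" before building the model; the real count should come from summing `p.numel()` over the model's parameters.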

Custom Tools

create_model_skeleton() - Generate model file structure
add_tests() - Create test cases for model components
benchmark_model() - Performance benchmarking

Usage Examples

Example 1: Implement New Architecture

User: "Implement the TRM × Titans-MAG architecture from these papers with 25M parameters"

Skill gathers:
- Paper PDFs with architecture details
- Target parameter count constraint
- PyTorch as framework
- GPU memory constraint (6GB)

Agent implements:
- Titans-MAG backbone (sliding window attention, LMM, MAG gate)
- TRM wrapper (multi-pass reasoning)
- ACT head (adaptive computation)
- Full model integration
- Unit tests (95% coverage)
- Usage examples
- Architecture documentation

Deliverables:
- src/model/titans_mag.py (350 LOC)
- src/model/trm_wrapper.py (180 LOC)
- src/model/act_head.py (120 LOC)
- src/model/full_model.py (200 LOC)
- tests/test_*.py (48 tests, all passing)
- docs/architecture.md

Example 2: Fix Training Issue

User: "The ACT head has variance=0 issue. Add diversity regularization."

Skill gathers:
- Current ACT head implementation
- Diagnosis from ml-training-debugger skill
- Recommended fix (diversity loss)

Agent implements:
- Modify compute_act_loss() method
- Add diversity regularization term
- Update docstrings
- Add test for variance>0
- Verify training runs without warning

Deliverables:
- Modified src/model/act_head.py
- New test: tests/test_act_diversity.py
- Validation: Warning eliminated in training

Example 3: Optimize Performance

User: "Model inference is too slow. Optimize for real-time performance."

Skill gathers:
- Current model code
- Profiling results
- Performance requirements (< 100ms)

Agent optimizes:
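- Run inference under torch.inference_mode() so autograd bookkeeping is skipped (gradient checkpointing reduces training memory, not inference latency)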
- Fuse operations where possible
- Use torch.compile() for JIT optimization
- Optimize tensor operations
- Add caching for repeated computations

Deliverables:
- Optimized model code
- Performance benchmarks (45ms → 28ms)
- Memory usage reduced (6.2GB → 5.2GB)
- All tests still passing
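
Before/after figures like the ones in this example need a repeatable measurement. The helper below is a minimal, framework-agnostic sketch that times any zero-argument callable; the `benchmark` name, the warmup and run counts, and the returned keys are assumptions for illustration.

```python
import statistics
import time
from typing import Callable


def benchmark(fn: Callable[[], object], warmup: int = 3, runs: int = 20) -> dict[str, float]:
    """Time fn() over several runs and report latency statistics in milliseconds."""
    for _ in range(warmup):
        fn()  # warm caches and any lazy initialization before timing
    timings_ms = []
    for _ in range(runs):
        start = time.perf_counter()
        fn()
        timings_ms.append((time.perf_counter() - start) * 1000.0)
    return {
        "min_ms": min(timings_ms),
        "median_ms": statistics.median(timings_ms),
        "max_ms": max(timings_ms),
    }
```

For GPU models the callable should include a synchronization point (for example `torch.cuda.synchronize()` after the forward pass), since CUDA kernels launch asynchronously and would otherwise be under-measured.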

Quality Standards

The ML expert agent must:

✅ Write production-quality, well-documented code
✅ Follow PyTorch best practices and idioms
✅ Include comprehensive tests (≥90% coverage)
✅ Verify all implementations work end-to-end
✅ Provide usage examples and documentation
✅ Optimize for readability and maintainability

Integration with Other Skills

This skill works with:

ml-training-debugger - Implements fixes from diagnoses
code-analyzer - Reviews implementation quality
functionality-audit - Validates implementations work
style-audit - Ensures code style compliance

Failure Modes and Escalation

If the agent cannot implement the solution:

Clarify ambiguous requirements with the user
Request additional reference materials
Implement partial solution with clear TODOs
Escalate if task exceeds ML expertise scope

The agent should NEVER:

Write code without understanding requirements
Implement untested functionality
Make breaking changes to existing APIs
Commit directly without validation

Testing

Test the skill with:

New model implementation (TRM × Titans-MAG)
Bug fix implementation (ACT diversity loss)
Performance optimization (inference speed)
Research paper implementation
Production pipeline creation

Documentation

Agent system prompt: agents/ml-expert-specialist.prompt
SDK implementation: index.py
Process visualization: ml-expert-process.dot
Testing guide: tests/README.md

Next Steps:

Create agent system prompt with ML implementation expertise
Implement SDK-based agent spawning
Add model templates and utilities
Test on Phase 1 implementation tasks
Integrate with ml-training-debugger workflow

Maintainer

aiskillstore Core maintainer

Source details

Full Name: aiskillstore/marketplace
Branch: main
Path in repo: skills/dnyoussef/ml-expert
Topics: claude-code claude codex-skills skills codex claude-skills ai-skills
