Agent skill
mcaf-ml-ai-delivery
Apply ML/AI project delivery guidance for data exploration, feasibility, experimentation, testing, responsible AI, and operating ML systems. Use when the repo includes model training, inference, data science workflows, or ML-specific delivery planning.
Install this agent skill to your Project
npx add-skill https://github.com/managedcode/MCAF/tree/main/skills/mcaf-ml-ai-delivery
SKILL.md
MCAF: ML/AI Delivery
Trigger On
- the repo contains model training, inference, experimentation, or data-science workflow
- ML work needs explicit process, testing, or responsible-AI guidance
- delivery discussion is mixing product, data, and model concerns
Value
- produce a concrete project delta: code, docs, config, tests, CI, or review artifact
- reduce ambiguity through explicit planning, verification, and final validation skills
- leave reusable project context so future tasks are faster and safer
Do Not Use For
- generic software delivery with no ML or data-science component
- loading all ML references when only one stage is active
Inputs
- the current ML stage: framing, data exploration, experimentation, training, inference, or operations
- product assumptions, data assumptions, and model assumptions
- current verification and responsible-AI expectations
Quick Start
- Read the nearest
AGENTS.mdand confirm scope and constraints. - Run this skill's
Workflowthrough theRalph Loopuntil outcomes are acceptable. - Return the
Required Result Formatwith concrete artifacts and verification evidence.
Workflow
- Separate product assumptions, data assumptions, and model assumptions.
- Keep experimentation traceable and testable.
- Treat responsible AI, data quality, and ML-specific verification as first-class requirements.
- Load only the references that match the current ML stage.
Deliver
- clearer ML/AI delivery guidance
- better links between data, experimentation, verification, and responsible AI
- docs that match how the ML system is built and validated
Validate
- the active ML stage is explicit
- experimentation and evaluation are traceable
- responsible-AI and data-quality requirements are not bolted on at the end
Ralph Loop
Use the Ralph Loop for every task, including docs, architecture, testing, and tooling work.
- Brainstorm first (mandatory):
- analyze current state
- define the problem, target outcome, constraints, and risks
- generate options and think through trade-offs before committing
- capture the recommended direction and open questions
- Plan second (mandatory):
- write a detailed execution plan from the chosen direction
- list final validation skills to run at the end, with order and reason
- Execute one planned step and produce a concrete delta.
- Review the result and capture findings with actionable next fixes.
- Apply fixes in small batches and rerun the relevant checks or review steps.
- Update the plan after each iteration.
- Repeat until outcomes are acceptable or only explicit exceptions remain.
- If a dependency is missing, bootstrap it or return
status: not_applicablewith explicit reason and fallback path.
Required Result Format
status:complete|clean|improved|configured|not_applicable|blockedplan: concise plan and current iteration stepactions_taken: concrete changes madevalidation_skills: final skills run, or skipped with reasonsverification: commands, checks, or review evidence summaryremaining: top unresolved items ornone
For setup-only requests with no execution, return status: configured and exact next commands.
Load References
- read
references/ml-ai-projects.mdfirst - open
references/data-exploration.md,references/feasibility-studies.md,references/ml-fundamentals-checklist.md,references/model-experimentation.md,references/testing-data-science-and-mlops-code.md,references/responsible-ai.md, orreferences/ml-model-checklist.mdonly when that stage is active
Example Requests
- "Define the delivery workflow for this ML feature."
- "We need responsible-AI and testing guidance for this model."
- "Separate product, data, and model decisions in our docs."
Recommended Agent Skills
Expand your agent's capabilities with these related and highly-rated skills.
mcaf-architecture-overview
Create or update `docs/Architecture.md` as the global architecture map for a solution. Use when bootstrapping a repo, onboarding, or changing modules, boundaries, or contracts. Keep it navigational and use `references/overview-template.md` for scaffolding.
mcaf-human-review-planning
Plan a human review for a large AI-generated code drop by reading the target area, tracing the natural user and system flows, identifying the riskiest boundaries, and prioritizing the files a human should inspect first. Use when the codebase is too large to review line-by-line and you need a practical review sequence plus a prioritized file list.
mcaf-documentation
Create or refine durable engineering documentation: docs structure, navigation, source-of-truth placement, and writing quality. Use when a repo’s docs are missing, stale, duplicated, or hard to navigate, or when adding new durable engineering guidance.
mcaf-observability
Design or improve observability for application and delivery flows: logs, metrics, traces, correlation, alerts, and operational diagnostics. Use when a change affects runtime visibility, failure diagnosis, SLOs, or alerting.
mcaf-agile-delivery
Shape delivery workflow around backlog quality, roles, ceremonies, and engineering feedback. Use when defining how the team plans, tracks work, and turns feedback into durable improvements.
mcaf-solid-maintainability
Apply SOLID, SRP, cohesion, composition-over-inheritance, and small-file discipline to code changes. Use when refactoring large files or classes, setting maintainability limits in `AGENTS.md`, documenting justified exceptions, or reviewing design quality.
Didn't find tool you were looking for?