Agent skill

aiml-validation-framework

AI/ML medical device validation skill implementing FDA's GMLP principles

Stars 514
Forks 31

Install this agent skill to your Project

npx add-skill https://github.com/a5c-ai/babysitter/tree/main/library/specializations/domains/science/biomedical-engineering/skills/aiml-validation-framework

Metadata

Additional technical details for this skill

domain
science
category
Medical Device Software
skill id
BME-SK-021
specialization
biomedical-engineering

SKILL.md

AI/ML Validation Framework Skill

Purpose

The AI/ML Validation Framework Skill supports validation of AI/ML-enabled medical devices per FDA Good Machine Learning Practice (GMLP) principles, addressing data quality, model performance, and predetermined change control.

Capabilities

  • Training data quality assessment
  • Ground truth labeling validation
  • Model performance metrics calculation (AUC, sensitivity, specificity)
  • Subgroup performance analysis
  • Bias and fairness evaluation
  • Predetermined change control plan (PCCP) templates
  • Clinical validation study design
  • Locked algorithm vs. adaptive documentation
  • Model explainability documentation
  • Performance monitoring planning
  • Real-world performance tracking

Usage Guidelines

When to Use

  • Validating AI/ML algorithms
  • Assessing training data quality
  • Planning clinical validation studies
  • Preparing FDA AI/ML submissions

Prerequisites

  • Algorithm development complete
  • Training/test datasets curated
  • Ground truth established
  • Intended use clearly defined

Best Practices

  • Document data management practices
  • Validate on diverse populations
  • Plan for performance monitoring
  • Consider predetermined change control

Process Integration

This skill integrates with the following processes:

  • AI/ML Medical Device Development
  • Software Verification and Validation
  • Clinical Evaluation Report Development
  • Post-Market Surveillance System Implementation

Dependencies

  • FDA AI/ML guidance
  • GMLP principles
  • Fairness toolkits (AIF360, Fairlearn)
  • Statistical analysis tools
  • Clinical study resources

Configuration

yaml
aiml-validation-framework:
  algorithm-types:
    - locked
    - adaptive
    - continuously-learning
  performance-metrics:
    - AUC
    - sensitivity
    - specificity
    - PPV
    - NPV
  subgroup-categories:
    - age
    - sex
    - race
    - disease-severity

Output Artifacts

  • Data management documentation
  • Algorithm description documents
  • Performance reports
  • Bias/fairness assessments
  • PCCP documents
  • Clinical validation protocols
  • Monitoring plans
  • FDA submission sections

Quality Criteria

  • Training data quality documented
  • Ground truth methodology validated
  • Performance meets clinical requirements
  • Subgroup performance acceptable
  • Bias assessments completed
  • PCCP appropriate for algorithm type

Expand your agent's capabilities with these related and highly-rated skills.

a5c-ai/babysitter

gsd-tools

Central utility skill for GSD operations. Provides config parsing, slug generation, timestamps, path operations, and orchestrates calls to other specialized skills. Acts as the unified entry point that the original gsd-tools.cjs provided via its lib/ modules (commands, config, core, init).

514 31
Explore
a5c-ai/babysitter

model-profile-resolution

Resolve model profile (quality/balanced/budget) at orchestration start and map agents to specific models. Enables cost/quality tradeoffs by selecting appropriate AI models for each agent role.

514 31
Explore
a5c-ai/babysitter

verification-suite

Plan structure validation, phase completeness checks, reference integrity verification, and artifact existence confirmation. Provides the structured verification layer ensuring GSD artifacts are well-formed and complete.

514 31
Explore
a5c-ai/babysitter

state-management

STATE.md reading, writing, and field-level updates. Provides cross-session state persistence via .planning/STATE.md with structured fields for current task, completed phases, blockers, decisions, and quick tasks.

514 31
Explore
a5c-ai/babysitter

git-integration

Git commit patterns, formats, and conventions for GSD methodology. Provides atomic commits per task, structured commit messages, planning file commits, branch management, and milestone tag operations.

514 31
Explore
a5c-ai/babysitter

frontmatter-parsing

YAML frontmatter parsing and manipulation for .planning/ documents. Provides read, write, update, query, and validation operations on frontmatter blocks in GSD markdown artifacts.

514 31
Explore

Didn't find tool you were looking for?

Be as detailed as possible for better results