Agent skill
promotion-eval-create
Create a promotion evaluation template for any system by gathering requirements through structured questions and generating a reusable evaluation skill. Use when users ask to create a promotion check, release readiness evaluation, environment health template, or want to build a custom evaluation workflow for systems beyond Mission Control.
Install this agent skill to your Project
npx add-skill https://github.com/flanksource/claude-code-plugin/tree/main/skills/promotion-eval-create
SKILL.md
Create Promotion Evaluation Template
Core Purpose
Guide the user through a structured interview to build a custom promotion evaluation skill for their system. The output is a complete SKILL.md file they can use to evaluate environment health and release readiness.
Use @skills/promotion-eval-mission-control/SKILL.md as the reference implementation — it demonstrates the structure, phased evaluation approach, verdict logic, and report format that this skill generates for other systems.
Workflow
Step 1: Identify the System
Ask the user:
- What system or platform are you evaluating? (e.g., Kubernetes cluster, AWS environment, SaaS application, database cluster, CI/CD pipeline)
- What is the evaluation for? (e.g., release promotion, environment readiness, disaster recovery validation, compliance check)
- What environment(s) will be evaluated? (e.g., staging, production, specific cluster names)
Step 2: Define Components
Ask the user to list the health components they want to evaluate. Suggest categories based on their system type:
For Kubernetes-based systems:
- Deployments/StatefulSets health
- Pod status and restarts
- Node health and capacity
- Ingress/networking
- PersistentVolume status
- CronJob success rates
For cloud infrastructure (AWS/GCP/Azure):
- Compute instance health
- Database connectivity and replication
- Load balancer targets
- Certificate expiry
- Storage utilization
- Network connectivity
For SaaS/application systems:
- API endpoint health
- Database query performance
- Queue depth and processing rates
- Error rates and latency
- Authentication/SSO status
- Third-party dependency health
For CI/CD pipelines:
- Build success rates
- Test pass rates
- Deployment success rates
- Artifact availability
- Environment provisioning
For each component, ask:
- What tool or API provides the health data? (MCP tool, HTTP endpoint, CLI command, database query)
- What metrics matter? (counts, rates, durations, thresholds)
- What are the PASS/WARN/FAIL thresholds?
Step 3: Define Parameters
Ask about configurable parameters:
- time_window: What lookback period? (default: 24h)
- target: How is the environment identified?
- Any custom parameters specific to their system?
Step 4: Define Verdict Logic
Confirm the overall verdict mapping:
- READY: Which components must PASS?
- CAUTION: Which components can WARN without blocking?
- NOT_READY: Which component failures are blocking?
- Are any components optional (can SKIP without affecting verdict)?
Step 5: Generate the Skill
Using the gathered information, generate a complete SKILL.md following this structure:
---
name: promotion-eval-<system-name>
description: >
Evaluates <system> health for <purpose>.
Checks <component list summary>.
Use for <trigger scenarios>.
allowed-tools: <list of MCP tools or other tools needed>
---
# <System> Promotion Evaluation Skill
## Core Purpose
<One paragraph describing what this evaluation does>
## Parameters
- **time_window**: Lookback period (default: <default>)
- **target**: <how environment is identified>
<any custom parameters>
## Evaluation Procedure
Execute these phases sequentially. After each phase, record component status and findings.
Initialize a running JSON result conforming to the schema:
<JSON template with verdict, components, findings, recommendations>
---
### Phase N: <Component Name>
**Goal**: <what this phase checks>
1. <Step-by-step tool calls or queries>
**Metrics to record**:
- <metric>: <description>
**Verdict logic**:
- PASS: <criteria>
- WARN: <criteria>
- FAIL: <criteria>
---
<repeat for each component>
## Report Generation
<Markdown report template>
<JSON output template>
## Overall Verdict Logic
<Component-to-verdict mapping>
## Error Handling
- If a tool call fails, record the component as SKIP with a note
- Do not let one phase failure block subsequent phases
- Reuse data across phases when possible
Step 6: Review and Refine
Present the generated skill to the user and ask:
- Does the component list look complete?
- Are the thresholds appropriate?
- Should any phases be added or removed?
- Are the tool references correct?
Iterate until the user is satisfied.
Output
Write the final SKILL.md to the user's chosen location (default: current project's .claude/skills/ directory).
If the evaluation uses MCP tools, also note which MCP servers need to be configured.
Recommended Agent Skills
Expand your agent's capabilities with these related and highly-rated skills.
write-canary-transformations
Write correct transform blocks for Mission Control canary checks including fan-out, inline, and generated canary patterns. Use when adding transformations to canary checks, splitting a single check into multiple results, modifying check output, or generating child canaries from discovered resources.
troubleshooting-health-checks
Debugs and troubleshoots Mission Control health checks by analyzing check configurations, reviewing failure patterns, and identifying root causes. Use when users ask about failing health checks, mention specific health check names or IDs, inquire why a health check is failing or unhealthy, or need help understanding health check errors and timeouts.
write-canary-tests
Write correct test blocks and assertions for Mission Control canary health checks. Use when creating canaries that need pass/fail conditions, adding test expressions, or writing assertions based on HTTP status, JSON response, exec output, or Kubernetes health.
troubleshooting-notifications
Investigates Mission Control notifications to identify root causes and provide remediation. Use when users mention notification IDs, ask about alerts or notifications, request help understanding "why did I get this notification", want to troubleshoot a specific alert, or ask about notification patterns and history. This skill retrieves notification details, analyzes historical patterns, routes to resource-specific troubleshooting (config items or health checks), correlates findings, and delivers actionable remediation steps with prevention recommendations.
troubleshooting-config-item
Troubleshoots infrastructure and application configuration items in Mission Control by diagnosing health issues, analyzing recent changes, and investigating resource relationships. Use when users ask about unhealthy or failing resources, mention specific config items by name or ID, inquire about Kubernetes pods/deployments/services, AWS EC2 instances/volumes, Azure VMs, or other infrastructure components. Also use when investigating why a resource is down, stopped, degraded, or showing errors, or when analyzing what changed that caused an issue.
promotion-eval-mission-control
Evaluates a Mission Control environment's platform health for release or promotion readiness. Checks health check pipelines, config scrapers, background jobs, notifications, event queues, and MC infrastructure. Use for pre-release checks, environment promotion, or environment status. Triggers: "check environment health", "is it ready for release", "pre-release health check", "evaluate environment", "promotion readiness", "environment status"
Didn't find tool you were looking for?