Agent skills
kubernetes-health

Agent skill

kubernetes-health

Comprehensive Kubernetes cluster health diagnostics using dynamic API discovery. Use when checking cluster health, troubleshooting K8s issues, or running health assessments.

View SKILL.md on GitHub Repository

Stars 163

Forks 31

Install this agent skill to your Project

npx add-skill https://github.com/majiayu000/claude-skill-registry/tree/main/skills/development/kubernetes-health

SKILL.md

Kubernetes Health Diagnostics

Dynamic, discovery-driven health checks for any Kubernetes cluster configuration

BEFORE YOU START

Impact	Value
Token Savings	~70% vs manual kubectl exploration
Setup Time	0 min (uses existing kubectl config)
Coverage	Adapts to installed operators automatically

Known Issues Prevented

Problem	Root Cause	How This Skill Helps
Missing operator health	Static checklists miss CRDs	Dynamic API discovery detects all installed operators
Stale diagnostics	Manual checks become outdated	Real-time cluster API interrogation
Incomplete coverage	Unknown cluster configuration	Automatically activates relevant sub-agents

Quick Start

Verify cluster access: Ensure kubectl is configured and can reach your cluster
Run discovery: Execute discover_apis.py to detect installed operators
Dispatch agents: Use the orchestrator to run health checks based on discovery

bash

# Step 1: Verify kubectl context
kubectl config current-context
kubectl cluster-info

# Step 2: Run API discovery
uv run .claude/skills/kubernetes-health/scripts/discover_apis.py

# Step 3: Review detected operators and dispatch health agents

Critical Rules

Always

Verify kubectl context before running health checks
Use read-only kubectl commands (get, describe, logs)
Run core health checks before operator-specific checks
Aggregate results using the provided scoring methodology

Never

Modify cluster resources during health checks
Expose secret values in health reports (metadata only)
Skip context verification for production clusters
Assume operator presence without API discovery

Common Mistakes

Mistake	Why It's Wrong	Correct Approach
Hardcoding operator checks	Misses installed operators, checks missing ones	Use API discovery to detect what's installed
Sequential agent dispatch	Slow for multi-operator clusters	Run operator agents in parallel (same priority)
Raw kubectl output	Token inefficient, hard to parse	Use scripts for condensed JSON output

Bundled Resources

Scripts

Script	Purpose
`scripts/discover_apis.py`	Discovers all API groups and detects installed operators
`scripts/health_orchestrator.py`	Maps discovered APIs to specialized health agents
`scripts/aggregate_report.py`	Aggregates multi-agent results into unified report

References

File	Contents
`references/operator-checks.md`	Detailed health checks for each supported operator
`references/health-scoring.md`	Scoring methodology and weight assignments

Templates

File	Purpose
`templates/health-report.json`	JSON schema for health report output

Dependencies

Required

Package	Version	Purpose
kubectl	Latest	Cluster interaction
Python	>= 3.11	Script execution
uv	Latest	Python script runner

Optional

Package	Version	Purpose
kubernetes	>= 28.1.0	Python client (for advanced discovery)

Supported Operators

The skill automatically detects and dispatches specialized agents for:

Operator	API Group	Agent
Core K8s	(always)	k8s-core-health-agent
Crossplane	crossplane.io	k8s-crossplane-health-agent
ArgoCD	argoproj.io	k8s-argocd-health-agent
Cert-Manager	cert-manager.io	k8s-certmanager-health-agent
Prometheus	monitoring.coreos.com	k8s-prometheus-health-agent

Health Scoring

Status	Score Range	Criteria
HEALTHY	90-100	All checks pass, no warnings
DEGRADED	60-89	Some warnings, no critical issues
CRITICAL	0-59	Critical issues affecting availability

Troubleshooting

kubectl connection issues

bash

# Verify context
kubectl config current-context

# Test connectivity
kubectl cluster-info

# Check permissions
kubectl auth can-i get pods --all-namespaces

Discovery returns empty results

Ensure cluster is reachable
Check RBAC permissions for API discovery
Verify kubectl version compatibility

Agent dispatch failures

Confirm discovered API group matches agent trigger
Check agent file exists in .claude/agents/specialized/kubernetes/
Review agent tool restrictions

Setup Checklist

kubectl configured and connected to cluster
Python 3.11+ installed
uv installed for script execution
Read permissions on cluster resources
Agent files present in .claude/agents/specialized/kubernetes/

Maintainer

majiayu000 Core maintainer

Source details

Full Name: majiayu000/claude-skill-registry
Branch: main
Path in repo: skills/development/kubernetes-health
License: MIT License

Featured Tools

Join Our Newsletter

Stay updated with the latest AI tools, news, and offers by subscribing to our weekly newsletter.

Recommended Agent Skills

Expand your agent's capabilities with these related and highly-rated skills.

majiayu000/claude-skill-registry

agent-ops-spec

Manage specification documents in .agent/specs/. Use when user provides requirements, acceptance criteria, or feature descriptions that need to be tracked and validated against implementation.

163 31

Explore

majiayu000/claude-skill-registry

agent-ops-state

Maintain .agent state files. Use at session start, after meaningful steps, and before concluding: read/update constitution/memory/focus/issues/baseline consistently.

163 31

Explore

majiayu000/claude-skill-registry

agent-ops-spec

Manage specification documents in .agent/specs/. Use when user provides requirements, acceptance criteria, or feature descriptions that need to be tracked and validated against implementation.

163 31

Explore

majiayu000/claude-skill-registry

agent-ops-testing

Test strategy, execution, and coverage analysis. Use when designing tests, running test suites, or analyzing test results beyond baseline checks.

163 31

Explore

majiayu000/claude-skill-registry

agent-ops-testing

Test strategy, execution, and coverage analysis. Use when designing tests, running test suites, or analyzing test results beyond baseline checks.

163 31

Explore

majiayu000/claude-skill-registry

agent-ops-state

Maintain .agent state files. Use at session start, after meaningful steps, and before concluding: read/update constitution/memory/focus/issues/baseline consistently.

163 31

Explore

Didn't find tool you were looking for?

Search AI Tools

Install this agent skill to your Project

SKILL.md

Kubernetes Health Diagnostics

BEFORE YOU START

Known Issues Prevented

Quick Start

Critical Rules

Always

Never

Common Mistakes

Bundled Resources

Scripts

References

Templates

Dependencies

Required

Optional

Supported Operators

Health Scoring

Troubleshooting

kubectl connection issues

Discovery returns empty results

Agent dispatch failures

Setup Checklist

Recommended Agent Skills

agent-ops-spec

agent-ops-state

agent-ops-spec

agent-ops-testing

agent-ops-testing

agent-ops-state