Agent skill
grey-haven-incident-response
Handle production incidents with SRE best practices including detection, investigation, mitigation, recovery, and postmortems. Use when dealing with production outages, SEV1/SEV2 incidents, creating postmortems, or updating runbooks.
Stars
163
Forks
31
Install this agent skill to your Project
npx add-skill https://github.com/majiayu000/claude-skill-registry/tree/main/skills/devops/grey-haven-incident-response-greyhaven-ai-claude-code-config
SKILL.md
Incident Response Skill
Handle production incidents with SRE best practices including detection, investigation, mitigation, recovery, and postmortems.
Description
Production incident response following SRE methodologies with incident timeline tracking, RCA documentation, and runbook updates.
What's Included
- Examples: SEV1 incident handling, postmortem templates
- Reference: SRE best practices, incident severity levels
- Templates: Incident reports, RCA documents, runbook updates
Use When
- Production outages
- SEV1/SEV2 incidents
- Postmortem creation
- Runbook updates
Related Agents
incident-responder
Skill Version: 1.0
Didn't find tool you were looking for?