Agent skill

grey-haven-incident-response

Handle production incidents with SRE best practices including detection, investigation, mitigation, recovery, and postmortems. Use when dealing with production outages, SEV1/SEV2 incidents, creating postmortems, or updating runbooks.

Stars 163
Forks 31

Install this agent skill to your Project

npx add-skill https://github.com/majiayu000/claude-skill-registry/tree/main/skills/devops/grey-haven-incident-response

SKILL.md

Incident Response Skill

Handle production incidents with SRE best practices including detection, investigation, mitigation, recovery, and postmortems.

Description

Production incident response following SRE methodologies with incident timeline tracking, RCA documentation, and runbook updates.

What's Included

  • Examples: SEV1 incident handling, postmortem templates
  • Reference: SRE best practices, incident severity levels
  • Templates: Incident reports, RCA documents, runbook updates

Use When

  • Production outages
  • SEV1/SEV2 incidents
  • Postmortem creation
  • Runbook updates

Related Agents

  • incident-responder

Skill Version: 1.0

Didn't find tool you were looking for?

Be as detailed as possible for better results