Agent skill

write-playbook

Generate Mission Control Playbook YAML from natural language. Use when users ask to create, write, modify, or explain playbooks, or want to convert manual workflows into automated playbook YAML.

Install this agent skill to your Project

npx add-skill https://github.com/flanksource/claude-code-plugin/tree/main/skills/write-playbook

SKILL.md

Write Playbook

Goal

Turn a user request into a valid Playbook YAML that can be applied in Mission Control.

How to Use

Identify the action type(s) the user needs (exec, http, sql, gitops, github, azureDevopsPipeline, notification, pod, logs, ai).
Ask only the minimum clarifying questions required to produce correct YAML (target config types, credentials, parameters, trigger mode).
Produce a single Playbook YAML in a fenced code block. Keep it minimal and runnable.
If the user mentions secrets, always use connection references or secret refs (do not inline sensitive values).
If a request cannot be expressed in a single action type, chain multiple actions in the actions list.

Inputs Checklist

What the playbook should do (action type)
Target config types (for configs: selector)
Credentials source (connection name or secret ref)
Parameters the user should provide at run time
Trigger mode: manual (default), event-driven (on:), or webhook

Output Rules

Output YAML only, in a single code block.
Use apiVersion: mission-control.flanksource.com/v1 and kind: Playbook.
Set metadata.name to a short, unique slug.
Always include the schema comment: # yaml-language-server: $schema=https://raw.githubusercontent.com/flanksource/duty/main/schema/openapi/playbook.schema.json
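
As a rough illustration of these rules, a minimal skeleton might look like the sketch below. The metadata.name slug, description, and echo action are hypothetical placeholders; only the schema comment, apiVersion, and kind are taken from the rules above.

```yaml
# yaml-language-server: $schema=https://raw.githubusercontent.com/flanksource/duty/main/schema/openapi/playbook.schema.json
apiVersion: mission-control.flanksource.com/v1
kind: Playbook
metadata:
  name: echo-config-name # hypothetical short, unique slug
spec:
  description: Prints the name of the selected config item
  configs:
    - types:
        - Kubernetes::Deployment
  actions:
    - name: echo name # every action needs a unique name
      exec:
        script: echo "$(.config.name)"
```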

Quick Decision Tree

Use this to pick the right action type:

Run a shell command (kubectl, helm, scripts)? → exec
Call an external HTTP API? → http
Run a SQL query? → sql
Create a git commit / PR? → gitops
Trigger a GitHub Actions workflow? → github
Trigger an Azure DevOps pipeline? → azureDevopsPipeline
Send a notification (Slack, email, Teams)? → notification
Run a container / pod? → pod
Query logs (Loki, CloudWatch, OpenSearch, K8s)? → logs
AI-powered diagnosis or analysis? → ai

Golden Rules

Every action must have a unique name field.
Use $() delimiters for Go templates (NOT {{ }}).
For shell scripts that use $() for subshells, switch delimiters:
```bash
# gotemplate: left-delim=$[[ right-delim=]]
```
Use connections.fromConfigItem: '$(.config.id)' for exec actions that need the config item's cluster context.
Reference parameters with $(.params.<name>) and config with $(.config.name), $(.config.tags.namespace), etc.

For multi-agent setups, use dynamic runsOn:

yaml

runsOn:
  - "$(if .agent)$(.agent.id)$(else)local$(end)"

Chain multiple actions in order; use if: success() to gate on previous step success.
Set timeout on long-running actions (default is 30s).
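
Several of these rules can be combined in one fragment. The sketch below assumes a Deployment config item and a connection named slack-connection (both illustrative); it sets an explicit timeout on the slow step and gates the second step with if: success().

```yaml
spec:
  runsOn:
    - "$(if .agent)$(.agent.id)$(else)local$(end)"
  actions:
    - name: check rollout
      timeout: 5m # rollout status can exceed the default timeout
      exec:
        connections:
          fromConfigItem: '$(.config.id)'
        script: |
          kubectl rollout status deployment $(.config.name) -n $(.config.tags.namespace)
    - name: notify on success
      if: success() # runs only when the previous step succeeded
      notification:
        connection: slack-connection # assumed connection name
        title: 'Rollout complete'
        message: '$(getLastAction.result.stdout)'
```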

Template Context Variables

| Variable | Description | Example |
| --- | --- | --- |
| `.config.id` | Config item UUID | `$(.config.id)` |
| `.config.name` | Config item name | `$(.config.name)` |
| `.config.type` | Full type (e.g. `Kubernetes::Deployment`) | `$(.config.type)` |
| `.config.config_class` | Short class (e.g. `Deployment`) | `$(.config.config_class)` |
| `.config.tags.namespace` | Kubernetes namespace from tags | `$(.config.tags.namespace)` |
| `.config.config` | Raw JSON config object | `$(.config.config.spec.replicas)` |
| `.config.config \| jq "..."` | JQ query on config JSON | `$(.config.config \| jq ".spec.template.spec.containers[0].name")` |
| `.config.config \| toJSON \| neat \| json \| toYAML` | Clean YAML of config | Used in code editor defaults |
| `.params.<name>` | User-supplied parameter value | `$(.params.replicas)` |
| `.user.name` | Current user display name | `$(.user.name)` |
| `.user.email` | Current user email | `$(.user.email)` |
| `.agent.id` | Agent identifier | `$(.agent.id)` |
| `.run.id` | Current playbook run UUID | `$(.run.id)` |
| `.git.git.url` | Git repo URL from Flux origin | `$(.git.git.url)` |
| `.git.git.file` | File path from Flux origin | `$(.git.git.file)` |
| `getLastAction.result` | Previous action result | `$(getLastAction.result.stdout)` |
| `getLastAction.result.slack` | Slack-formatted result from AI action | `$(getLastAction.result.slack)` |
| `random.Alpha 8` | Random alphabetic string of 8 characters | `$(random.Alpha 8)` |
| `time.Now.Format "..."` | Current time formatted | `$(time.Now.Format "2006-01-02")` |
| `strings.ToLower` | Lowercase string | `$(.config.config_class \| strings.ToLower)` |
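
To show how these variables read inside an action, here is a small sketch of an exec step that only echoes context values. The action name is illustrative, and the script deliberately avoids shell command substitution so the default $() delimiters stay unambiguous.

```yaml
actions:
  - name: print context
    exec:
      script: |
        echo "config: $(.config.name) ($(.config.type))"
        echo "class: $(.config.config_class | strings.ToLower)"
        echo "namespace: $(.config.tags.namespace)"
        echo "requested by: $(.user.name) ($(.user.email))"
        echo "run: $(.run.id) on $(time.Now.Format "2006-01-02")"
```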

Parameter Types

| Type | Description | Example |
| --- | --- | --- |
| `text` | Single-line text input | Replicas count, names |
| `code` | Multi-line code editor | YAML input, scripts |
| `checkbox` | Boolean toggle | Enable/disable options |
| `list` | Dropdown with options | Time durations, roles |
| `config` | Config item picker | Select a ClusterRole, Namespace |
| `people` | People picker | Select users |
| `team` | Team picker | Select teams |
| `millicores` | CPU input (millicores) | CPU requests/limits |
| `bytes` | Memory input (bytes) | Memory requests/limits |

Parameter Properties

| Property | Description |
| --- | --- |
| `properties.multiline: 'true'` | Enable multiline for text |
| `properties.size: large` | Large editor for code params |
| `properties.colSpan: 4` | Grid column span (1-12) |
| `properties.options` | Array of `{label, value}` for `list` type |
| `properties.filter` | Resource filter for `config` type |
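
A parameters block that uses a few of these types and properties might look like the sketch below. The parameter names, labels, and option values are hypothetical, and the exact shape of each property should be checked against the schema. properties.filter is left out because its structure is not described here.

```yaml
parameters:
  - name: expiry
    label: Expiry
    type: list
    default: '1h'
    properties:
      colSpan: 4
      options: # array of {label, value} pairs
        - label: 1 hour
          value: '1h'
        - label: 8 hours
          value: '8h'
  - name: manifest
    label: Manifest
    type: code
    properties:
      size: large
  - name: notify_slack
    label: Notify Slack
    type: checkbox
```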

Canonical Snippets

1) exec — kubectl command

yaml

apiVersion: mission-control.flanksource.com/v1
kind: Playbook
metadata:
  name: scale-deployment
spec:
  title: Scale
  icon: scale-out
  category: Kubernetes
  description: Scales a deployment using kubectl
  configs:
    - agent: all
      types:
        - Kubernetes::Deployment
        - Kubernetes::StatefulSet
  parameters:
    - name: replicas
      label: Replicas
      type: text
      default: "$(.config.config.spec.replicas)"
  runsOn:
    - "$(if .agent)$(.agent.id)$(else)local$(end)"
  actions:
    - name: kubectl scale
      exec:
        connections:
          fromConfigItem: '$(.config.id)'
        script: |
          kubectl scale $(.config.config_class | strings.ToLower) -n $(.config.tags.namespace) $(.config.name) --replicas=$(.params.replicas)

2) http — call an external API

yaml

actions:
  - name: call webhook
    http:
      url: 'https://api.example.com/deploy'
      method: POST
      headers:
        - name: Authorization
          valueFrom:
            secretKeyRef:
              name: api-credentials
              key: token
      body: |
        {"name": "$(.config.name)", "namespace": "$(.config.tags.namespace)"}
      templateBody: true

3) sql — run a database query

yaml

actions:
  - name: query database
    sql:
      connection: postgres-connection
      query: |
        SELECT count(*) as total FROM orders WHERE status = 'pending' AND created_at > now() - interval '1 hour'

4) gitops — create a PR with changes

yaml

actions:
  - name: create PR
    gitops:
      repo:
        url: '$(.git.git.url)'
        connection: github
        branch: update-$(random.Alpha 8)
      commit:
        author: '$(.user.name)'
        email: '$(.user.email)'
        message: 'chore: update $(.config.name)'
      pr:
        title: 'chore: update $(.config.name)'
      patches:
        - path: '$(.git.git.file)'
          yq: |
            select(.metadata.name=="$(.config.config | jq ".metadata.name")").spec.replicas = $(.params.replicas)

5) github — trigger a workflow

yaml

actions:
  - name: trigger deploy workflow
    github:
      repo: org/my-repo
      username: deploy-bot
      token:
        valueFrom:
          secretKeyRef:
            name: github-token
            key: token
      workflows:
        - id: deploy.yml
          ref: main
          input: '{"environment": "$(.params.environment)"}'

6) azureDevopsPipeline — trigger a pipeline

yaml

actions:
  - name: trigger build
    azureDevopsPipeline:
      org: my-org
      project: my-project
      token:
        valueFrom:
          secretKeyRef:
            name: azdo-token
            key: token
      pipeline:
        id: "42"
      parameters:
        templateParameters:
          environment: production

7) notification — send to Slack/email

yaml

actions:
  - name: notify slack
    notification:
      connection: slack-connection
      title: 'Deployment Update'
      message: '$(.config.name) was scaled to $(.params.replicas) replicas by $(.user.name)'

8) pod — run a container

yaml

actions:
  - name: run migration
    pod:
      name: 'db-migrate-$(.run.id)'
      spec:
        containers:
          - name: migrate
            image: myapp/migrate:latest
            command: ["./migrate", "up"]
        restartPolicy: Never
      artifacts:
        - path: '/tmp/output/*'

9) logs — query Kubernetes logs

yaml

actions:
  - name: fetch logs
    logs:
      kubernetes:
        kind: Pod
        apiVersion: v1
        namespace: '$(.config.tags.namespace)'
        name: '$(.config.name)'
        start: '-1h'
        limit: '1000'

10) ai — LLM diagnosis

yaml

actions:
  - name: diagnose
    timeout: 10m
    ai:
      connection: llm-connection
      systemPrompt: 'You are a Kubernetes troubleshooting expert.'
      prompt: '$(.params.prompt)'
      changes:
        since: 7d
      relationships:
        - depth: 5
          direction: all

Advanced Features

Approval Workflows

Require approval before actions execute:

yaml

spec:
  approval:
    type: any
    approvers:
      people:
        - admin@example.com
      teams:
        - platform-team
  actions:
    - name: dangerous operation
      exec:
        script: kubectl delete pod $(.config.name) -n $(.config.tags.namespace)

Event-Driven Triggers

Auto-trigger on config/component/canary events:

yaml

spec:
  on:
    config:
      - event: created
        filter: config.type == 'Kubernetes::Deployment'
      - event: updated
        filter: config.type == 'Kubernetes::Deployment' && change.change_type == 'diff'
      - event: unhealthy
      - event: deleted
    component:
      - event: unhealthy
    canary:
      - event: failed
        filter: check.type == 'http'
  actions:
    - name: auto-remediate
      exec:
        script: echo "Event triggered for $(.config.name)"

Webhook Triggers

Expose an HTTP endpoint that triggers the playbook:

yaml

spec:
  on:
    webhook:
      path: /deploy
      authentication:
        github:
          token:
            valueFrom:
              secretKeyRef:
                name: webhook-secret
                key: token
  actions:
    - name: handle webhook
      exec:
        script: echo "Webhook received"

Authentication options: basic, github, svix, jwt.

Step Delays

Delay a step, for example to revoke temporary access once an expiry period has passed:

yaml

actions:
  - name: grant access
    exec:
      script: kubectl create rolebinding temp-access --user=$(.params.user) --clusterrole=edit -n $(.config.name)
  - name: revoke access
    if: 'success() && params.expiry != "0"'
    delay: "params.expiry"
    exec:
      script: kubectl delete rolebinding temp-access -n $(.config.name)

Step Conditions

Use CEL expressions to conditionally run steps:

yaml

actions:
  - name: diagnose
    ai:
      connection: llm
      systemPrompt: 'Diagnose this resource'
      prompt: '$(.params.prompt)'
      formats:
        - slack
  - name: send to slack
    if: success() && bool(params.notify_slack)
    notification:
      connection: slack
      title: Diagnosis Report
      message: "$(getLastAction.result.slack)"

Retry

Retry a failing action:

yaml

actions:
  - name: flaky operation
    retry:
      limit: 3
      duration: 10s
      exponent:
        multiplier: 2
    exec:
      script: curl -f https://api.example.com/health

Artifacts

Capture files produced by exec or pod actions:

yaml

actions:
  - name: collect diagnostics
    exec:
      script: |
        mkdir -p /tmp/diag
        kubectl logs $(.config.name) -n $(.config.tags.namespace) > /tmp/diag/logs.txt
        kubectl describe pod $(.config.name) -n $(.config.tags.namespace) > /tmp/diag/describe.txt
      artifacts:
        - path: '/tmp/diag/*'

Shell Script Delimiter Switching

When your bash script uses $() for command substitution, switch Go template delimiters:

yaml

actions:
  - name: complex script
    exec:
      script: |
        # gotemplate: left-delim=$[[ right-delim=]]
        ns=$[[.config.tags.namespace]]
        name=$[[.config.name]]
        pods=$(kubectl get pods -n $ns -l app=$name -o name)
        echo "$pods"

Common Mistakes to Avoid

Using {{ }} delimiters instead of $() for Go templates.
Forgetting connections.fromConfigItem on exec actions that need cluster context.
Missing timeout on long-running actions (AI, complex scripts).
Hardcoding credentials instead of using connections or secret refs.
Missing runsOn for multi-agent setups (playbook runs on server by default).
Using $() for shell command substitution without switching the Go template delimiters.
Forgetting templateBody: true on HTTP actions with templated body.
Missing name on action steps.
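
The fragment below is a sketch of an exec step that avoids several of these mistakes at once: it has a name, an explicit timeout, runsOn, and fromConfigItem, and it switches delimiters because the script uses shell command substitution. The action name and script are illustrative only.

```yaml
spec:
  runsOn:
    - "$(if .agent)$(.agent.id)$(else)local$(end)"
  actions:
    - name: count pods
      timeout: 2m
      exec:
        connections:
          fromConfigItem: '$(.config.id)'
        script: |
          # gotemplate: left-delim=$[[ right-delim=]]
          ns=$[[.config.tags.namespace]]
          # With the delimiters switched, $() below is plain shell substitution
          count=$(kubectl get pods -n "$ns" -o name | wc -l)
          echo "pods in $ns: $count"
```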

Authoring Workflow

When a user asks for a playbook:

Identify the action type(s) from the Quick Decision Tree.
Determine target config types for the configs: selector.
Identify parameters the user should provide at run time.
Determine trigger mode (manual, event, webhook).
Pick the right snippets and adapt to the user's use case.
Add advanced features if needed (approval, delay, retry, artifacts).
Return the full YAML in a single code block, ready to apply.
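
As a worked example of this workflow, suppose a user asks: "Let operators delete a pod, but only after the platform team approves, and optionally tell Slack." One possible result is sketched below. The slug, team name, and connection name are assumptions that would normally come from clarifying questions.

```yaml
# yaml-language-server: $schema=https://raw.githubusercontent.com/flanksource/duty/main/schema/openapi/playbook.schema.json
apiVersion: mission-control.flanksource.com/v1
kind: Playbook
metadata:
  name: delete-pod-with-approval # hypothetical slug
spec:
  title: Delete Pod
  category: Kubernetes
  description: Deletes a pod after approval and optionally notifies Slack
  configs:
    - types:
        - Kubernetes::Pod
  parameters:
    - name: notify_slack
      label: Notify Slack
      type: checkbox
  approval:
    type: any
    approvers:
      teams:
        - platform-team # assumed team name
  runsOn:
    - "$(if .agent)$(.agent.id)$(else)local$(end)"
  actions:
    - name: delete pod
      timeout: 2m
      exec:
        connections:
          fromConfigItem: '$(.config.id)'
        script: kubectl delete pod $(.config.name) -n $(.config.tags.namespace)
    - name: notify slack
      if: success() && bool(params.notify_slack)
      notification:
        connection: slack-connection # assumed connection name
        title: 'Pod deleted'
        message: '$(.config.name) was deleted by $(.user.name)'
```

The trigger mode is manual here, so no on: block is needed.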

Reference

Schema (Bundled)

@skills/write-playbook/references/schemas/playbook-spec.schema.json

Remote: playbook-spec.schema.json

Documentation

Playbooks guide: https://flanksource.com/docs/guide/playbooks/llms.txt
Playbooks reference: https://flanksource.com/docs/reference/playbooks/llms.txt
Go template reference: https://flanksource.com/docs/reference/scripting/gotemplate/llms.txt
CEL expression reference: https://flanksource.com/docs/reference/scripting/cel/llms.txt
Connections reference: https://flanksource.com/docs/reference/connections/llms.txt

Maintainer

flanksource (core maintainer)

Source details

Full Name: flanksource/claude-code-plugin
Branch: main
Path in repo: skills/write-playbook
