Agent skills
enterprise-agent-ops

Agent skill

enterprise-agent-ops

Operate long-lived agent workloads with observability, security boundaries, and lifecycle management.

View SKILL.md on GitHub Repository

Stars 132,726

Forks 19,206

Install this agent skill to your Project

npx add-skill https://github.com/affaan-m/everything-claude-code/tree/main/skills/enterprise-agent-ops

SKILL.md

Enterprise Agent Ops

Use this skill for cloud-hosted or continuously running agent systems that need operational controls beyond single CLI sessions.

Operational Domains

runtime lifecycle (start, pause, stop, restart)
observability (logs, metrics, traces)
safety controls (scopes, permissions, kill switches)
change management (rollout, rollback, audit)

Baseline Controls

immutable deployment artifacts
least-privilege credentials
environment-level secret injection
hard timeout and retry budgets
audit log for high-risk actions

Metrics to Track

success rate
mean retries per task
time to recovery
cost per successful task
failure class distribution

Incident Pattern

When failure spikes:

freeze new rollout
capture representative traces
isolate failing route
patch with smallest safe change
run regression + security checks
resume gradually

Deployment Integrations

This skill pairs with:

PM2 workflows
systemd services
container orchestrators
CI/CD gates

Maintainer

affaan-m Core maintainer

Source details

Full Name: affaan-m/everything-claude-code
Branch: main
Path in repo: skills/enterprise-agent-ops
License: MIT License
Topics: claude-code anthropic claude mcp ai-agents developer-tools llm productivity

Featured Tools

Join Our Newsletter

Stay updated with the latest AI tools, news, and offers by subscribing to our weekly newsletter.

Recommended Agent Skills

Expand your agent's capabilities with these related and highly-rated skills.

affaan-m/everything-claude-code

python-testing

Python testing best practices using pytest including fixtures, parametrization, mocking, coverage analysis, async testing, and test organization. Use when writing or improving Python tests.

132,726 19,206

Explore

affaan-m/everything-claude-code

golang-patterns

Go-specific design patterns and best practices including functional options, small interfaces, dependency injection, concurrency patterns, error handling, and package organization. Use when working with Go code to apply idiomatic Go patterns.

132,726 19,206

Explore

affaan-m/everything-claude-code

e2e-testing

Playwright E2E testing patterns, Page Object Model, configuration, CI/CD integration, artifact management, and flaky test strategies.

132,726 19,206

Explore

affaan-m/everything-claude-code

agentic-engineering

Operate as an agentic engineer using eval-first execution, decomposition, and cost-aware model routing. Use when AI agents perform most implementation work and humans enforce quality and risk controls.

132,726 19,206

Explore

affaan-m/everything-claude-code

api-design

REST API design patterns including resource naming, status codes, pagination, filtering, error responses, versioning, and rate limiting for production APIs.

132,726 19,206

Explore

affaan-m/everything-claude-code

python-patterns

Python-specific design patterns and best practices including protocols, dataclasses, context managers, decorators, async/await, type hints, and package organization. Use when working with Python code to apply Pythonic patterns.

132,726 19,206

Explore

Didn't find tool you were looking for?

Search AI Tools

Install this agent skill to your Project

SKILL.md

Enterprise Agent Ops

Operational Domains

Baseline Controls

Metrics to Track

Incident Pattern

Deployment Integrations

Recommended Agent Skills

python-testing

golang-patterns

e2e-testing

agentic-engineering

api-design

python-patterns