Agent skill

monitoring-analytics

Monitor Proxmox infrastructure health and performance. Track node statistics, analyze resource utilization, and identify optimization opportunities across your cluster.


Install this agent skill in your project:

npx add-skill https://github.com/majiayu000/claude-skill-registry/tree/main/skills/testing/monitoring-analytics-surrealwolf-proxmox-ve-mcp-1f0dbef8

SKILL.md

Monitoring & Analytics Skill

Monitor and analyze your Proxmox infrastructure health and performance.

What this skill does

This skill enables you to:

  • Get node statistics and performance metrics
  • Monitor CPU, memory, and disk utilization
  • Track network performance
  • Analyze VM/container performance
  • Monitor resource allocation efficiency
  • Identify performance bottlenecks
  • Generate performance reports
  • Track usage trends over time
  • Plan capacity based on metrics
  • Establish baselines and thresholds

When to use this skill

Use this skill when you need to:

  • Check cluster health and performance
  • Monitor node resource usage
  • Analyze VM/container performance
  • Identify performance bottlenecks
  • Troubleshoot performance issues
  • Plan capacity expansion
  • Generate performance reports
  • Establish monitoring baselines
  • Forecast resource needs
  • Optimize resource allocation

Available Tools

  • get_node_status - Get node statistics and performance
  • get_vm_status - Get VM performance metrics
  • get_container_status - Get container performance metrics
  • get_cluster_resources - Get overall cluster metrics

Typical Workflows

Infrastructure Health Check

  1. Use get_cluster_resources for overall health
  2. Use get_node_status for each node
  3. Use get_vm_status and get_container_status for workload analysis
  4. Generate a comprehensive health report
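
The final step of this workflow can be sketched in Python. This is a minimal illustration, not the skill's actual implementation: it assumes get_cluster_resources returns a list of records shaped like the Proxmox `/cluster/resources` API (fields such as `type`, `status`, `mem`, `maxmem`), which may differ from what the tool really returns. The function name `summarize_health` and the sample data are hypothetical.

```python
from collections import Counter


def summarize_health(resources):
    """Build a small health summary from cluster resource records.

    Assumption: each record is a dict with "type" and "status", and node
    records also carry "mem" and "maxmem" in bytes.
    """
    counts = Counter()
    mem_used = 0
    mem_total = 0
    for r in resources:
        counts[(r.get("type", "unknown"), r.get("status", "unknown"))] += 1
        if r.get("type") == "node":
            mem_used += r.get("mem", 0)
            mem_total += r.get("maxmem", 0)
    # Guard against an empty cluster listing to avoid dividing by zero.
    mem_pct = 100.0 * mem_used / mem_total if mem_total else 0.0
    return {"counts": dict(counts), "node_mem_pct": round(mem_pct, 1)}


# Hypothetical sample data standing in for a get_cluster_resources result.
GIB = 2**30
sample = [
    {"type": "node", "node": "pve1", "status": "online", "mem": 8 * GIB, "maxmem": 32 * GIB},
    {"type": "node", "node": "pve2", "status": "online", "mem": 24 * GIB, "maxmem": 32 * GIB},
    {"type": "qemu", "vmid": 100, "status": "running"},
    {"type": "lxc", "vmid": 200, "status": "stopped"},
]

report = summarize_health(sample)
for (kind, status), n in sorted(report["counts"].items()):
    print(f"{kind:5s} {status:8s} {n}")
print(f"node memory in use: {report['node_mem_pct']}%")
```

Counting by type and status gives a quick view of offline nodes or stopped workloads before drilling into per-node detail.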

Performance Analysis

  1. Use get_node_status to analyze node performance
  2. Use get_vm_status to check VM performance
  3. Identify high-utilization resources
  4. Compare results against earlier checks to analyze performance trends
  5. Recommend optimizations
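
Step 3 above (identifying high-utilization resources) amounts to ranking workloads by a metric. The sketch below assumes each VM or container record carries `status`, `cpu` (a fraction of allocated CPU between 0 and 1), `mem`, and `maxmem`, as in the Proxmox API; the real output of get_vm_status and get_container_status may be shaped differently. The helper names `rank_by`, `cpu_fraction`, and `mem_fraction` are hypothetical.

```python
def cpu_fraction(w):
    """CPU usage as a fraction of allocated CPU (assumed field: "cpu")."""
    return w.get("cpu", 0.0)


def mem_fraction(w):
    """Memory usage as a fraction of allocated memory."""
    maxmem = w.get("maxmem", 0)
    return w.get("mem", 0) / maxmem if maxmem else 0.0


def rank_by(workloads, metric, limit=3):
    """Return running workloads sorted by the given metric, highest first."""
    running = [w for w in workloads if w.get("status") == "running"]
    return sorted(running, key=metric, reverse=True)[:limit]


# Hypothetical sample data standing in for per-VM status results.
GIB = 2**30
workloads = [
    {"name": "web", "status": "running", "cpu": 0.12, "mem": 4 * GIB - 2**28, "maxmem": 4 * GIB},
    {"name": "db", "status": "running", "cpu": 0.81, "mem": 7 * GIB, "maxmem": 8 * GIB},
    {"name": "ci", "status": "stopped", "cpu": 0.0, "mem": 0, "maxmem": 8 * GIB},
    {"name": "cache", "status": "running", "cpu": 0.35, "mem": 3 * GIB, "maxmem": 4 * GIB},
]

print("top CPU:", [w["name"] for w in rank_by(workloads, cpu_fraction)])
print("top memory:", [w["name"] for w in rank_by(workloads, mem_fraction, limit=1)])
```

Ranking by fraction of allocation rather than absolute usage highlights workloads that are close to their own limits, which is usually what an optimization recommendation needs.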

Capacity Planning

  1. Use get_cluster_resources for current utilization
  2. Use get_node_status for detailed metrics
  3. Analyze growth trends from utilization recorded over time
  4. Project future capacity needs
  5. Plan scaling or upgrades
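
Steps 3 and 4 can be illustrated with a simple linear projection. This sketch assumes you have recorded utilization yourself at intervals (for example, one get_cluster_resources reading per week) and fits a least-squares line through those samples; linear growth is a simplifying assumption, and the function name `days_until_full` and the sample figures are hypothetical.

```python
def days_until_full(samples, capacity):
    """Estimate days from the last sample until usage reaches capacity.

    samples: list of (day_index, used) pairs, in the same unit as capacity.
    Returns None when there are too few samples or usage is not growing.
    """
    n = len(samples)
    if n < 2:
        return None
    mean_x = sum(x for x, _ in samples) / n
    mean_y = sum(y for _, y in samples) / n
    sxx = sum((x - mean_x) ** 2 for x, _ in samples)
    if sxx == 0:
        return None  # all samples taken on the same day
    # Ordinary least-squares slope and intercept.
    slope = sum((x - mean_x) * (y - mean_y) for x, y in samples) / sxx
    if slope <= 0:
        return None  # flat or shrinking usage never reaches capacity
    intercept = mean_y - slope * mean_x
    day_full = (capacity - intercept) / slope
    last_day = max(x for x, _ in samples)
    return max(0.0, day_full - last_day)


# Hypothetical weekly storage readings in GB against a 1000 GB pool.
history = [(0, 400), (7, 435), (14, 470), (21, 505)]
remaining = days_until_full(history, capacity=1000)
print(f"estimated days until full: {remaining:.0f}")
```

A projection like this is only as good as the sampling behind it, so it works best as a prompt for planning rather than a precise forecast.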

Bottleneck Identification

  1. Use get_node_status to find high CPU/memory nodes
  2. Use get_vm_status for resource-hungry VMs
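  3. Use get_storage for disk bottlenecks (this tool is not listed under Available Tools; if it is unavailable, use the disk metrics from get_node_status)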
  4. Analyze performance impact
  5. Recommend solutions
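
The node-level part of this workflow can be sketched as a threshold check across CPU, memory, and disk. The record fields (`cpu` as a 0 to 1 fraction, `mem`/`maxmem`, `disk`/`maxdisk`) follow the Proxmox API and are assumed here, not confirmed for get_node_status; the 85% threshold and the name `find_bottlenecks` are illustrative choices.

```python
def find_bottlenecks(nodes, threshold=0.85):
    """Return (node_name, {resource: fraction}) for resources at or above threshold."""
    findings = []
    for n in nodes:
        usage = {
            "cpu": n.get("cpu", 0.0),
            "memory": n.get("mem", 0) / n["maxmem"] if n.get("maxmem") else 0.0,
            "disk": n.get("disk", 0) / n["maxdisk"] if n.get("maxdisk") else 0.0,
        }
        hot = {name: value for name, value in usage.items() if value >= threshold}
        if hot:
            findings.append((n.get("node", "unknown"), hot))
    return findings


# Hypothetical sample data; sizes are in GB for readability.
nodes = [
    {"node": "pve1", "cpu": 0.92, "mem": 10, "maxmem": 64, "disk": 100, "maxdisk": 500},
    {"node": "pve2", "cpu": 0.30, "mem": 61, "maxmem": 64, "disk": 450, "maxdisk": 500},
    {"node": "pve3", "cpu": 0.10, "mem": 8, "maxmem": 64, "disk": 50, "maxdisk": 500},
]

for name, hot in find_bottlenecks(nodes):
    details = ", ".join(f"{res} at {frac:.0%}" for res, frac in hot.items())
    print(f"{name}: {details}")
```

Reporting which resource is saturated on each node, rather than a single score, makes it easier to choose between migrating workloads, adding memory, or expanding storage.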

Example Questions

  • "What's the current cluster health and performance?"
  • "Which nodes are running at high utilization?"
  • "Show me the performance metrics for all VMs"
  • "Are there any performance bottlenecks?"
  • "Get a complete performance analysis report"
  • "Which containers are consuming the most resources?"
  • "What are the resource trends over time?"

Response Format

When using this skill, I provide:

  • Node statistics with CPU, memory, and disk metrics
  • VM/container performance data
  • Utilization trends and analysis
  • Bottleneck identification
  • Capacity planning recommendations
  • Optimization suggestions

Best Practices

  • Monitor metrics continuously
  • Establish performance baselines
  • Set appropriate alert thresholds
  • Track metrics over time for trends
  • Identify and optimize peak usage periods
  • Balance load across nodes
  • Monitor both physical and virtual resources
  • Analyze before and after optimization
  • Keep historical data for trend analysis
  • Use metrics to justify capacity investments
  • Monitor network performance
  • Consider both current usage and future growth
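
One way to turn "establish baselines" and "set alert thresholds" into something concrete is to derive the threshold from recorded samples. The sketch below uses the mean plus a multiple of the standard deviation, which is a common heuristic and not something this skill prescribes; the function names and sample readings are hypothetical.

```python
import statistics


def baseline_threshold(samples, k=3.0):
    """Return (baseline, alert_threshold) from historical readings.

    baseline is the mean; the threshold sits k population standard
    deviations above it.
    """
    mean = statistics.fmean(samples)
    spread = statistics.pstdev(samples)
    return mean, mean + k * spread


def is_anomalous(value, samples, k=3.0):
    """True when a new reading exceeds the derived alert threshold."""
    _, threshold = baseline_threshold(samples, k)
    return value > threshold


# Hypothetical CPU utilization readings (fractions) from earlier checks.
cpu_history = [0.20, 0.25, 0.22, 0.24, 0.21, 0.23]
baseline, threshold = baseline_threshold(cpu_history)
print(f"baseline {baseline:.3f}, alert above {threshold:.3f}")
print("0.90 anomalous?", is_anomalous(0.90, cpu_history))
```

A statistical threshold adapts to each node's normal load, but a fixed ceiling (for example, 90% memory) is still worth keeping alongside it for resources with hard limits.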
