Agent skill
monitoring-analytics
Monitor Proxmox infrastructure health and performance. Track node statistics, analyze resource utilization, and identify optimization opportunities across your cluster.
Stars
163
Forks
31
Install this agent skill to your Project
npx add-skill https://github.com/majiayu000/claude-skill-registry/tree/main/skills/testing/monitoring-analytics-surrealwolf-proxmox-ve-mcp-1f0dbef8
SKILL.md
Monitoring & Analytics Skill
Monitor and analyze your Proxmox infrastructure health and performance.
What this skill does
This skill enables you to:
- Get node statistics and performance metrics
- Monitor CPU, memory, and disk utilization
- Track network performance
- Analyze VM/container performance
- Monitor resource allocation efficiency
- Identify performance bottlenecks
- Generate performance reports
- Track usage trends over time
- Plan capacity based on metrics
- Establish baselines and thresholds
When to use this skill
Use this skill when you need to:
- Check cluster health and performance
- Monitor node resource usage
- Analyze VM/container performance
- Identify performance bottlenecks
- Troubleshoot performance issues
- Plan capacity expansion
- Generate performance reports
- Establish monitoring baselines
- Forecast resource needs
- Optimize resource allocation
Available Tools
get_node_status- Get node statistics and performanceget_vm_status- Get VM performance metricsget_container_status- Get container performance metricsget_cluster_resources- Get overall cluster metrics
Typical Workflows
Infrastructure Health Check
- Use
get_cluster_resourcesfor overall health - Use
get_node_statusfor each node - Use
get_vm_statusandget_container_statusfor workload analysis - Generate comprehensive health report
Performance Analysis
- Use
get_node_statusto analyze node performance - Use
get_vm_statusto check VM performance - Identify high-utilization resources
- Analyze performance trends
- Recommend optimizations
Capacity Planning
- Use
get_cluster_resourcesfor current utilization - Use
get_node_statusfor detailed metrics - Analyze growth trends
- Project future capacity needs
- Plan scaling or upgrades
Bottleneck Identification
- Use
get_node_statusto find high CPU/memory nodes - Use
get_vm_statusfor resource-hungry VMs - Use
get_storagefor disk bottlenecks - Analyze performance impact
- Recommend solutions
Example Questions
- "What's the current cluster health and performance?"
- "Which nodes are running at high utilization?"
- "Show me the performance metrics for all VMs"
- "Are there any performance bottlenecks?"
- "Get a complete performance analysis report"
- "Which containers are consuming the most resources?"
- "What are the resource trends over time?"
Response Format
When using this skill, I provide:
- Node statistics with CPU, memory, disk metrics
- VM/container performance data
- Utilization trends and analysis
- Bottleneck identification
- Capacity planning recommendations
- Optimization suggestions
Best Practices
- Monitor metrics continuously
- Establish performance baselines
- Set appropriate alert thresholds
- Track metrics over time for trends
- Identify and optimize peak usage periods
- Balance load across nodes
- Monitor both physical and virtual resources
- Analyze before and after optimization
- Keep historical data for trend analysis
- Use metrics to justify capacity investments
- Monitor network performance
- Consider both current and future growth
Didn't find tool you were looking for?