Agent skill

ops-llm

Local LLM health checks and cache management. Probe Ollama/vLLM/SGLang endpoints, clean model caches.

Stars 163
Forks 31

Install this agent skill to your Project

npx add-skill https://github.com/majiayu000/claude-skill-registry/tree/main/skills/data/ops-llm

Metadata

Additional technical details for this skill

short description
Local LLM health checks and cache management

SKILL.md

LLM Ops

Manage local LLM runtimes and caches.

Commands

bash
# Check all common LLM endpoints (Ollama, vLLM, SGLang)
./scripts/health.sh

# Check specific endpoint
./scripts/health.sh --target ollama:http://127.0.0.1:11434

# Continue even if some fail
./scripts/health.sh --warn-only

# Show cache sizes (dry-run)
./scripts/cache-clean.sh

# Actually clean caches
./scripts/cache-clean.sh --execute

# Clean additional path
./scripts/cache-clean.sh --path ~/.cache/torch --execute

Default Endpoints Checked

  • Ollama: http://127.0.0.1:11434
  • vLLM: http://127.0.0.1:8000
  • SGLang: http://127.0.0.1:30000

Default Cache Directories

  • ~/.cache/ollama
  • ~/.cache/huggingface
  • ~/.cache/vllm

Environment Variables

Variable Default Description
LLM_HEALTH_TIMEOUT 2 Seconds to wait per endpoint
LLM_CACHE_DIRS (see above) Space-separated cache paths

Didn't find tool you were looking for?

Be as detailed as possible for better results