Agent skill
ops-llm
Local LLM health checks and cache management. Probe Ollama/vLLM/SGLang endpoints, clean model caches.
Stars
163
Forks
31
Install this agent skill to your Project
npx add-skill https://github.com/majiayu000/claude-skill-registry/tree/main/skills/data/ops-llm
Metadata
Additional technical details for this skill
- short description
- Local LLM health checks and cache management
SKILL.md
LLM Ops
Manage local LLM runtimes and caches.
Commands
bash
# Check all common LLM endpoints (Ollama, vLLM, SGLang)
./scripts/health.sh
# Check specific endpoint
./scripts/health.sh --target ollama:http://127.0.0.1:11434
# Continue even if some fail
./scripts/health.sh --warn-only
# Show cache sizes (dry-run)
./scripts/cache-clean.sh
# Actually clean caches
./scripts/cache-clean.sh --execute
# Clean additional path
./scripts/cache-clean.sh --path ~/.cache/torch --execute
Default Endpoints Checked
- Ollama:
http://127.0.0.1:11434 - vLLM:
http://127.0.0.1:8000 - SGLang:
http://127.0.0.1:30000
Default Cache Directories
~/.cache/ollama~/.cache/huggingface~/.cache/vllm
Environment Variables
| Variable | Default | Description |
|---|---|---|
LLM_HEALTH_TIMEOUT |
2 | Seconds to wait per endpoint |
LLM_CACHE_DIRS |
(see above) | Space-separated cache paths |
Didn't find tool you were looking for?