Agent skill
guard
Protect Claude Code sessions from context overflow by running a background daemon that monitors session size and auto-prunes before compaction hits. Use when the user says "guard", "protect session", "context getting long", "prevent compaction", "session management", or is running agent teams that need continuous context protection.
Install this agent skill to your Project
npx add-skill https://github.com/Ruya-AI/cozempic/tree/main/plugin/skills/guard
SKILL.md
Start the cozempic guard daemon for continuous session protection.
Default (recommended)
cozempic guard --daemon --threshold 50 -rx standard --interval 30
This runs in the background and:
- Checkpoints team state every 30 seconds
- At 60% of threshold (30MB): applies gentle prune, no reload
- At threshold (50MB): applies full prescription + auto-reload with team state preserved
For agent teams
Guard mode is essential for sessions running agent teams. Without it, auto-compaction triggers and the lead agent loses team state (TeamCreate, SendMessage, tasks are discarded).
Options
--threshold N— hard threshold in MB (default: 50)--soft-threshold N— soft threshold in MB (default: 60% of hard)--threshold-tokens N— hard threshold in tokens (fires whichever hits first)--no-reload— prune without restarting Claude--no-reactive— disable kqueue/polling file watcher-rx NAME— prescription at hard threshold (default: standard)
Check status and stop
The daemon writes to /tmp/cozempic_guard_*.log. Check with:
ls /tmp/cozempic_guard_*.pid 2>/dev/null # is it running?
tail -20 /tmp/cozempic_guard_*.log 2>/dev/null # recent activity
kill "$(cat /tmp/cozempic_guard_*.pid)" # stop it
Recommended Agent Skills
Expand your agent's capabilities with these related and highly-rated skills.
reload
Treat the current session and auto-resume in a new terminal window.
treat
Prune bloated session with a prescription. Removes progress ticks, stale reads, duplicate content, and more.
diagnose
Analyze Claude Code session bloat — shows token count, context usage %, and bloat breakdown. Use when the user asks about session size, context usage, or when you notice the context window is getting full.
doctor
Run health checks on Claude Code configuration and sessions. Use when troubleshooting Claude Code issues.
verl-rl-training
Provides guidance for training LLMs with reinforcement learning using verl (Volcano Engine RL). Use when implementing RLHF, GRPO, PPO, or other RL algorithms for LLM post-training at scale with flexible infrastructure backends.
openrlhf-training
High-performance RLHF framework with Ray+vLLM acceleration. Use for PPO, GRPO, RLOO, DPO training of large models (7B-70B+). Built on Ray, vLLM, ZeRO-3. 2× faster than DeepSpeedChat with distributed architecture and GPU resource sharing.
Didn't find tool you were looking for?