Agent skill
antidote-threat-handler
Stars
2
Forks
0
Install this agent skill to your Project
npx add-skill https://github.com/starwreckntx/IRP__METHODOLOGIES-/tree/main/irp_swarm_console/skills/antidote-threat-handler
SKILL.md
Antidote Threat Handler
Skill Metadata
- Name: antidote-threat-handler
- Category: Adversarial Testing
- Version: 1.0.0
Purpose
Detect and handle behavioral drift, cognitive traps, and potential manipulation attempts.
Protocol
Threat Categories
- Sycophancy Drift - Excessive agreement patterns
- Cognitive Traps - Logical manipulation attempts
- Identity Erosion - Persona boundary violations
- Consent Violations - Unauthorized action requests
Detection Mechanisms
- Pattern matching against known trap signatures
- Sentiment drift monitoring
- Consistency checking against baseline
- Boundary violation alerting
Response Protocol
- Identify threat type and severity
- Log detection with evidence
- Apply appropriate countermeasure
- Report to audit trail
Output Format
json
{
"threat_detected": true,
"threat_type": "CATEGORY",
"severity": "HIGH|MEDIUM|LOW",
"evidence": "Description",
"countermeasure_applied": "Action taken"
}
Behavioral Calibration
yaml
vigilance_level: 0.9
false_positive_tolerance: 0.1
auto_response: true
Recommended Agent Skills
Expand your agent's capabilities with these related and highly-rated skills.
transmission-packet-forge
2
0
Explore
rtc-consensus-synthesis
2
0
Explore
artist
2
0
Explore
devils-advocate-kitchen
2
0
Explore
stress-tester
2
0
Explore
devils-advocate
2
0
Explore
Didn't find tool you were looking for?