Agent skill

alma-scraper

Intelligent scraper for Australian youth justice sources. Discovers, extracts, and learns from government, Indigenous, research, and media sources.

Stars 163
Forks 31

Install this agent skill to your Project

npx add-skill https://github.com/majiayu000/claude-skill-registry/tree/main/skills/data/alma-scraper

SKILL.md

ALMA Intelligent Scraper

When to Use

  • Finding new youth justice information
  • Updating ALMA intelligence
  • Discovering new sources
  • Analyzing coverage gaps
  • Checking what's new in youth justice

Commands

Command Purpose Duration
quick Top 10 high-value sources 5 min
deep All 50+ sources with discovery 30-60 min
discover Follow discovered links Variable
source "QLD" Deep dive specific jurisdiction 15 min
gaps Show coverage gaps 2 min
status Current knowledge state Instant

Learning Cycle

SCRAPE → EXTRACT → EVALUATE → LEARN → STORE
         (Claude)   (Quality)  (Patterns)

Quality Signals

Signal Weight
Relevance (AU youth justice?) 30%
Novelty (new info?) 25%
Specificity (concrete details?) 20%
Evidence (research backed?) 15%
Actionability (useful?) 10%

Priority Formula

priority = (quality × 0.4) + (freshness_need × 0.3) + (coverage_gap × 0.3)

Sacred Boundaries

Never scrape: Private info, court records, social media, paywalled Always mark: Community Controlled, Indigenous orgs, cultural knowledge Always check: Consent level, cultural authority, data sovereignty

File References

Need Reference
Database schema references/database-schema.md
Extraction patterns references/extraction-patterns.md
Coverage tracking references/coverage-tracking.md
Implementation code references/implementation.md

Didn't find tool you were looking for?

Be as detailed as possible for better results