Agent skill
article-extractor
Extract clean article content from URLs and save as markdown. Triggers when user provides a webpage URL and wants to download it, extract content, get a clean version without ads, capture an article for offline reading, save an article, grab content from a page, archive a webpage, clip an article, or read something later. Handles blog posts, news articles, tutorials, documentation pages, and similar web content. Supports Wayback Machine for dead links or paywalled content. This skill handles the entire workflow - do NOT use web_fetch or other tools first, just call the extraction script directly with the URL.
Install this agent skill to your Project
npx add-skill https://github.com/jrajasekera/claude-skills/tree/main/skills/article-extractor
SKILL.md
Article Extractor
Extract clean article content from URLs, removing ads, navigation, and clutter. Multi-tool fallback ensures reliability.
Workflow
When user provides a URL to download/extract:
- Call the extraction script directly with the URL (do NOT fetch the URL first with web_fetch)
- Script handles fetching, extraction, and saving automatically
- Returns clean markdown file with frontmatter
Usage
# Basic extraction
scripts/extract-article.sh "https://example.com/article"
# Specify output location
scripts/extract-article.sh "https://example.com/article" -o my-article.md -d ~/Documents
# Try Wayback Machine if original fails
scripts/extract-article.sh "https://example.com/article" --wayback
Make script executable if needed: chmod +x scripts/extract-article.sh
Key Options
-o <file>- Output filename-d <dir>- Output directory-w, --wayback- Try Wayback Machine if extraction fails-t <tool>- Force tool:jina,trafilatura,readability,fallback-q- Quiet mode
For complete options, exit codes, tool details, and examples, see references/tools-and-options.md.
Common Failures
- Exit 3 (access denied): Paywall or login required - try
--wayback - Exit 4 (no content): Heavy JavaScript - try different
--tool - Exit 2 (network): Connection issue - check URL
Local Tools (Optional)
For offline extraction: scripts/install-deps.sh
Recommended Agent Skills
Expand your agent's capabilities with these related and highly-rated skills.
z-ai-api
Z.ai API integration for building applications with GLM models. Use when working with Z.ai/ZhipuAI APIs for: (1) Chat completions with GLM-4.7/4.6/4.5 models, (2) Vision/multimodal tasks with GLM-4.6V, (3) Image generation with GLM-Image or CogView-4, (4) Video generation with CogVideoX-3 or Vidu models, (5) Audio transcription with GLM-ASR-2512, (6) Function calling and tool use, (7) Web search integration, (8) Translation, slide/poster generation agents. Triggers: Z.ai, ZhipuAI, GLM, BigModel, Zhipu, CogVideoX, CogView, Vidu.
openrouter-api
OpenRouter API integration for unified access to 400+ LLM models from 70+ providers. Use when building applications that need to call OpenRouter's API for chat completions, streaming, tool calling, structured outputs, or model routing. Triggers on OpenRouter, model routing, multi-model, provider fallbacks, or when users need to access multiple LLM providers through a single API.
sqlite-optimization
Optimize SQLite database performance through configuration, schema design, indexing, and query tuning. Use when users ask to improve SQLite speed, reduce latency, optimize queries, configure PRAGMAs, fix slow queries, handle concurrency, optimize writes/inserts, or tune SQLite for production. Triggers on mentions of SQLite performance, slow queries, PRAGMA settings, WAL mode, indexing strategies, bulk inserts, or database maintenance (VACUUM, ANALYZE).
codex-review
Use after creating design docs or implementation plans to get cross-agent review from Codex. Auto-triggers for non-trivial plans; asks first for simple changes. Captures feedback, addresses critical issues, presents minor concerns for user decision.
pandoc-converter
Convert documents between formats using Pandoc. Use when the user asks to convert files between formats like markdown, docx, html, pdf, latex, epub, rtf, csv, xlsx, or pptx. Triggers on requests like "convert this to Word", "export as PDF", "turn this markdown into HTML", or "convert the CSV to a table".
venice-ai-api
Venice.ai API integration for privacy-first AI applications. Use when building applications with Venice.ai API for chat completions, image generation, video generation, text-to-speech, speech-to-text, or embeddings. Triggers on Venice, Venice.ai, uncensored AI, privacy-first AI, or when users need OpenAI-compatible API with uncensored models.
Didn't find tool you were looking for?