Agent skill

web-to-markdown

Use ONLY when the user explicitly says: 'use the skill web-to-markdown ...' (or 'use a skill web-to-markdown ...'). Converts webpage URLs to clean Markdown by calling the local web2md CLI (Puppeteer + Readability), suitable for JS-rendered pages.

View SKILL.md on GitHub Repository

Stars 23,776

Forks 2,298

Install this agent skill to your Project

npx add-skill https://github.com/davila7/claude-code-templates/tree/main/cli-tool/components/skills/development/web-to-markdown

Metadata

Additional technical details for this skill

version: 0.1.0

SKILL.md

web-to-markdown

Convert web pages to clean Markdown by driving a locally installed browser (via web2md).

Hard trigger gate (must enforce)

This skill MUST NOT be used unless the user explicitly wrote exactly a phrase like:

use the skill web-to-markdown ...
use a skill web-to-markdown ...

If the user did not explicitly request this skill by name, stop and ask them to re-issue the request including: use the skill web-to-markdown.

What this skill does

Handles JS-rendered pages (Puppeteer → user Chrome).
Works best with Chromium-family browsers (Chrome/Chromium/Brave/Edge) via puppeteer-core.
Extracts main content (Readability).
Converts to Markdown (Turndown) with cleaned links and optional YAML frontmatter.

Non-goals

Do not use Playwright or other browser automation stacks; the mechanism is web2md.

Inputs you should collect (ask only if missing)

url (or a list of URLs)
Output preference:
- Print to stdout (--print), OR
- Save to a file (--out ./file.md), OR
- Save to a directory (--out ./some-dir/ to auto-name by page title)
Optional rendering controls for tricky pages:
- --chrome-path <path> (if Chrome auto-detection fails)
- --interactive (show Chrome and pause so the user can complete human checks/login, then press Enter)
- --wait-until load|domcontentloaded|networkidle0|networkidle2
- --wait-for '<css selector>'
- --wait-ms <milliseconds>
- --headful (debug)
- --no-sandbox (sometimes required in containers/CI)
- --user-data-dir <dir> (login/session; use a dedicated profile directory)

Workflow

Confirm the user explicitly invoked the skill (use the skill web-to-markdown).
Validate URL(s) start with http:// or https://.
Ensure web2md is installed:
- Run: command -v web2md
- If missing, instruct the user to install it:
  - If available via npm: npm install -g web2md
  - If from source: Clone the repository, then run npm install && npm run build && npm link
Convert:
- Single URL → file:
  - web2md '<url>' --out ./page.md
- Single URL → auto-named file in directory:
  - mkdir -p ./out && web2md '<url>' --out ./out/
- Human verification / login walls (interactive):
  - mkdir -p ./out && web2md '<url>' --interactive --user-data-dir ./tmp/web2md-profile --out ./out/
  - Then: complete the check in the browser window and press Enter in the terminal to continue.
- Print to stdout:
  - web2md '<url>' --print
- Multiple URLs (batch):
  - Create output dir (e.g. ./out/) then run one web2md command per URL using --out ./out/
Validate output:
- If writing files, verify they exist and are non-empty (e.g. ls -la <path> and wc -c <path>).
Return:
- The saved file path(s), or the Markdown (stdout mode).

Defaults (recommended)

For most pages: --wait-until networkidle2
For heavy apps: start with --wait-until domcontentloaded --wait-ms 2000, then add --wait-for 'main' (or another stable selector) if needed.

Maintainer

davila7 Core maintainer

Source details

Full Name: davila7/claude-code-templates
Branch: main
Path in repo: cli-tool/components/skills/development/web-to-markdown
License: MIT License
Topics: claude-code anthropic anthropic-claude claude

Featured Tools

Join Our Newsletter

Stay updated with the latest AI tools, news, and offers by subscribing to our weekly newsletter.

Recommended Agent Skills

Expand your agent's capabilities with these related and highly-rated skills.

davila7/claude-code-templates

verl-rl-training

Provides guidance for training LLMs with reinforcement learning using verl (Volcano Engine RL). Use when implementing RLHF, GRPO, PPO, or other RL algorithms for LLM post-training at scale with flexible infrastructure backends.

23,776 2,298

Explore

davila7/claude-code-templates

openrlhf-training

High-performance RLHF framework with Ray+vLLM acceleration. Use for PPO, GRPO, RLOO, DPO training of large models (7B-70B+). Built on Ray, vLLM, ZeRO-3. 2× faster than DeepSpeedChat with distributed architecture and GPU resource sharing.

23,776 2,298

Explore

davila7/claude-code-templates

gguf-quantization

GGUF format and llama.cpp quantization for efficient CPU/GPU inference. Use when deploying models on consumer hardware, Apple Silicon, or when needing flexible quantization from 2-8 bit without GPU requirements.

23,776 2,298

Explore

davila7/claude-code-templates

Claude Code Guide

Master guide for using Claude Code effectively. Includes configuration templates, prompting strategies "Thinking" keywords, debugging techniques, and best practices for interacting with the agent.

23,776 2,298

Explore

davila7/claude-code-templates

qdrant-vector-search

High-performance vector similarity search engine for RAG and semantic search. Use when building production RAG systems requiring fast nearest neighbor search, hybrid search with filtering, or scalable vector storage with Rust-powered performance.

23,776 2,298

Explore

davila7/claude-code-templates

behavioral-modes

AI operational modes (brainstorm, implement, debug, review, teach, ship, orchestrate). Use to adapt behavior based on task type.

23,776 2,298

Explore

Didn't find tool you were looking for?

Search AI Tools

Install this agent skill to your Project

Metadata

SKILL.md

web-to-markdown

Hard trigger gate (must enforce)

What this skill does

Non-goals

Inputs you should collect (ask only if missing)

Workflow

Defaults (recommended)

Recommended Agent Skills

verl-rl-training

openrlhf-training

gguf-quantization

Claude Code Guide

qdrant-vector-search

behavioral-modes