MCPs tagged with web-scraping
-
WebScraping.AI MCP Server
MCP server for advanced web scraping and AI-driven data extraction
WebScraping.AI MCP Server implements the Model Context Protocol to provide web data extraction and question answering functionalities. It integrates with WebScraping.AI to offer robust tools for retrieving, rendering, and parsing web content, including structured data and natural language answers from web pages. It supports JavaScript rendering, proxy management, device emulation, and custom extraction configurations, making it suitable for both individual and team deployments in AI-assisted workflows.
- ⭐ 33
- MCP
- webscraping-ai/webscraping-ai-mcp-server
-
Web Analyzer MCP
Intelligent web content analysis and summarization via MCP.
Web Analyzer MCP is an MCP-compliant server designed for intelligent web content analysis and summarization. It leverages FastMCP to perform advanced web scraping, content extraction, and AI-powered question-answering using OpenAI models. The tool integrates with various developer IDEs, offering structured markdown output, essential content extraction, and smart Q&A functionality. Its features streamline content analysis workflows and support flexible model selection.
- ⭐ 2
- MCP
- kimdonghwi94/web-analyzer-mcp
-
GeekNews MCP Server
Fetches and serves GeekNews articles and weekly news via Model Context Protocol.
GeekNews MCP Server provides an MCP-compliant server to extract articles and weekly news from GeekNews, using Python and BeautifulSoup for web scraping. It supports scheduled automatic parsing and caching of data to minimize server load and ensure efficient retrieval. The server exposes tools to obtain articles by type and weekly news, offering structured metadata for each item. Integration is facilitated via Smithery and is compatible with the MCP ecosystem.
- ⭐ 16
- MCP
- the0807/GeekNews-MCP-Server
-
mcp-read-website-fast
Fast, token-efficient web content extraction and Markdown conversion for AI agents.
Provides a Model Context Protocol (MCP) compatible server that rapidly fetches web pages, removes noise, and converts content to clean Markdown with link preservation. Designed for local use by AI-powered tools like IDEs and large language models, it offers optimized token usage, concurrency, polite crawling, and smart caching. Integrates with Claude Code, VS Code, JetBrains IDEs, Cursor, and other MCP clients.
- ⭐ 111
- MCP
- just-every/mcp-read-website-fast
-
Fetcher MCP
Intelligent web content fetching and extraction using Playwright.
Fetcher MCP is a server that fetches and extracts web page content using the Playwright headless browser while supporting the Model Context Protocol. It intelligently processes dynamic web pages with JavaScript, employs the Readability algorithm to extract main content, and supports output in both HTML and Markdown formats. Designed for seamless integration with AI model environments, it offers robust parallel processing, resource optimization, and flexible deployment options including Docker.
- ⭐ 906
- MCP
- jae-jae/fetcher-mcp
-
MCP Server for the RAG Web Browser Actor
Local MCP server enabling LLMs to browse and extract web content via RAG Web Browser integration.
Implements a Model Context Protocol (MCP) server designed for integration with LLMs and Retrieval-Augmented Generation (RAG) pipelines, providing automated web search and web page extraction capabilities. Runs locally and interacts with the RAG Web Browser Actor in Standby mode, responding to queries by fetching, scraping, and returning cleaned content from the web as Markdown. Supports Google Search queries and direct URL fetching through a standardized 'search' tool interface, offering multiple output formats and programmable arguments. Deprecated in favor of mcp.apify.com but illustrates MCP server use for local web browsing integrations.
- ⭐ 194
- MCP
- apify/mcp-server-rag-web-browser
-
Website Downloader MCP Server
Download and archive entire websites as local, browsable directories via MCP.
Website Downloader MCP Server provides a Model Context Protocol (MCP) compatible server that enables users to download entire websites using wget. It preserves the website structure, rewrites internal links for local browsing, and allows for custom download depth and output directories. The server is intended for integration into larger systems or AI workflows that utilize MCP servers. Installation instructions and usage examples are provided for multiple platforms.
- ⭐ 144
- MCP
- pskill9/website-downloader