MCPs tagged with content extraction
-
ONES Wiki MCP Server
Spring AI MCP-based service for extracting and transforming ONES Wiki content for AI applications.
ONES Wiki MCP Server provides an MCP-compliant service built on Spring AI MCP for retrieving and converting ONES Wiki content into structured, AI-friendly formats. It supports authentication with ONES platform, automatic translation of Wiki URLs to API endpoints, and outputs processed content as Markdown. The service can be configured through properties, command line arguments, or environment variables, and integrates with MCP-compatible clients such as Claude Desktop.
- ⭐ 2
- MCP
- brianxiadong/ones-wiki-mcp-server
-
Web Analyzer MCP
Intelligent web content analysis and summarization via MCP.
Web Analyzer MCP is an MCP-compliant server designed for intelligent web content analysis and summarization. It leverages FastMCP to perform advanced web scraping, content extraction, and AI-powered question-answering using OpenAI models. The tool integrates with various developer IDEs, offering structured markdown output, essential content extraction, and smart Q&A functionality. Its features streamline content analysis workflows and support flexible model selection.
- ⭐ 2
- MCP
- kimdonghwi94/web-analyzer-mcp
-
DuckDuckGo Search MCP Server
A Model Context Protocol server for DuckDuckGo web search and intelligent content retrieval.
DuckDuckGo Search MCP Server provides web search capabilities through DuckDuckGo, with advanced content fetching and parsing tailored for large language models. It supports rate limiting, error handling, and delivers results in an LLM-friendly format. The server is designed for seamless integration with AI applications and tools like Claude Desktop, enabling enhanced web search and content extraction through the Model Context Protocol.
- ⭐ 637
- MCP
- nickclyde/duckduckgo-mcp-server
-
MCP Web Research Server
Bring real-time web research and Google search capabilities into Claude using MCP.
MCP Web Research Server acts as a Model Context Protocol (MCP) server, seamlessly integrating web research functionalities with Claude Desktop. It enables Google search, webpage content extraction, research session tracking, and screenshot capture, all accessible directly from Claude. The server supports interactive and guided research sessions, exposing session data and screenshots as MCP resources for enhanced context-aware AI interactions.
- ⭐ 284
- MCP
- mzxrai/mcp-webresearch
-
Content Core
AI-powered content extraction and processing platform with seamless model context integration.
Content Core is an AI-driven platform for extracting, formatting, transcribing, and summarizing content from a wide variety of sources including documents, media files, web pages, images, and archives. It offers intelligent auto-detection and engine selection to optimize processing, and provides integrations via CLI, Python library, Raycast extension, macOS Services, and the Model Context Protocol (MCP). The platform supports context-aware AI summaries and direct integration with Claude through MCP for enhanced user workflows. Users can access zero-install options and benefit from enhanced processing capabilities such as advanced PDF parsing, OCR, and smart summarization.
- ⭐ 85
- MCP
- lfnovo/content-core
-
mcp-read-website-fast
Fast, token-efficient web content extraction and Markdown conversion for AI agents.
Provides a Model Context Protocol (MCP) compatible server that rapidly fetches web pages, removes noise, and converts content to clean Markdown with link preservation. Designed for local use by AI-powered tools like IDEs and large language models, it offers optimized token usage, concurrency, polite crawling, and smart caching. Integrates with Claude Code, VS Code, JetBrains IDEs, Cursor, and other MCP clients.
- ⭐ 111
- MCP
- just-every/mcp-read-website-fast
-
Fetcher MCP
Intelligent web content fetching and extraction using Playwright.
Fetcher MCP is a server that fetches and extracts web page content using the Playwright headless browser while supporting the Model Context Protocol. It intelligently processes dynamic web pages with JavaScript, employs the Readability algorithm to extract main content, and supports output in both HTML and Markdown formats. Designed for seamless integration with AI model environments, it offers robust parallel processing, resource optimization, and flexible deployment options including Docker.
- ⭐ 906
- MCP
- jae-jae/fetcher-mcp
-
SearXNG MCP Server
MCP-compliant server integrating the SearXNG API for advanced web search capabilities
SearXNG MCP Server implements the Model Context Protocol and integrates the SearXNG API to provide extensive web search functionalities. It features intelligent caching, advanced content extraction, and multiple configurable search parameters such as language, time range, and safe search levels. The server exposes tools for both web searching and URL content reading, supporting detailed output customization through input parameters. Designed for seamless MCP deployments, it supports Docker and NPX-based installation with rich configuration options.
- ⭐ 321
- MCP
- ihor-sokoliuk/mcp-searxng
-
Open-WebSearch MCP Server
Multi-engine web search MCP server without API keys
Open-WebSearch MCP Server is a Model Context Protocol (MCP) compliant server offering web search functionalities using multiple search engines without the need for API keys or authentication. It provides structured search results with titles, URLs, and descriptions, and enables fetching of article content from supported sources such as CSDN and GitHub. The server supports extensive configuration through environment variables, including proxy settings and search engine customization. Designed for flexibility, it operates in both HTTP and stdio modes, making it suitable for integration into larger systems.
- ⭐ 463
- MCP
- Aas-ee/open-webSearch
-
MCP Content Summarizer Server
Intelligent multi-format content summarization via MCP interface.
MCP Content Summarizer Server provides intelligent summarization of various content types including text, web pages, PDF documents, and EPUB books using Google's Gemini 1.5 Pro model. Through the Model Context Protocol, it supports customizable, multi-language summaries with options for style and focus. It is designed for integration with applications as an MCP server and offers tools for both summarization and testing. The solution maintains key information while producing concise and context-aware summaries from diverse content sources.
- ⭐ 142
- MCP
- 0xshellming/mcp-summarizer