MCPs tagged with document processing
-
Content Core
AI-powered content extraction and processing platform with seamless model context integration.
Content Core is an AI-driven platform for extracting, formatting, transcribing, and summarizing content from documents, media files, web pages, images, and archives. It auto-detects the input type and selects a processing engine accordingly, and it is available through a CLI, a Python library, a Raycast extension, macOS Services, and the Model Context Protocol (MCP). Through MCP it integrates directly with Claude and can produce context-aware AI summaries. Zero-install options are available, and processing features include advanced PDF parsing, OCR, and summarization.
- ⭐ 85
- MCP
- lfnovo/content-core
-
Graphlit MCP Server
Integrate and unify knowledge sources for RAG-ready AI context with the Graphlit MCP Server.
Graphlit MCP Server provides a Model Context Protocol interface that connects MCP clients to the Graphlit platform. It supports ingestion from sources such as Slack, Discord, Google Drive, email, Jira, and GitHub, turning them into a searchable, RAG-ready knowledge base. Built-in tools cover document and media extraction, web crawling, and web search, as well as retrieval and publishing. The server also supports configuration, data operations, and automated notifications.
- ⭐ 369
- MCP
- graphlit/graphlit-mcp-server
-
MarkItDown
Convert diverse files into Markdown for seamless LLM integration.
MarkItDown is a lightweight Python utility for converting a wide range of file types—including PDF, Office documents, images, audio, websites, and more—into structured Markdown optimized for language models and text analysis tools. It includes an implementation of the Model Context Protocol (MCP) to facilitate integration with LLM applications, such as Claude Desktop. MarkItDown supports context-aware document conversions, prioritizing preservation of hierarchy and meaningful content, and can be used via CLI or as a library.
- ⭐ 82,918
- MCP
- microsoft/markitdown
-
Markdownify MCP Server
Convert diverse files and web content into Markdown via the Model Context Protocol.
Markdownify MCP Server is an MCP server that converts various file types (PDF, images, audio, DOCX, XLSX, and PPTX) and web content such as YouTube videos, Bing search results, and web pages into Markdown. It exposes a set of conversion tools through a standardized interface for integration with applications. An optional setting restricts Markdown file retrieval to a specific directory, and the server can be extended with additional tools. It runs cross-platform, with Windows improvements still pending.
- ⭐ 2,256
- MCP
- zcaceres/markdownify-mcp
-
mcp-pinecone
A Pinecone-backed Model Context Protocol server for semantic search and document management.
mcp-pinecone implements a Model Context Protocol (MCP) server that integrates with Pinecone indexes for use with clients such as Claude Desktop. It provides tools for semantic search and for reading, listing, and processing documents within a Pinecone vector database. The server supports operations such as embedding, chunking, and upserting records, which makes it possible to manage large document sets as context. It is designed for easy installation and for interoperability via the MCP standard.
- ⭐ 150
- MCP
- sirmews/mcp-pinecone
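The chunking step mentioned in the mcp-pinecone description can be illustrated with a minimal sketch. This is not the server's actual implementation: the function name `chunk_text` and the parameters `max_chars` and `overlap` are hypothetical, and fixed-size character windows are just one simple strategy, assumed here for illustration.

```python
def chunk_text(text: str, max_chars: int = 200, overlap: int = 20) -> list[str]:
    """Split text into fixed-size character windows that overlap.

    Hypothetical helper for illustration only; a real server may chunk
    by tokens, sentences, or document structure instead.
    """
    if max_chars <= 0:
        raise ValueError("max_chars must be positive")
    if not 0 <= overlap < max_chars:
        raise ValueError("overlap must satisfy 0 <= overlap < max_chars")

    step = max_chars - overlap  # how far each window advances
    chunks = []
    for start in range(0, len(text), step):
        chunks.append(text[start:start + max_chars])
        # Stop once a window reaches the end of the text, so the tail
        # is not emitted again as a shorter duplicate chunk.
        if start + max_chars >= len(text):
            break
    return chunks


if __name__ == "__main__":
    for piece in chunk_text("abcdefghij", max_chars=4, overlap=1):
        print(piece)
```

Each chunk would then typically be embedded and upserted as its own record. The overlap keeps text that straddles a chunk boundary retrievable from either side.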