Markdownify MCP Server

Markdownify MCP Server

Convert diverse files and web content into Markdown via the Model Context Protocol.

2,256
Stars
188
Forks
2,256
Watchers
15
Issues
Markdownify MCP Server offers a protocol-based server that transforms various file types—including PDF, images, audio, DOCX, XLSX, and PPTX—as well as web content like YouTube videos, Bing search results, and web pages into Markdown format. The server exposes a suite of conversion tools through a standardized interface for easy integration with applications. Optional configuration allows retrieval of Markdown files from restricted directories, and the platform supports development customization for additional tool integration. Deployment and operation are straightforward with cross-platform support (with pending Windows improvements).

Key Features

Converts PDF, image, audio, DOCX, XLSX, and PPTX files to Markdown
Extracts YouTube transcripts and web page content as Markdown
Transforms Bing search results into Markdown format
Retrieves existing Markdown files with directory restriction option
Extensible toolset for adding new conversions
Configurable server integration for desktop apps
Supports audio transcription to Markdown
Image metadata extraction and Markdown conversion
Automated build and deployment with TypeScript and PNPM
MIT licensed and open to contributions

Use Cases

Automated document archiving in Markdown format
Preparing meeting or lecture transcripts from audio and video for note-taking apps
Extracting and standardizing web research into Markdown for knowledge management
Batch conversion of office documents for collaborative editing or publishing
Centralizing markdown conversion services for teams or platforms
Integrating markdown extraction into desktop knowledge management tools
Transcribing and converting images with metadata for research or documentation
Retrieving and sharing Markdown files from controlled directories
Supporting multi-source content ingestion pipelines
Powering markdown-based digital libraries or content management systems

README

Markdownify MCP Server

Help! I need someone with a Windows computer to help me add support for Markdownify-MCP on Windows. PRs exist but I cannot test them. Post here if interested.

markdownify mcp logo

Markdownify is a Model Context Protocol (MCP) server that converts various file types and web content to Markdown format. It provides a set of tools to transform PDFs, images, audio files, web pages, and more into easily readable and shareable Markdown text.

Features

  • Convert multiple file types to Markdown:
    • PDF
    • Images
    • Audio (with transcription)
    • DOCX
    • XLSX
    • PPTX
  • Convert web content to Markdown:
    • YouTube video transcripts
    • Bing search results
    • General web pages
  • Retrieve existing Markdown files

Getting Started

  1. Clone this repository
  2. Install dependencies:
    pnpm install
    

Note: this will also install uv and related Python depdencies.

  1. Build the project:
    pnpm run build
    
  2. Start the server:
    pnpm start
    

Development

  • Use pnpm run dev to start the TypeScript compiler in watch mode
  • Modify src/server.ts to customize server behavior
  • Add or modify tools in src/tools.ts

Usage with Desktop App

To integrate this server with a desktop app, add the following to your app's server configuration:

js
{
  "mcpServers": {
    "markdownify": {
      "command": "node",
      "args": [
        "{ABSOLUTE PATH TO FILE HERE}/dist/index.js"
      ],
      "env": {
        // By default, the server will use the default install location of `uv`
        "UV_PATH": "/path/to/uv"
      }
    }
  }
}

Available Tools

  • youtube-to-markdown: Convert YouTube videos to Markdown

  • pdf-to-markdown: Convert PDF files to Markdown

  • bing-search-to-markdown: Convert Bing search results to Markdown

  • webpage-to-markdown: Convert web pages to Markdown

  • image-to-markdown: Convert images to Markdown with metadata

  • audio-to-markdown: Convert audio files to Markdown with transcription

  • docx-to-markdown: Convert DOCX files to Markdown

  • xlsx-to-markdown: Convert XLSX files to Markdown

  • pptx-to-markdown: Convert PPTX files to Markdown

  • get-markdown-file: Retrieve an existing Markdown file. File extension must end with: *.md, *.markdown.

    OPTIONAL: set MD_SHARE_DIR env var to restrict the directory from which files can be retrieved, e.g. MD_SHARE_DIR=[SOME_PATH] pnpm run start

Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

License

This project is licensed under the MIT License - see the LICENSE file for details.

Star History

Star History Chart

Repository Owner

zcaceres
zcaceres

User

Repository Details

Language TypeScript
Default Branch main
Size 1,581 KB
Contributors 6
License MIT License
MCP Verified Nov 12, 2025

Programming Languages

TypeScript
86.95%
Dockerfile
5.28%
JavaScript
4.05%
Shell
1.87%
Batchfile
1.85%

Tags

Topics

ai anthropic anthropic-ai anthropic-claude markdown mcp model-context-protocol ocr tools

Join Our Newsletter

Stay updated with the latest AI tools, news, and offers by subscribing to our weekly newsletter.

We respect your privacy. Unsubscribe at any time.

Related MCPs

Discover similar Model Context Protocol servers

  • MarkItDown

    MarkItDown

    Convert diverse files into Markdown for seamless LLM integration.

    MarkItDown is a lightweight Python utility for converting a wide range of file types—including PDF, Office documents, images, audio, websites, and more—into structured Markdown optimized for language models and text analysis tools. It includes an implementation of the Model Context Protocol (MCP) to facilitate integration with LLM applications, such as Claude Desktop. MarkItDown supports context-aware document conversions, prioritizing preservation of hierarchy and meaningful content, and can be used via CLI or as a library.

    • 82,918
    • MCP
    • microsoft/markitdown
  • Markmap MCP Server

    Markmap MCP Server

    Convert Markdown to interactive mind maps via the Model Context Protocol.

    Markmap MCP Server enables seamless conversion of Markdown content into interactive mind maps using the Model Context Protocol (MCP). It leverages the open-source markmap project and provides users with diverse export formats including PNG, JPG, and SVG. Designed for easy integration with MCP clients, it offers tools for automated browser previews, rich interactivity, and batch mind map generation. The server can be installed easily via npm or Smithery and supports configurable output directories.

    • 137
    • MCP
    • jinzcdev/markmap-mcp-server
  • MCP Content Summarizer Server

    MCP Content Summarizer Server

    Intelligent multi-format content summarization via MCP interface.

    MCP Content Summarizer Server provides intelligent summarization of various content types including text, web pages, PDF documents, and EPUB books using Google's Gemini 1.5 Pro model. Through the Model Context Protocol, it supports customizable, multi-language summaries with options for style and focus. It is designed for integration with applications as an MCP server and offers tools for both summarization and testing. The solution maintains key information while producing concise and context-aware summaries from diverse content sources.

    • 142
    • MCP
    • 0xshellming/mcp-summarizer
  • PDF Tools MCP

    PDF Tools MCP

    Comprehensive PDF manipulation via MCP protocol.

    PDF Tools MCP provides an extensive suite of PDF manipulation operations using the Model Context Protocol framework. It supports both local and remote PDF tasks, such as rendering pages, merging, extracting metadata, retrieving text, and combining documents. The tool registers endpoints through the MCP protocol, enabling seamless server-based PDF processing for various clients. Built with Python, it emphasizes secure handling and compatibility with Claude Desktop via the Smithery ecosystem.

    • 31
    • MCP
    • danielkennedy1/pdf-tools-mcp
  • mcp-read-website-fast

    mcp-read-website-fast

    Fast, token-efficient web content extraction and Markdown conversion for AI agents.

    Provides a Model Context Protocol (MCP) compatible server that rapidly fetches web pages, removes noise, and converts content to clean Markdown with link preservation. Designed for local use by AI-powered tools like IDEs and large language models, it offers optimized token usage, concurrency, polite crawling, and smart caching. Integrates with Claude Code, VS Code, JetBrains IDEs, Cursor, and other MCP clients.

    • 111
    • MCP
    • just-every/mcp-read-website-fast
  • Graphlit MCP Server

    Graphlit MCP Server

    Integrate and unify knowledge sources for RAG-ready AI context with the Graphlit MCP Server.

    Graphlit MCP Server provides a Model Context Protocol interface, enabling seamless integration between MCP clients and the Graphlit platform. It supports ingestion from a wide array of sources such as Slack, Discord, Google Drive, email, Jira, and GitHub, turning them into a searchable, RAG-ready knowledge base. Built-in tools allow for document, media extraction, web crawling, and web search, as well as advanced retrieval and publishing functionalities. The server facilitates easy configuration, sophisticated data operations, and automated notifications for diverse workflows.

    • 369
    • MCP
    • graphlit/graphlit-mcp-server
  • Didn't find tool you were looking for?

    Be as detailed as possible for better results