MCPs
Scrapeless MCP Server

Scrapeless MCP Server

A real-time web integration layer for LLMs and AI agents built on the open MCP standard.

Stars

Forks

Watchers

Issues

Scrapeless MCP Server is a powerful integration layer enabling large language models, AI agents, and applications to interact with the web in real time. Built on the open Model Context Protocol, it facilitates seamless connections between models like ChatGPT, Claude, and tools such as Cursor to external web capabilities, including Google services, browser automation, and advanced data extraction. The system supports multiple transport modes and is designed to provide dynamic, real-world context to AI workflows. Robust scraping, dynamic content handling, and flexible export formats are core parts of the feature set.

Key Features

Real-time web interaction for AI models

Support for Google Search and Trends integration

Browser automation including navigation, clicking, and scrolling

Scraping of dynamic, JavaScript-rendered websites

Export of data as HTML, Markdown, or screenshots

Streamable HTTP and Stdio transport modes

Automated bypass of Cloudflare protection

Universal API with flexible export options

Integration with LLMs like ChatGPT and Claude

API key-based authentication and management

Use Cases

Enabling AI agents to perform automated web browsing and interaction

Extracting real-time web data for research assistants or coding copilots

Scraping search engine results for keyword analysis

Retrieving and exporting dynamically-rendered website content

Summarizing information from websites through AI conversations

Bypassing Cloudflare and anti-bot mechanisms for content extraction

Converting web page contents to Markdown or HTML for downstream processing

Generating live previews of web automation for AI applications

Automating bulk data extraction for analytics tools

Integrating context-rich web data into AI-driven workflows

README

Scrapeless MCP Server

Welcome to the official Scrapeless Model Context Protocol (MCP) Server — a powerful integration layer that empowers LLMs, AI Agents, and AI applications to interact with the web in real time.

Built on the open MCP standard, Scrapeless MCP Server seamlessly connects models like ChatGPT, Claude, and tools like Cursor and Windsurf to a wide range of external capabilities, including:

Google services integration (Search, Trends)
Browser automation for page-level navigation and interaction
Scrape dynamic, JS-heavy sites—export as HTML, Markdown, or screenshots

Whether you're building an AI research assistant, a coding copilot, or autonomous web agents, this server provides the dynamic context and real-world data your workflows need—without getting blocked.

Usage Examples

Automated Web Interaction and Data Extraction with Claude

Using Scrapeless MCP Browser, Claude can perform complex tasks such as web navigation, clicking, scrolling, and scraping through conversational commands, with real-time preview of web interaction results via live sessions.

Bypassing Cloudflare to Retrieve Target Page Content

Using the Scrapeless MCP Browser service, the Cloudflare page is automatically accessed, and after the process is completed, the page content is extracted and returned in Markdown format.

Extracting Dynamically Rendered Page Content and Writing to File

Using the Scrapeless MCP Universal API, the JavaScript-rendered content of the target page above is scraped, exported in Markdown format, and finally written to a local file named text.md.

Automated SERP Scraping

Using the Scrapeless MCP Server, query the keyword “web scraping” on Google Search, retrieve the first 10 search results (including title, link, and summary), and write the content to the file named serp.text.

Here are some additional examples of how to use these servers:

Example
Search scrapeless by Google search.
Find the search interest for "AI" over the last year.
Use a browser to visit chatgpt.com, search for "What's the weather like today?", and summarize the results.
Scrape the HTML content of scrapeless.com page.
Scrape the Markdown content of scrapeless.com page.
Get screenshots of scrapeless.com.

Setup Guide

Get Scrapeless Key

Log in to the Scrapeless Dashboard（Free trial available）
Then click "Setting" on the left -> select "API Key Management" -> click "Create API Key". Finally, click the API Key you created to copy it.

Configure Your MCP Client

Scrapeless MCP Server supports both Stdio and Streamable HTTP transport modes.

🖥️ Stdio (Local Execution)

JSON

{
  "mcpServers": {
    "Scrapeless MCP Server": {
      "command": "npx",
      "args": ["-y", "scrapeless-mcp-server"],
      "env": {
        "SCRAPELESS_KEY": "YOUR_SCRAPELESS_KEY"
      }
    }
  }
}

🌐 Streamable HTTP (Hosted API Mode)

JSON

{
  "mcpServers": {
    "Scrapeless MCP Server": {
      "type": "streamable-http",
      "url": "https://api.scrapeless.com/mcp",
      "headers": {
        "x-api-token": "YOUR_SCRAPELESS_KEY"
      },
      "disabled": false,
      "alwaysAllow": []
    }
  }
}

Advanced Options

Customize browser session behavior with optional parameters. These can be set via environment variables (for Stdio) or HTTP headers (for Streamable HTTP):

Stdio (Env Var)	Streamable HTTP (HTTP Header)	Description
BROWSER_PROFILE_ID	x-browser-profile-id	Specifies a reusable browser profile ID for session continuity.
BROWSER_PROFILE_PERSIST	x-browser-profile-persist	Enables persistent storage for cookies, local storage, etc.
BROWSER_SESSION_TTL	x-browser-session-ttl	Defines the maximum session timeout in seconds. The session will automatically expire after this duration of inactivity.

Integration with Claude Desktop

Open Claude Desktop
Navigate to: Settings → Tools → MCP Servers
Click "Add MCP Server"
Paste either the Stdio or Streamable HTTP config above
Save and enable the server
Claude will now be able to issue web queries, extract content, and interact with pages using Scrapeless

Integration with Cursor IDE

Open Cursor
Press Cmd + Shift + P and search for: Configure MCP Servers
Add the Scrapeless MCP config using the format above
Save the file and restart Cursor (if needed)
Now you can ask Cursor things like:
1. "Search StackOverflow for a solution to this error"
2. "Scrape the HTML from this page"
And it will use Scrapeless in the background.

Supported MCP Tools

Name	Description
google_search	Universal information search engine.
google_trends	Get trending search data from Google Trends.
browser_create	Create or reuse a cloud browser session using Scrapeless.
browser_close	Closes the current session by disconnecting the cloud browser.
browser_goto	Navigate browser to a specified URL.
browser_go_back	Go back one step in browser history.
browser_go_forward	Go forward one step in browser history.
browser_click	Click a specific element on the page.
browser_type	Type text into a specified input field.
browser_press_key	Simulate a key press.
browser_wait_for	Wait for a specific page element to appear.
browser_wait	Pause execution for a fixed duration.
browser_screenshot	Capture a screenshot of the current page.
browser_get_html	Get the full HTML of the current page.
browser_get_text	Get all visible text from the current page.
browser_scroll	Scroll to the bottom of the page.
browser_scroll_to	Scroll a specific element into view.
scrape_html	Scrape a URL and return its full HTML content.
scrape_markdown	Scrape a URL and return its content as Markdown.
scrape_screenshot	Capture a high-quality screenshot of any webpage.

Security Best Practices

When using Scrapeless MCP Server with LLMs (like ChatGPT, Claude, or Cursor), it's critical to handle all scraped or extracted web content with care. Web data is untrusted by default, and improper handling may expose your application to prompt injection or other security vulnerabilities.

✅ Recommended Practices

Never pass raw scraped content directly into LLM prompts. Raw HTML, JavaScript, or user-generated text may contain hidden injection payloads.
Sanitize and validate all extracted content. Strip or escape potentially harmful tags and scripts before using content in downstream logic or AI models.
Prefer structured extraction over free-form text. Use tools like scrape_html, scrape_markdown, or targeted browser_get_text with known-safe selectors to extract only the content you trust.
Apply domain or selector whitelisting when scraping dynamically generated pages, to restrict data flow to known and trusted sources.
Log and monitor all outbound requests made via browser or scraping tools, especially if you're handling sensitive data, tokens, or internal network access.

🚫 Avoid

Injecting scraped HTML directly into prompts
Letting users specify arbitrary URLs or CSS selectors without validation
Storing unfiltered scraped content for future prompt usage

Community

MCP Server Discord

Contact Us

For questions, suggestions, or collaboration inquiries, feel free to contact us via:

Email: market@scrapeless.com
Official Website: https://www.scrapeless.com
Community Forum: https://discord.gg/Np4CAHxB9a

Star History

Repository Owner

scrapeless-ai

Organization

Repository Details

Language TypeScript

Default Branch main

Size 15,702 KB

Contributors 4

License MIT License

MCP Verified Nov 12, 2025

Programming Languages

TypeScript

98%

JavaScript

1.2%

Dockerfile

0.79%

Related MCPs

Discover similar Model Context Protocol servers

WebScraping.AI MCP Server

MCP server for advanced web scraping and AI-driven data extraction

WebScraping.AI MCP Server implements the Model Context Protocol to provide web data extraction and question answering functionalities. It integrates with WebScraping.AI to offer robust tools for retrieving, rendering, and parsing web content, including structured data and natural language answers from web pages. It supports JavaScript rendering, proxy management, device emulation, and custom extraction configurations, making it suitable for both individual and team deployments in AI-assisted workflows.

⭐ 33
MCP
webscraping-ai/webscraping-ai-mcp-server

AgentQL MCP Server

MCP-compliant server for structured web data extraction using AgentQL.

AgentQL MCP Server acts as a Model Context Protocol (MCP) server that leverages AgentQL's data extraction capabilities to fetch structured information from web pages. It allows integration with applications supporting MCP, such as Claude Desktop, VS Code, and Cursor, by providing an accessible interface for extracting structured data based on user-defined prompts. With configurable API key support and streamlined installation, it simplifies the process of connecting web data extraction workflows to AI tools.

⭐ 120
MCP
tinyfish-io/agentql-mcp

Google Workspace MCP Server

Full natural language control of Google Workspace through the Model Context Protocol.

Google Workspace MCP Server enables comprehensive natural language interaction with Google services such as Calendar, Drive, Gmail, Docs, Sheets, Slides, Forms, Tasks, and Chat via any MCP-compatible client or AI assistant. It supports both single-user and secure multi-user OAuth 2.1 authentication, providing a production-ready backend for custom apps. Built on FastMCP, it delivers high performance and advanced context handling, offering deep integration with the entire Google Workspace suite.

⭐ 890
MCP
taylorwilsdon/google_workspace_mcp

MyMCP Server (All-in-One Model Context Protocol)

Powerful and extensible Model Context Protocol server with developer and productivity integrations.

MyMCP Server is a robust Model Context Protocol (MCP) server implementation that integrates with services like GitLab, Jira, Confluence, YouTube, Google Workspace, and more. It provides AI-powered search, contextual tool execution, and workflow automation for development and productivity tasks. The system supports extensive configuration and enables selective activation of grouped toolsets for various environments. Installation and deployment are streamlined, with both automated and manual setup options available.

⭐ 93
MCP
nguyenvanduocit/all-in-one-model-context-protocol

Driflyte MCP Server

Bridging AI assistants with deep, topic-aware knowledge from web and code sources.

Driflyte MCP Server acts as a bridge between AI-powered assistants and diverse, topic-aware content sources by exposing a Model Context Protocol (MCP) server. It enables retrieval-augmented generation workflows by crawling, indexing, and serving topic-specific documents from web pages and GitHub repositories. The system is extensible, with planned support for additional knowledge sources, and is designed for easy integration with popular AI tools such as ChatGPT, Claude, and VS Code.

⭐ 9
MCP
serkan-ozal/driflyte-mcp-server

Weblate MCP Server

Seamlessly connect AI assistants to Weblate for translation management via the Model Context Protocol.

Weblate MCP Server enables AI assistants and clients to directly manage Weblate translation projects through the Model Context Protocol (MCP). It integrates with the Weblate REST API, allowing natural language interaction for project and translation management. The tool offers multiple transport options including HTTP, SSE, and STDIO, and is optimized for large language model workflows. Full support for project, component, and translation operations is provided, with a focus on type safety and flexible environment configuration.

⭐ 9
MCP
mmntm/weblate-mcp

View all Alternatives

Didn't find tool you were looking for?

Search AI Tools

Scrapeless MCP Server

Key Features

Use Cases

README

Scrapeless MCP Server

Usage Examples

Setup Guide

Advanced Options

Integration with Claude Desktop

Integration with Cursor IDE

Supported MCP Tools

Security Best Practices

✅ Recommended Practices

🚫 Avoid

Community

Contact Us

Star History

Repository Owner

Repository Details

Programming Languages

Tags

Related MCPs