Vectara MCP Server
Secure RAG server enabling seamless AI integration via Model Context Protocol.
🔌 Compatible with Claude Desktop and any other MCP client!
The Model Context Protocol (MCP) is an open standard that enables AI systems to interact seamlessly with various data sources and tools, facilitating secure, two-way connections.
Vectara-MCP provides any agentic application with access to fast, reliable RAG with reduced hallucination, powered by Vectara's Trusted RAG platform, via MCP.
Installation
You can install the package directly from PyPI:
pip install vectara-mcp
Quick Start
Secure by Default (HTTP/SSE with Authentication)
# Start server with secure HTTP transport (DEFAULT)
python -m vectara_mcp
# Server running at http://127.0.0.1:8000 with authentication enabled
Local Development Mode (STDIO)
# For Claude Desktop or local development (less secure)
python -m vectara_mcp --stdio
# ⚠️ Warning: STDIO transport is less secure. Use only for local development.
Configuration Options
# Custom host and port
python -m vectara_mcp --host 0.0.0.0 --port 8080
# SSE transport mode
python -m vectara_mcp --transport sse --path /sse
# Disable authentication (DANGEROUS - dev only)
python -m vectara_mcp --no-auth
Transport Modes
HTTP Transport (Default - Recommended)
- Security: Built-in authentication via bearer tokens
- Encryption: HTTPS ready
- Rate Limiting: 100 requests/minute by default
- CORS Protection: Configurable origin validation
- Use Case: Production deployments, cloud environments
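The default rate limit above (100 requests/minute) behaves like a sliding window. A minimal illustrative sketch in Python, not the server's actual implementation:

```python
import time
from collections import deque

# Illustrative sliding-window rate limiter mirroring the documented default
# of 100 requests/minute. This is a sketch, not the server's actual code.
class RateLimiter:
    def __init__(self, max_requests=100, window_seconds=60.0):
        self.max_requests = max_requests
        self.window = window_seconds
        self.timestamps = deque()  # timestamps of allowed requests

    def allow(self, now=None):
        now = time.monotonic() if now is None else now
        # Drop requests that have aged out of the window.
        while self.timestamps and now - self.timestamps[0] >= self.window:
            self.timestamps.popleft()
        if len(self.timestamps) < self.max_requests:
            self.timestamps.append(now)
            return True
        return False

# A small window for demonstration: 3 requests per 60 seconds.
limiter = RateLimiter(max_requests=3, window_seconds=60)
print([limiter.allow(now=t) for t in (0, 1, 2, 3)])  # [True, True, True, False]
```

Once requests older than the window age out, new requests are admitted again.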
SSE Transport
- Streaming: Server-Sent Events for real-time updates
- Authentication: Bearer token support
- Compatibility: Works with legacy MCP clients
- Use Case: Real-time streaming applications
STDIO Transport
- ⚠️ Security Warning: No transport-layer security
- Performance: Low latency for local communication
- Use Case: Local development, Claude Desktop
- Requirement: Must be explicitly enabled with the --stdio flag
Environment Variables
# Required
export VECTARA_API_KEY="your-api-key"
# Optional
export VECTARA_AUTHORIZED_TOKENS="token1,token2" # Additional auth tokens
export VECTARA_ALLOWED_ORIGINS="http://localhost:*,https://app.example.com"
export VECTARA_TRANSPORT="http" # Default transport mode
export VECTARA_AUTH_REQUIRED="true" # Enforce authentication
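The wildcard in VECTARA_ALLOWED_ORIGINS (e.g. "http://localhost:*") suggests shell-style pattern matching. An illustration of how such patterns could be matched, not necessarily the server's actual CORS logic:

```python
import fnmatch
import os

# Illustration only: how wildcard patterns in VECTARA_ALLOWED_ORIGINS could
# be matched. Defaults mirror the example values from the docs above.
allowed = os.environ.get(
    "VECTARA_ALLOWED_ORIGINS", "http://localhost:*,https://app.example.com"
).split(",")

def origin_allowed(origin):
    # fnmatch gives shell-style wildcard matching, so "http://localhost:*"
    # accepts any localhost port.
    return any(fnmatch.fnmatch(origin, pattern) for pattern in allowed)

print(origin_allowed("http://localhost:3000"))    # True with the default patterns
print(origin_allowed("https://evil.example.net")) # False with the default patterns
```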
Authentication
HTTP/SSE Transport
When using HTTP or SSE transport, authentication is required by default:
# Using curl with bearer token
curl -H "Authorization: Bearer $VECTARA_API_KEY" \
-H "Content-Type: application/json" \
-X POST http://localhost:8000/call/ask_vectara \
-d '{"query": "What is Vectara?", "corpus_keys": ["my-corpus"]}'
# Using X-API-Key header (alternative)
curl -H "X-API-Key: $VECTARA_API_KEY" \
http://localhost:8000/sse
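The same authenticated tool call as the curl example above can be built with Python's standard library. This sketch constructs the request but does not send it; the endpoint path and payload mirror the curl example:

```python
import json
import os
import urllib.request

# Build the authenticated ask_vectara call from the curl example above.
# The request object is constructed but not sent here.
api_key = os.environ.get("VECTARA_API_KEY", "demo-key")
req = urllib.request.Request(
    "http://localhost:8000/call/ask_vectara",
    data=json.dumps(
        {"query": "What is Vectara?", "corpus_keys": ["my-corpus"]}
    ).encode("utf-8"),
    headers={
        "Authorization": f"Bearer {api_key}",
        "Content-Type": "application/json",
    },
    method="POST",
)
# To actually send it (requires a running server):
# with urllib.request.urlopen(req) as resp:
#     print(resp.read().decode())
```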
Disabling Authentication (Development Only)
# ⚠️ NEVER use in production
python -m vectara_mcp --no-auth
Available Tools
API Key Management
- setup_vectara_api_key: Configure and validate your Vectara API key for the session (one-time setup).
Args:
- api_key: str, Your Vectara API key - required.
Returns:
- Success confirmation with masked API key or validation error.
- clear_vectara_api_key: Clear the stored API key from server memory.
Returns:
- Confirmation message.
Query Tools
- ask_vectara: Run a RAG query using Vectara, returning search results with a generated response.
Args:
- query: str, The user query to run - required.
- corpus_keys: list[str], List of Vectara corpus keys to use for the search - required.
- n_sentences_before: int, Number of sentences before the answer to include in the context - optional, default is 2.
- n_sentences_after: int, Number of sentences after the answer to include in the context - optional, default is 2.
- lexical_interpolation: float, The amount of lexical interpolation to use - optional, default is 0.005.
- max_used_search_results: int, The maximum number of search results to use - optional, default is 10.
- generation_preset_name: str, The name of the generation preset to use - optional, default is "vectara-summary-table-md-query-ext-jan-2025-gpt-4o".
- response_language: str, The language of the response - optional, default is "eng".
Returns:
- The response from Vectara, including the generated answer and the search results.
- search_vectara: Run a semantic search query using Vectara, without generation.
Args:
- query: str, The user query to run - required.
- corpus_keys: list[str], List of Vectara corpus keys to use for the search - required.
- n_sentences_before: int, Number of sentences before the answer to include in the context - optional, default is 2.
- n_sentences_after: int, Number of sentences after the answer to include in the context - optional, default is 2.
- lexical_interpolation: float, The amount of lexical interpolation to use - optional, default is 0.005.
Returns:
- The response from Vectara, including the matching search results.
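Putting the parameters above together, a complete ask_vectara request body looks like the following; the corpus key and query are placeholders:

```python
# A full ask_vectara request body using the documented defaults.
# "my-corpus" and the query string are placeholders.
payload = {
    "query": "Who is Amr Awadallah?",
    "corpus_keys": ["my-corpus"],    # required
    "n_sentences_before": 2,         # optional, default 2
    "n_sentences_after": 2,          # optional, default 2
    "lexical_interpolation": 0.005,  # optional, default 0.005
    "max_used_search_results": 10,   # optional, default 10
    "generation_preset_name": "vectara-summary-table-md-query-ext-jan-2025-gpt-4o",
    "response_language": "eng",      # optional, default "eng"
}
```

For search_vectara, the same body applies minus the generation-specific keys (max_used_search_results, generation_preset_name, response_language).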
Analysis Tools
- correct_hallucinations: Identify and correct hallucinations in generated text using Vectara's VHC (Vectara Hallucination Correction) API.
Args:
- generated_text: str, The generated text to analyze for hallucinations - required.
- documents: list[str], List of source documents to compare against - required.
- query: str, The original user query that led to the generated text - optional.
Returns:
- JSON-formatted string containing corrected text and detailed correction information.
- eval_factual_consistency: Evaluate the factual consistency of generated text against source documents using Vectara's dedicated factual consistency evaluation API.
Args:
- generated_text: str, The generated text to evaluate for factual consistency - required.
- documents: list[str], List of source documents to compare against - required.
- query: str, The original user query that led to the generated text - optional.
Returns:
- JSON-formatted string containing factual consistency evaluation results and scoring.
Note: API key must be configured first using setup_vectara_api_key tool or VECTARA_API_KEY environment variable.
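Both analysis tools take the same argument shape. A sketch of a correct_hallucinations request body, with hypothetical placeholder texts:

```python
# Hypothetical request body for correct_hallucinations; the same shape works
# for eval_factual_consistency. All text values below are placeholders.
payload = {
    "generated_text": "The product launched in 2019 and supports 12 languages.",
    "documents": [
        "The product launched in 2021.",
        "The product currently supports 8 languages.",
    ],
    "query": "When did the product launch?",  # optional
}
```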
Configuration with Claude Desktop
To use with Claude Desktop, update your configuration to use STDIO transport:
{
"mcpServers": {
"Vectara": {
"command": "python",
"args": ["-m", "vectara_mcp", "--stdio"],
"env": {
"VECTARA_API_KEY": "your-api-key"
}
}
}
}
Or using uv:
{
"mcpServers": {
"Vectara": {
"command": "uv",
"args": ["tool", "run", "vectara-mcp", "--stdio"]
}
}
}
Note: Claude Desktop requires STDIO transport. While less secure than HTTP, it's acceptable for local desktop use.
Usage in Claude Desktop App
Once installation is complete and the Claude Desktop app is configured, you must completely close and re-open the app for the Vectara-mcp server to appear. A hammer icon in the bottom left of the app indicates available MCP tools; click it to see details on each Vectara tool. Claude then has full access to the Vectara-mcp server, including all six Vectara tools.
Secure Setup Workflow
First-time setup (one-time per session):
- Configure your API key securely:
setup-vectara-api-key
API key: [your-vectara-api-key]
After setup, use any tools without exposing your API key:
Vectara Tool Examples
- RAG Query with Generation:
ask-vectara
Query: Who is Amr Awadallah?
Corpus keys: ["your-corpus-key"]
- Semantic Search Only:
search-vectara
Query: events in NYC?
Corpus keys: ["your-corpus-key"]
- Hallucination Detection & Correction:
correct-hallucinations
Generated text: [text to check]
Documents: ["source1", "source2"]
- Factual Consistency Evaluation:
eval-factual-consistency
Generated text: [text to evaluate]
Documents: ["reference1", "reference2"]
Security Best Practices
- Always use HTTP transport for production - never expose STDIO transport to the network
- Keep authentication enabled - only disable with --no-auth for local testing
- Use HTTPS in production - deploy behind a reverse proxy with TLS termination
- Configure CORS properly - set VECTARA_ALLOWED_ORIGINS to restrict access
- Rotate API keys regularly - update VECTARA_API_KEY and VECTARA_AUTHORIZED_TOKENS
- Monitor rate limits - default is 100 requests/minute; adjust based on your needs
See SECURITY.md for detailed security guidelines.
Support
For issues, questions, or contributions, please visit: https://github.com/vectara/vectara-mcp