CICADA

Structured, contextual code intelligence for AI assistants on Elixir projects.

CICADA is an MCP server that provides AI assistants with AST-level, structured access to Elixir codebases. It enables code analysis, semantic search, module and function discovery, and git/PR attribution for deeper contextual understanding. CICADA supports multiple editors and offers features like dependency mapping, dead-code detection, and local indexing with strong privacy guarantees.

Key Features

AST-based module and function analysis

Semantic and keyword code search

Git and PR attribution for code context

Call-site and dependency tracking

Dead code detection

Local, privacy-focused indexing

Integration with multiple editors (Claude, Cursor, VS Code, etc.)

Automatic reindexing with watch mode

Wildcard and pattern search support

CLI-based setup and on-demand server operation

Use Cases

Enhancing AI code assistants with deep, structured Elixir code knowledge

Locating all usages of a function or module within a codebase

Understanding module dependencies and designing safe refactors

Tracking why and how code changes were introduced via PR attribution

Identifying and removing unused (dead) code

Accelerating semantic code search for features or fixes

Syncing and auto-reindexing changes during development

Improving prompt efficiency and accuracy for code-related queries

Providing instant contextual answers in supported code editors

Supporting safe navigation and automated updates across large projects

README

CICADA

Code Intelligence: Contextual Analysis, Discovery, and Attribution

Give your AI assistant structured access to your Elixir codebase.

What's New in 0.3

Dependency analysis tools - New get_module_dependencies and get_function_dependencies with transitive depth support
Expanded editor support - Added Gemini CLI, Codex, and OpenCode (now 6 editors supported)
Watch mode - Automatic reindexing with cicada watch or --watch flag
Better discoverability - Smart error suggestions, inline PR context, staleness warnings
Wildcard & OR patterns - Search with MyApp.*, create*|update* across modules and functions
Breaking: Removed CICADA_REPO_PATH environment variable
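
The "transitive depth" mentioned for the dependency tools can be pictured as a depth-limited walk over a dependency graph. The sketch below is only an illustration of that idea, not CICADA's implementation: the `transitive_dependencies` helper and the `GRAPH` data are hypothetical and not taken from a real index.

```python
from collections import deque


def transitive_dependencies(graph, start, max_depth):
    """Return {module: depth} for modules reachable from start within max_depth hops."""
    seen = {}
    queue = deque([(start, 0)])
    while queue:
        node, depth = queue.popleft()
        if depth == max_depth:
            continue  # do not expand beyond the requested depth
        for dep in graph.get(node, ()):
            if dep != start and dep not in seen:
                seen[dep] = depth + 1
                queue.append((dep, depth + 1))
    return seen


# Hypothetical direct-dependency map (module -> modules it uses).
GRAPH = {
    "MyApp.Accounts": ["MyApp.User", "MyApp.Repo"],
    "MyApp.User": ["MyApp.Repo", "Ecto.Schema"],
    "MyApp.Repo": ["Ecto.Repo"],
}

print(transitive_dependencies(GRAPH, "MyApp.Accounts", 1))
print(transitive_dependencies(GRAPH, "MyApp.Accounts", 2))
```

With a depth of 1 only direct dependencies are returned; a depth of 2 also includes what those dependencies use.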

Why CICADA?

Traditional AI assistants treat your repo like a pile of text. That leads to:

Token waste: blind grep dumps that burn 3k+ tokens per question.
Hallucinated edits: aliases/imports hide call sites, so refactors miss real usages.
No historical context: design intent and PR trade-offs never make it into the prompt.

CICADA is an MCP server that gives assistants AST-level knowledge:

Module + function definitions with signatures, specs, docs, owning files.
Complete call-site tracking (aliases, imports, dynamic references).
Semantic/keyword search so you can ask for "authentication" even if it's called verify_credentials/2.
Git + PR attribution to surface why code exists.
Dead-code detection and module dependency views for safe refactors.

Result: in our comparison, the same question dropped from 3,127 tokens / 52.8s to 550 tokens / 35s with correct answers.
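
For scale, the figures quoted above can be turned into relative savings. The snippet uses only the numbers from that sentence; `percent_reduction` is a helper name introduced here for illustration.

```python
def percent_reduction(before, after):
    """Relative saving, as a percentage of the 'before' value."""
    return (before - after) / before * 100


tokens_saved = percent_reduction(3127, 550)
time_saved = percent_reduction(52.8, 35.0)
print(f"tokens: {tokens_saved:.1f}% fewer, time: {time_saved:.1f}% less")
# prints: tokens: 82.4% fewer, time: 33.7% less
```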

Quick Install

bash

# 1. Install uv (if needed)
curl -LsSf https://astral.sh/uv/install.sh | sh

# 2. Install CICADA
uv tool install cicada-mcp

# 3. Index your Elixir project
cd /path/to/project
cicada claude   # or: cicada cursor, cicada vs, cicada gemini, cicada codex, cicada opencode

# 4. Restart your editor

To try CICADA without installing it:

bash

uvx --from cicada-mcp cicada claude   # or cursor, vs

Runs CICADA on demand (slower after the first run, but zero install).

Available commands after installation:

cicada [claude|cursor|vs|gemini|codex|opencode] - One-command setup per project
cicada-mcp - MCP server (auto-started by editor)
cicada watch - Watch for file changes and automatically reindex
cicada index - Re-index code with custom options (-f/--force + --fast/--regular/--max, --watch)
cicada index-pr - Index pull requests for PR attribution
cicada find-dead-code - Find potentially unused functions

Ask your assistant:

"Show me the functions in MyApp.User"
"Where is authenticate/2 called?"
"Find code related to API authentication"

Privacy & Security

100% local: parsing + indexing happen on your machine; no cloud uploads.
No telemetry: CICADA doesn't collect usage or phone home.
Read-only tools: MCP endpoints only read the index; they can't change your repo.
Optional GitHub access: PR features rely on gh and your existing OAuth token.

Data layout:

~/.cicada/projects/<repo_hash>/
├─ index.json      # modules, functions, call sites, metadata
├─ config.yaml     # indexing options + keyword tier
├─ hashes.json     # incremental indexing cache
└─ pr_index.json   # optional PR metadata + reviews

Your repo only gains an editor config (.mcp.json, .cursor/mcp.json, .vscode/settings.json, .gemini/mcp.json, .codex/mcp.json, or .opencode.json).
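
To see what CICADA has stored for your projects, a small script can walk the layout above. This is a minimal sketch: the file names come from the listing, while `describe_project_dir` and `list_project_dirs` are hypothetical helpers, not part of CICADA.

```python
from pathlib import Path

# File names come from the data layout above; everything else is illustrative.
EXPECTED_FILES = ("index.json", "config.yaml", "hashes.json", "pr_index.json")


def describe_project_dir(project_dir):
    """Map each expected file name to True/False depending on whether it exists."""
    root = Path(project_dir)
    return {name: (root / name).is_file() for name in EXPECTED_FILES}


def list_project_dirs(base=None):
    """Return the per-project directories under ~/.cicada/projects, if any."""
    base = Path(base) if base is not None else Path.home() / ".cicada" / "projects"
    if not base.is_dir():
        return []
    return sorted(p for p in base.iterdir() if p.is_dir())


if __name__ == "__main__":
    for project in list_project_dirs():
        print(project.name, describe_project_dir(project))
```

A missing `pr_index.json` is expected unless PR indexing has been run, since that file is optional.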

For Developers

Wire CICADA into your editor once, and every assistant session inherits the context.

Install & Configure

bash

cd /path/to/project
cicada claude   # or cicada cursor / cicada vs / cicada gemini / cicada codex / cicada opencode

This command:

Parses every .ex/.exs file with tree-sitter.
Builds the index in ~/.cicada/projects/<hash>/.
Creates the correct MCP config for your editor.
Configures .gitattributes so git can track functions through refactors.
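
Re-running the command is incremental. One common way to implement that kind of incremental pass is to compare content hashes against a cache, sketched below. This is an assumption-laden illustration, not CICADA's code: the format of `hashes.json` is not documented here, the choice of SHA-256 is arbitrary, and reusing watch mode's excluded directories for indexing is a guess.

```python
import hashlib
from pathlib import Path

# Directory names borrowed from the watch-mode exclusion list in this README (assumed to apply here).
EXCLUDED_DIRS = {"deps", "_build", "node_modules", ".git", "assets", "priv"}


def elixir_sources(project_root):
    """Yield .ex/.exs files under project_root, skipping excluded directories."""
    root = Path(project_root)
    for path in sorted(root.rglob("*")):
        if path.suffix not in {".ex", ".exs"} or not path.is_file():
            continue
        if EXCLUDED_DIRS.intersection(path.relative_to(root).parts[:-1]):
            continue
        yield path


def changed_files(project_root, previous_hashes):
    """Return (changed relative paths, fresh hash map) compared with previous_hashes."""
    root = Path(project_root)
    current = {}
    for path in elixir_sources(root):
        digest = hashlib.sha256(path.read_bytes()).hexdigest()
        current[path.relative_to(root).as_posix()] = digest
    changed = [p for p, h in current.items() if previous_hashes.get(p) != h]
    return changed, current
```

Only the files in `changed` would need to be parsed again. Deleted files are not handled in this sketch; they would show up as keys present in `previous_hashes` but absent from the fresh map.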

Enable PR Attribution (optional)

bash

brew install gh    # or apt install gh
gh auth login
cicada index-pr .     # incremental
cicada index-pr . --clean   # full rebuild

Unlocks questions like "Which PR introduced line 42?" or "What did reviewers say about billing.ex?"

Automatic Re-indexing with Watch Mode

Enable automatic reindexing when files change by starting the MCP server with the --watch flag:

For Claude Code (.mcp.json):

json

{
  "mcpServers": {
    "cicada": {
      "command": "cicada-mcp",
      "args": ["--watch"],
      "env": {
        "CICADA_CONFIG_DIR": "/home/user/.cicada/projects/<hash>"
      }
    }
  }
}

For Cursor (.cursor/mcp.json):

json

{
  "mcpServers": {
    "cicada": {
      "command": "cicada-mcp",
      "args": ["--watch"]
    }
  }
}
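
If the config file already exists, the flag can also be added with a short script rather than by hand. A minimal sketch, assuming the `mcpServers` / `cicada` structure shown in the examples above; `enable_watch` is a hypothetical helper, not a CICADA command.

```python
import json


def enable_watch(config_text, server_name="cicada"):
    """Return config JSON text with "--watch" present in the server's args."""
    config = json.loads(config_text)
    server = config["mcpServers"][server_name]
    args = server.setdefault("args", [])
    if "--watch" not in args:
        args.append("--watch")  # leave any existing arguments untouched
    return json.dumps(config, indent=2)


example = '{"mcpServers": {"cicada": {"command": "cicada-mcp"}}}'
print(enable_watch(example))
```

The function is idempotent, so running it twice does not duplicate the flag.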

Or use with the server subcommand:

bash

cicada server --watch
cicada server --watch --fast    # Use fast tier for reindexing
cicada server --watch --max     # Use max tier for reindexing

When watch mode is enabled:

A separate process monitors .ex and .exs files for changes
Changes are automatically reindexed (incremental, fast)
2-second debounce prevents excessive reindexing during rapid edits
The watch process stops automatically when the MCP server stops
Excluded directories: deps, _build, node_modules, .git, assets, priv
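
The 2-second debounce can be understood as "wait until edits have been quiet for two seconds, then reindex everything that changed in one batch". The sketch below shows that idea in isolation. It assumes a trailing-edge debounce, which this README does not confirm, and the `Debouncer` class is hypothetical.

```python
import time


class Debouncer:
    """Collect changed paths and release them once no change arrived for `delay` seconds."""

    def __init__(self, delay=2.0, clock=time.monotonic):
        self.delay = delay
        self.clock = clock  # injectable so the behaviour can be tested without sleeping
        self.pending = set()
        self.last_event = None

    def notify(self, path):
        """Record a file change and restart the quiet-period timer."""
        self.pending.add(path)
        self.last_event = self.clock()

    def poll(self):
        """Return the batch to reindex if the quiet period has elapsed, else an empty list."""
        if not self.pending or self.clock() - self.last_event < self.delay:
            return []
        batch = sorted(self.pending)
        self.pending.clear()
        return batch
```

A watcher loop would call `notify` for every file event and `poll` periodically; a burst of saves then produces a single reindex instead of one per save.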

CLI Cheat Sheet

Command	Purpose	Run When
`cicada claude`	Configure MCP + incremental re-index	First setup, after local changes
`cicada watch`	Monitor files and auto-reindex on changes	During active development
`cicada index --force --regular .`	Full rebuild w/ semantic keywords	After large refactors or enabling AI tier
`cicada index-pr .`	Sync PR metadata/reviews	After new PRs merge
`cicada find-dead-code --min-confidence high`	List unused public functions	Cleanup sprints

Troubleshooting

Run the indexer first:

bash

cicada index /path/to/project

Ensure indexing completed successfully. Check for ~/.cicada/projects/<hash>/index.json.

Use the exact module name as it appears in code (e.g., MyApp.User, not User).

If the module was recently added, re-index:

bash

cicada index .

Troubleshooting checklist:

Verify configuration file exists:

bash

# For Claude Code
ls -la .mcp.json

# For Cursor
ls -la .cursor/mcp.json

# For VS Code
ls -la .vscode/settings.json

Check paths are absolute:

bash

cat .mcp.json
# Should contain: /absolute/path/to/project
# Not: ./project or ../project

Ensure index exists:

bash

ls -la ~/.cicada/projects/
# Should show directory for your project

Restart the editor completely (not just reload the window).
Check the editor's MCP logs:
- Claude Code: Console output
- Cursor: Settings → MCP → View Logs
- VS Code: Output panel → MCP
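
The "paths are absolute" check from the list above can be automated. The following is a minimal sketch that flags any string value starting with `./` or `../` in a JSON config; `check_mcp_config` and `relative_path_values` are hypothetical helpers, and a VS Code `settings.json` containing comments will be reported as invalid JSON rather than parsed.

```python
import json
from pathlib import Path


def relative_path_values(node, trail=""):
    """Yield (location, value) for string values that look like relative paths."""
    if isinstance(node, dict):
        for key, value in node.items():
            yield from relative_path_values(value, f"{trail}/{key}")
    elif isinstance(node, list):
        for i, value in enumerate(node):
            yield from relative_path_values(value, f"{trail}/{i}")
    elif isinstance(node, str) and node.startswith(("./", "../")):
        yield trail, node


def check_mcp_config(config_path):
    """Return a list of human-readable problems found in an MCP config file."""
    path = Path(config_path)
    if not path.is_file():
        return [f"{path} does not exist"]
    try:
        config = json.loads(path.read_text())
    except json.JSONDecodeError as exc:
        return [f"{path} is not valid JSON: {exc}"]
    return [f"{where}: relative path {value!r}" for where, value in relative_path_values(config)]
```

An empty list means none of the checked problems were found; it does not prove the server will start.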

Set up GitHub CLI:

bash

# Install GitHub CLI
brew install gh  # macOS
sudo apt install gh  # Ubuntu
# or visit https://cli.github.com/

# Authenticate
gh auth login

# Index PRs
cicada index-pr .

Common issues:

"No PR index found" → Run cicada index-pr .
"Not a GitHub repository" → Ensure the repo has a GitHub remote
Slow indexing → First-time indexing fetches all PRs; subsequent runs are incremental
Rate limiting → GitHub API has rate limits; wait and retry if you hit limits

Force rebuild:

bash

cicada index-pr . --clean

Error: "Keyword search not available"

Cause: Index was built without keyword extraction.

Solution:

bash

# Re-index with keyword extraction
cicada index --regular .  # or --fast or --max

Verify:

bash

cat ~/.cicada/projects/<hash>/config.yaml
# Should show keyword_extraction: enabled

More detail: docs/PR_INDEXING.md, docs/08-INCREMENTAL_INDEXING.md.

For AI Assistants

CICADA ships eleven focused MCP tools. Use the decision table to pick the right one:

🧭 Which Tool Should You Use?

Need	Tool	Notes
List a module's API	`search_module`	Supports wildcards (`*`) and OR (`|`) patterns
Find where a function is defined & called	`search_function`	Supports wildcards (`*`) and OR (`|`) patterns
Discover who imports/aliases a module	`search_module_usage`	Great for dependency impact analysis
See what modules a module depends on	`get_module_dependencies`	Shows all modules used by a module (with transitive depth support)
See what functions a function calls	`get_function_dependencies`	Shows all functions called by a function (with transitive depth support)
Search by concept ("authentication", `*_user`)	`search_by_features`	Requires keyword tier index
Identify unused code	`find_dead_code`	Confidence-ranked (high, medium, low)
Find PR for a line	`find_pr_for_line`	Needs `cicada index-pr` + `gh`
View PR history for a file	`get_file_pr_history`	Shows descriptions + review comments
Track function/file evolution	`get_commit_history`	Follows refactors via `.gitattributes`
Show blame with grouped authorship	`get_blame`	Useful when you need owners
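
Editors normally issue these tool calls for you, but it can help to see their shape. MCP tool invocations are JSON-RPC 2.0 requests with the method `tools/call`. The sketch below builds one such request; the `function_name` argument key is a hypothetical placeholder, so check MCP_TOOLS_REFERENCE.md for the real parameter names.

```python
import json


def tool_call_request(request_id, tool_name, arguments):
    """Build an MCP tools/call request as a JSON-RPC 2.0 message."""
    return {
        "jsonrpc": "2.0",
        "id": request_id,
        "method": "tools/call",
        "params": {"name": tool_name, "arguments": arguments},
    }


# "function_name" is a hypothetical argument key, used only for illustration.
request = tool_call_request(1, "search_function", {"function_name": "create*|update*"})
print(json.dumps(request, indent=2))
```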

Want to see these tools in action? Check out Complete Workflow Examples with pro tips and real-world scenarios.

Core Search Tools

search_module - Find modules and view all their functions

Search by module name or file path with wildcards (MyApp.*, *User*) and OR patterns (MyApp.User|MyApp.Post)
View function signatures with type specs
Filter public/private functions
Output in Markdown or JSON

search_function - Locate function definitions and track usage

Search by function name, arity, or module path with wildcards (create*, *_user) and OR patterns (create*|update*)
See where functions are called with line numbers
View actual code usage examples
Filter for test files only
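
The wildcard and OR syntax used by search_module and search_function behaves like shell-style globbing with `|` separating alternatives. The sketch below approximates it with Python's `fnmatch`. It is an illustration of the pattern semantics, not CICADA's matcher, and `fnmatch` additionally treats `?` and `[...]` as special, which CICADA may not.

```python
from fnmatch import fnmatchcase


def matches(pattern, name):
    """True if name matches any |-separated alternative; * matches any run of characters."""
    return any(fnmatchcase(name, alt.strip()) for alt in pattern.split("|"))


def search(pattern, names):
    """Return the names that match the pattern, preserving input order."""
    return [name for name in names if matches(pattern, name)]


functions = ["create_user", "update_user", "delete_user", "list_posts"]
modules = ["MyApp.User", "MyApp.Post", "Other.User"]

print(search("create*|update*", functions))  # ['create_user', 'update_user']
print(search("*_user", functions))           # ['create_user', 'update_user', 'delete_user']
print(search("MyApp.*", modules))            # ['MyApp.User', 'MyApp.Post']
```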

search_module_usage - Track module dependencies

Find all aliases and imports
See all function calls to a module
Understand module relationships
Map dependencies across codebase

Git History & Attribution Tools

find_pr_for_line - Identify which PR introduced any line of code

Line-level PR attribution via git blame
Author and commit information
Direct links to GitHub PRs
Requires: GitHub CLI + PR index

get_file_pr_history - View complete PR history for a file

All PRs that modified the file
PR descriptions and metadata
Code review comments with line numbers
Requires: GitHub CLI + PR index

get_commit_history - Track file and function evolution over time

Complete commit history for files
Function-level tracking (follows refactors)
Creation and modification timeline
Requires: .gitattributes configuration
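
Function-level history in git depends on git knowing where Elixir functions begin, which is what a `.gitattributes` diff driver provides. The sketch below assumes the entries are `*.ex diff=elixir` and `*.exs diff=elixir`, using git's built-in `elixir` diff driver; the exact lines CICADA writes are not shown in this README, and `with_elixir_diff` is a hypothetical helper.

```python
# Assumed attribute lines; the exact entries CICADA writes are not shown in this README.
ELIXIR_ATTRIBUTES = ("*.ex diff=elixir", "*.exs diff=elixir")


def with_elixir_diff(gitattributes_text):
    """Return .gitattributes content that contains each Elixir diff line exactly once."""
    lines = gitattributes_text.splitlines()
    present = {line.strip() for line in lines}
    for entry in ELIXIR_ATTRIBUTES:
        if entry not in present:
            lines.append(entry)
    return "\n".join(lines) + "\n"


print(with_elixir_diff("*.png binary\n"), end="")
```

With attributes like these in place, a command such as `git log -L :create_user:lib/my_app/user.ex` can follow a single function's history.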

get_blame - Show line-by-line code ownership

Grouped authorship display
Commit details for each author
Code snippets with context

Advanced Features

search_by_features (Beta) - Search code by concepts and features

🎯 Perfect for: "I don't know the exact name" - Search by what code does, not what it's called
Find code related to concepts like "authentication", "api key storage", "email validation"
Wildcard pattern matching (create*, *_user, validate_*)
Filter results by type: modules only, functions only, or all
AI-powered keyword extraction from documentation
Relevance scoring to surface the most relevant results
Requires: Index built with keyword extraction (--fast, --regular, or --max)

When to use: You know what you're looking for conceptually but not the exact module/function names. Instead of guessing names with search_function, describe what the code does!

find_dead_code - Identify potentially unused functions

Three confidence levels (high, medium, low)
Smart detection of callbacks and behaviors
Recognition of dynamic call patterns
Module-level grouping with line numbers
Excludes test files and @impl functions
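
The core idea behind this kind of detection is simple: a function with no recorded call sites is a candidate, unless something explains the absence. The sketch below is a deliberately simplified illustration with only two confidence levels. How CICADA actually assigns high, medium, and low is not described here, and the `FunctionInfo` fields are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class FunctionInfo:
    module: str
    name: str
    arity: int
    call_sites: int                        # number of recorded callers
    is_impl: bool = False                  # marked with @impl
    in_test_file: bool = False
    module_used_dynamically: bool = False  # e.g. the module is passed around as a value


def dead_code_candidates(functions):
    """Return sorted (confidence, "Module.name/arity") pairs for functions with no recorded callers."""
    results = []
    for fn in functions:
        if fn.call_sites > 0 or fn.is_impl or fn.in_test_file:
            continue  # used, a behaviour callback, or test code: not a candidate
        confidence = "low" if fn.module_used_dynamically else "high"
        results.append((confidence, f"{fn.module}.{fn.name}/{fn.arity}"))
    return sorted(results)
```

Low-confidence results are the ones worth double-checking with search_function before deleting anything.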

Detailed parameters + output formats: MCP_TOOLS_REFERENCE.md.

Token-Friendly Responses

All tools return structured Markdown/JSON snippets (signatures, call sites, PR metadata) instead of full files, keeping prompts lean.

Learn by Doing (5–10 min each)

1. Safe Refactor Checklist

search_function → "Where is create_user/2 called?"
search_module_usage → "Which modules alias MyApp.User?"
search_function with test_only:true to confirm test coverage.
get_file_pr_history → "Show PRs that modified lib/my_app/user.ex."

2. Untangle Legacy Intent

search_module to skim the API.
get_file_pr_history for design discussions/reviews.
get_commit_history on the hot function.
get_blame on confusing lines to ping the right author.

3. Cleanup Sprint

find_dead_code --min-confidence high for candidates.
For each, search_function to double-check dynamic usage.
find_pr_for_line to ensure it isn't waiting on an unfinished feature.
Remove or deprecate confidently.

For full walkthroughs see docs/17-WORKFLOW_EXAMPLES.md and docs/12-TOOL_DISCOVERABILITY_TASKS.md.

Documentation

CHANGELOG.md – release notes.
docs/01-KEYWORD_EXTRACTION_ANALYSIS.md – semantic search internals.
docs/09-PR_INDEXING.md – GitHub integration details.
docs/16-MCP_TOOL_CALL_BENCHMARKING.md – token/time benchmarks.

Roadmap

Available	Coming Soon
Elixir indexing + AST search	Python + TypeScript support
Semantic keyword tiers (`--fast/regular/max`)	Shared/team indexes
PR attribution + review scraping
Dead-code + dependency analysis

Contributing

bash

git clone https://github.com/wende/cicada.git
cd cicada
uv sync
pytest

Before submitting a PR:

Run black cicada tests
Ensure tests + coverage pass (pytest --cov=cicada --cov-report=term-missing)
Update docs if behaviour changes

We welcome issues/PRs for:

New language grammars
Tool output improvements
Better onboarding docs and tutorials

License

MIT – see LICENSE.

Stop letting your AI search blindly. Give it CICADA.

Repository Owner

wende

User

Repository Details

Language Python

Default Branch main

Size 6,704 KB

Contributors 2

License MIT License

MCP Verified Nov 11, 2025

Programming Languages

Python

98.88%

Shell

0.75%

Makefile

0.35%

Dockerfile

0.02%

Topics

ai code-intelligence elixir indexer mcp rag tools
