CocoIndex Code - Semantic Code Search via Vector Embeddings

Natural language code search through two complementary approaches: the CLI (ccc) for speed and one-off queries, and the MCP server (one tool: search) for AI agent integration via stdio transport.

1. WHEN TO USE

Activation Triggers

Use when:

User asks to "find code that does X" or "search for implementations of Y"
User needs to discover code by concept or intent rather than exact text
User wants to find similar code patterns across the codebase
Grep/Glob exact matching is insufficient and fuzzy or semantic matching is needed
User mentions "semantic search", "code search", "find similar code"
User needs to locate logic handling a specific concern (e.g., "where is the retry logic")
User wants to understand how a concept is implemented across multiple files
User asks "how is X implemented" or "what handles Y"
User wants to understand architecture or module relationships
Starting work on an unfamiliar part of the codebase (onboarding queries)
@context agent is exploring code structure and needs concept-based discovery
Any exploration task where the exact function/class name is unknown

Automatic Triggers:

"semantic search", "find code that", "search for implementations"
"similar code", "code that handles", "where is the logic for"
"cocoindex", "ccc", "vector search"
"find similar", "code search", "search codebase"
"how is", "what handles", "where does", "understand the"
"explore", "architecture", "module relationships"
"onboarding", "unfamiliar code", "new to this"

When NOT to Use

Do not use for:

Exact text or regex search (use Grep instead)
File name or path search (use Glob instead)
Reading known files (use Read instead)
Codebases that have not been indexed yet (run ccc index first)
Simple string matching where the exact token is known
Non-code files (semantic search is optimized for source code)

2. SMART ROUTING

Resource Loading Levels

Level	When to Load	Resources
ALWAYS	Every skill invocation	references/tool_reference.md
CONDITIONAL	If intent signals match	references/search_patterns.md, references/cross_cli_playbook.md
ON_DEMAND	Only on explicit request	Full troubleshooting and configuration docs

Smart Router Pseudocode

The authoritative routing logic for scoped loading, weighted intent scoring, and ambiguity handling.

python

from pathlib import Path

SKILL_ROOT = Path(__file__).resolve().parent
RESOURCE_BASES = (SKILL_ROOT / "references", SKILL_ROOT / "assets")
DEFAULT_RESOURCE = "references/tool_reference.md"

INTENT_SIGNALS = {
    "SEARCH": {"weight": 4, "keywords": ["search", "find", "where", "similar", "semantic", "code that"]},
    "INDEX": {"weight": 4, "keywords": ["index", "reindex", "update index", "build index", "refresh"]},
    "INSTALL": {"weight": 4, "keywords": ["install", "setup", "configure", "ccc not found"]},
    "STATUS": {"weight": 3, "keywords": ["status", "stats", "how many files", "indexed"]},
    "TROUBLESHOOT": {"weight": 3, "keywords": ["error", "failed", "not working", "empty results"]},
    "CROSS_CLI": {"weight": 3, "keywords": ["copilot", "gemini", "claude", "codex", "cross cli", "multi query"]},
    "CONCURRENCY": {"weight": 3, "keywords": ["refresh_index", "concurrency", "concurrent", "follow-up query"]},
}

RESOURCE_MAP = {
    "SEARCH": ["references/search_patterns.md", "references/cross_cli_playbook.md", "references/tool_reference.md"],
    "INDEX": ["references/tool_reference.md"],
    "INSTALL": ["references/tool_reference.md"],
    "STATUS": ["references/tool_reference.md"],
    "TROUBLESHOOT": ["references/tool_reference.md", "references/cross_cli_playbook.md", "references/search_patterns.md"],
    "CROSS_CLI": ["references/cross_cli_playbook.md", "references/tool_reference.md"],
    "CONCURRENCY": ["references/cross_cli_playbook.md", "references/tool_reference.md"],
}

LOADING_LEVELS = {
    "ALWAYS": [DEFAULT_RESOURCE],
    "ON_DEMAND_KEYWORDS": ["full troubleshooting", "all commands", "configuration guide", "cross cli playbook"],
    "ON_DEMAND": ["references/tool_reference.md", "references/search_patterns.md", "references/cross_cli_playbook.md"],
}

def _task_text(task) -> str:
    parts = [
        str(getattr(task, "text", "")),
        str(getattr(task, "query", "")),
        " ".join(getattr(task, "keywords", []) or []),
    ]
    return " ".join(parts).lower()

def _guard_in_skill(relative_path: str) -> str:
    resolved = (SKILL_ROOT / relative_path).resolve()
    resolved.relative_to(SKILL_ROOT)
    if resolved.suffix.lower() != ".md":
        raise ValueError(f"Only markdown resources are routable: {relative_path}")
    return resolved.relative_to(SKILL_ROOT).as_posix()

def discover_markdown_resources() -> set[str]:
    docs = []
    for base in RESOURCE_BASES:
        if base.exists():
            docs.extend(p for p in base.rglob("*.md") if p.is_file())
    return {doc.relative_to(SKILL_ROOT).as_posix() for doc in docs}

def score_intents(task) -> dict[str, float]:
    """Weighted intent scoring from request text and capability signals."""
    text = _task_text(task)
    scores = {intent: 0.0 for intent in INTENT_SIGNALS}
    for intent, cfg in INTENT_SIGNALS.items():
        for keyword in cfg["keywords"]:
            if keyword in text:
                scores[intent] += cfg["weight"]
    if getattr(task, "has_error", False):
        scores["TROUBLESHOOT"] += 4
    if getattr(task, "index_missing", False):
        scores["INDEX"] += 5
    return scores

def select_intents(scores: dict[str, float], ambiguity_delta: float = 1.0, max_intents: int = 2) -> list[str]:
    ranked = sorted(scores.items(), key=lambda item: item[1], reverse=True)
    if not ranked or ranked[0][1] <= 0:
        return ["SEARCH"]
    selected = [ranked[0][0]]
    if len(ranked) > 1 and ranked[1][1] > 0 and (ranked[0][1] - ranked[1][1]) <= ambiguity_delta:
        selected.append(ranked[1][0])
    return selected[:max_intents]

def route_cocoindex_code_resources(task):
    inventory = discover_markdown_resources()
    intents = select_intents(score_intents(task), ambiguity_delta=1.0)
    loaded = []
    seen = set()

    def load_if_available(relative_path: str) -> None:
        guarded = _guard_in_skill(relative_path)
        if guarded in inventory and guarded not in seen:
            load(guarded)  # load() is supplied by the host runtime; it is not defined in this pseudocode
            loaded.append(guarded)
            seen.add(guarded)

    for relative_path in LOADING_LEVELS["ALWAYS"]:
        load_if_available(relative_path)
    for intent in intents:
        for relative_path in RESOURCE_MAP.get(intent, []):
            load_if_available(relative_path)

    text = _task_text(task)
    if any(keyword in text for keyword in LOADING_LEVELS["ON_DEMAND_KEYWORDS"]):
        for relative_path in LOADING_LEVELS["ON_DEMAND"]:
            load_if_available(relative_path)

    if not loaded:
        load_if_available(DEFAULT_RESOURCE)

    return {"intents": intents, "resources": loaded}
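The weighted scoring and ambiguity handling in the router can be tried out in isolation. The sketch below reimplements only those two steps over a two-intent subset of INTENT_SIGNALS; the helper names `score` and `select` and the sample request strings are illustrative assumptions, not part of the skill.

```python
# Two-intent subset of INTENT_SIGNALS from the router, kept small for illustration.
SIGNALS = {
    "SEARCH": {"weight": 4, "keywords": ["search", "find", "where", "similar"]},
    "STATUS": {"weight": 3, "keywords": ["status", "stats", "indexed"]},
}


def score(text: str) -> dict[str, float]:
    """Add an intent's weight once for every keyword found in the text."""
    text = text.lower()
    return {
        intent: float(sum(cfg["weight"] for kw in cfg["keywords"] if kw in text))
        for intent, cfg in SIGNALS.items()
    }


def select(scores: dict[str, float], ambiguity_delta: float = 1.0) -> list[str]:
    """Pick the top intent, plus the runner-up when it is within ambiguity_delta."""
    ranked = sorted(scores.items(), key=lambda item: item[1], reverse=True)
    if ranked[0][1] <= 0:
        return ["SEARCH"]  # same fallback as the router
    chosen = [ranked[0][0]]
    if ranked[1][1] > 0 and ranked[0][1] - ranked[1][1] <= ambiguity_delta:
        chosen.append(ranked[1][0])
    return chosen


clear = select(score("where is the retry logic"))  # only SEARCH scores
close = select(score("find status"))               # SEARCH 4 vs STATUS 3: both kept
```

With the default ambiguity_delta of 1.0, a runner-up intent is loaded only when its score is within one point of the leader, which keeps resource loading narrow for unambiguous requests.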

3. HOW IT WORKS

Two Approaches

CocoIndex Code provides two access patterns for semantic code search:

CLI (ccc) - Direct terminal usage, fastest for one-off searches
MCP server - AI agent integration via ccc mcp (stdio mode)

CLI Approach (Primary) - CocoIndex Code CLI

Semantic Search

bash

# Basic semantic search
ccc search "error handling middleware" --limit 5

# Filter by language
ccc search "database connection" --lang typescript

# Filter by path
ccc search "authentication" --path "src/**"

# Combine filters
ccc search "retry logic" --lang python --path "lib/**" --limit 10
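When the CLI is driven from a script, the same filters can be assembled as an argument list. This is a minimal sketch that uses only the flags shown above (--lang, --path, --limit); the function name `build_search_command` is a hypothetical helper, not part of ccc.

```python
import shlex


def build_search_command(query, lang=None, path=None, limit=None, binary="ccc"):
    """Assemble an argv list for `ccc search` from optional filters."""
    argv = [binary, "search", query]
    if lang:
        argv += ["--lang", lang]
    if path:
        argv += ["--path", path]
    if limit is not None:
        argv += ["--limit", str(limit)]
    return argv


argv = build_search_command("retry logic", lang="python", path="lib/**", limit=10)
# shlex.join quotes each argument, so the glob is not expanded when pasted into a shell.
printable = shlex.join(argv)
```

Passing the list to subprocess.run(argv) avoids shell quoting problems with queries that contain spaces or glob characters.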

Index Management

bash

# Check index status
ccc status

# Build or update the index
ccc index

# Reset project databases (destructive)
ccc reset

Binary Location

text

.opencode/skill/mcp-coco-index/mcp_server/.venv/bin/ccc

Add the containing directory to PATH, or invoke ccc by its full path.

MCP Approach - AI Agent Integration

The MCP server exposes its single tool via ccc mcp, which runs in stdio mode.

MCP Tool:

The MCP server exposes 1 tool only: search. The status, index, and reset operations are CLI-only commands and are NOT available as MCP tools.

Tool	Purpose	Key Parameters
`search`	Semantic search across codebase	`query` (str, required), `languages` (list\|null), `paths` (list\|null), `limit` (int, default 5), `offset` (int, default 0), `refresh_index` (bool, default True)
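For reference, a call to the search tool could be shaped as follows. The argument names and defaults come from the table above, and the envelope follows the MCP convention of a JSON-RPC 2.0 `tools/call` request. MCP clients normally build this envelope themselves, so treat the helper names here as illustrative assumptions.

```python
import json


def search_arguments(query, languages=None, paths=None, limit=5, offset=0,
                     refresh_index=True):
    """Arguments for the `search` tool, using the defaults from the table."""
    return {
        "query": query,
        "languages": languages,
        "paths": paths,
        "limit": limit,
        "offset": offset,
        "refresh_index": refresh_index,
    }


def tools_call_request(request_id, arguments):
    """Wrap tool arguments in a JSON-RPC 2.0 `tools/call` request."""
    return {
        "jsonrpc": "2.0",
        "id": request_id,
        "method": "tools/call",
        "params": {"name": "search", "arguments": arguments},
    }


request = tools_call_request(1, search_arguments("retry logic", languages=["python"]))
payload = json.dumps(request)  # one JSON message, as written to the server's stdin
```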

Embedding Models

CocoIndex Code supports two embedding models, configured via ~/.cocoindex_code/global_settings.yml:

Model	Type	Dimensions	API Key	Best For
`voyage/voyage-code-3` (primary)	Cloud via LiteLLM	1024	`VOYAGE_API_KEY` required	Higher quality code search
`sentence-transformers/all-MiniLM-L6-v2`	Local	384	None	Offline use, no API dependency

CRITICAL: Changing the embedding model requires ccc reset && ccc index because different models produce vectors with different dimensions. Mixing dimensions corrupts the index.
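The reason for the reset can be illustrated with a small dimension check. This is a sketch only: the dimensions come from the table above, while `check_vector` and `needs_rebuild` are hypothetical helpers and say nothing about how CocoIndex Code or sqlite-vec actually detect the mismatch.

```python
# Vector sizes per embedding model, taken from the table above.
MODEL_DIMENSIONS = {
    "voyage/voyage-code-3": 1024,
    "sentence-transformers/all-MiniLM-L6-v2": 384,
}


def check_vector(index_model: str, vector: list[float]) -> None:
    """Reject a vector whose size does not match the model the index was built with."""
    expected = MODEL_DIMENSIONS[index_model]
    if len(vector) != expected:
        raise ValueError(
            f"index built with {index_model} expects {expected} dimensions, "
            f"got {len(vector)}"
        )


def needs_rebuild(indexed_model: str, configured_model: str) -> bool:
    """Any model change means the stored vectors can no longer be compared to queries."""
    return indexed_model != configured_model


# A 384-dim query vector cannot be compared against a 1024-dim index.
try:
    check_vector("voyage/voyage-code-3", [0.0] * 384)
    mismatch_detected = False
except ValueError:
    mismatch_detected = True
```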

See references/settings_reference.md for full configuration details.

Root Path Discovery

CocoIndex Code resolves the project root in this order:

COCOINDEX_CODE_ROOT_PATH environment variable (explicit override)
Nearest parent directory containing .cocoindex_code/ directory
Nearest parent directory with project markers (.git, pyproject.toml, package.json, Cargo.toml, go.mod)
Current working directory (fallback)
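The resolution order above can be sketched as a single function. This is an illustration under stated assumptions, not the tool's actual code: it assumes the starting directory itself counts as a candidate, and it does not special-case the global ~/.cocoindex_code directory. The name `resolve_project_root` is hypothetical.

```python
import os
from pathlib import Path

PROJECT_MARKERS = (".git", "pyproject.toml", "package.json", "Cargo.toml", "go.mod")


def resolve_project_root(start: Path, env=None) -> Path:
    """Apply the four documented rules in order and return the first match."""
    env = os.environ if env is None else env

    # 1. Explicit override via environment variable.
    override = env.get("COCOINDEX_CODE_ROOT_PATH")
    if override:
        return Path(override)

    start = start.resolve()
    candidates = [start, *start.parents]  # nearest directory first

    # 2. Nearest directory containing a .cocoindex_code/ directory.
    for directory in candidates:
        if (directory / ".cocoindex_code").is_dir():
            return directory

    # 3. Nearest directory containing a project marker.
    for directory in candidates:
        if any((directory / marker).exists() for marker in PROJECT_MARKERS):
            return directory

    # 4. Fallback: the starting (current working) directory.
    return start
```

Note that rule 2 is checked across all parents before rule 3, so a .cocoindex_code/ directory higher up wins over a closer package.json.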

Daemon Architecture

The CocoIndex Code daemon manages background indexing and serves search requests:

Auto-start: Starts automatically on the first CLI or MCP command
Auto-restart: Restarts on version mismatch or settings change
Multi-project: ProjectRegistry supports multiple projects simultaneously
Background indexing: Continues after client disconnect
Search during indexing: Search waits for indexing to complete (streams IndexWaitingNotice)
IPC: Binary msgpack over Unix socket (~/.cocoindex_code/daemon.sock)
Logs: ~/.cocoindex_code/daemon.log
PID file: ~/.cocoindex_code/daemon.pid

How Indexing Works

text

STEP 1: File Scanning
       +-- Scans project files respecting .gitignore
       +-- Detects language from file extensions
       +-- Supports 28+ languages (TypeScript, Python, Go, Rust, etc.)
       |
STEP 2: Chunk Splitting
       +-- RecursiveSplitter with language-aware boundaries
       +-- 1000 char chunks, 250 char minimum, 150 char overlap
       +-- Preserves function/class boundaries where possible
       +-- Produces many chunks for a typical large codebase
       |
STEP 3: Embedding Generation
       +-- Primary: voyage/voyage-code-3 via LiteLLM (1024-dim vectors)
       +-- Alternative: all-MiniLM-L6-v2 local model (384-dim vectors)
       +-- Model configured in ~/.cocoindex_code/global_settings.yml
       |
STEP 4: Vector Storage
       +-- Stores vectors in SQLite via sqlite-vec extension
       +-- Indexes for fast approximate nearest neighbor search
       |
STEP 5: Search Execution
       +-- Query text embedded with the same model
       +-- Cosine similarity comparison against stored vectors
       +-- Results ranked by similarity score
       +-- Language and path filters applied post-ranking
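The chunk sizes in step 2 can be made concrete with a plain sliding-window splitter. This sketch is not the RecursiveSplitter: it ignores language-aware boundaries entirely, and its handling of the 250-character minimum (folding a short tail into the previous chunk) is an assumption about what that setting means.

```python
def split_into_spans(text, chunk_size=1000, min_chunk=250, overlap=150):
    """Return (start, end) character spans covering the text with overlapping windows."""
    spans = []
    start = 0
    while start < len(text):
        end = start + chunk_size
        # If fewer than min_chunk characters would remain after this window,
        # absorb them into this chunk instead of emitting a tiny tail.
        if len(text) - end < min_chunk:
            end = len(text)
        spans.append((start, end))
        if end == len(text):
            break
        start = end - overlap  # the next window re-reads the last `overlap` characters
    return spans


spans_2000 = split_into_spans("x" * 2000)
spans_2100 = split_into_spans("x" * 2100)
```

For a 2000-character file the 150-character tail is folded into the second chunk, while a 2100-character file produces three chunks, each sharing 150 characters with its neighbor.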

Search Result Interpretation

Each result includes:

File path - Location of the matching code
Chunk content - The code fragment that matched
Similarity score - Cosine similarity (0.0 to 1.0, higher is better)
Language - Detected programming language
Line range - Start and end lines within the file

Scores above 0.5 typically indicate strong semantic relevance. Always verify results with the Read tool since semantic search can surface false positives.
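The similarity score can be reproduced on toy data. The sketch below ranks three hand-written 2-dimensional vectors by cosine similarity and applies the language filter after ranking, as step 5 of the indexing overview describes. Real embeddings have 384 or 1024 dimensions, and the `Chunk` class and `rank` function are illustrative assumptions.

```python
import math
from dataclasses import dataclass


@dataclass
class Chunk:
    path: str
    language: str
    vector: list[float]


def cosine_similarity(a, b):
    """Dot product divided by the product of the vector lengths."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.hypot(*a) * math.hypot(*b)
    return dot / norm if norm else 0.0


def rank(query_vector, chunks, language=None, limit=5):
    """Score every chunk, sort by score, then filter by language."""
    scored = sorted(
        ((cosine_similarity(query_vector, c.vector), c) for c in chunks),
        key=lambda pair: pair[0],
        reverse=True,
    )
    if language is not None:
        scored = [pair for pair in scored if pair[1].language == language]
    return scored[:limit]


chunks = [
    Chunk("a.py", "python", [1.0, 0.0]),
    Chunk("b.ts", "typescript", [1.0, 1.0]),
    Chunk("c.py", "python", [0.0, 1.0]),
]
all_results = rank([1.0, 0.0], chunks)
python_only = rank([1.0, 0.0], chunks, language="python")
```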

4. RULES

✅ ALWAYS

ALWAYS do these without asking:

ALWAYS check index status before searching
- Run ccc status before the first search in a session
- Confirm files are indexed and the index is not stale
ALWAYS use language filters when the target language is known
- --lang typescript narrows results and improves relevance
- Reduces false positives from similar patterns in other languages
ALWAYS use path filters to narrow scope
- --path "src/**" focuses on application code
- Avoids noise from test files, vendor directories, or build output
ALWAYS verify search results with the Read tool
- Semantic search can return false positives
- Read the actual file to confirm the match before acting on it
ALWAYS suggest reindexing if the codebase has changed significantly
- After major refactors, branch switches, or large merges
- Run ccc index from the project root to refresh the index
ALWAYS use the full binary path if ccc is not on PATH
- .opencode/skill/mcp-coco-index/mcp_server/.venv/bin/ccc
ALWAYS use the helper scripts when readiness is unclear
- bash .opencode/skill/mcp-coco-index/scripts/doctor.sh --strict --require-config
- bash .opencode/skill/mcp-coco-index/scripts/ensure_ready.sh --strict --require-config

Query Optimization

Write short, focused queries -- 2-5 words of natural language outperform long keyword lists:

Style	Example	Why it works
Good	"retry logic patterns"	Tight embedding vector, high similarity to relevant code
Good	"authentication middleware"	Single concept, precise matches
Avoid	"retry helper utilities, exponential backoff, max retries, failed operation retry wrappers"	Embedding dilution -- averages across too many concepts

Tips:

Describe the concept, not the implementation details
Use 2-5 words, not full sentences or keyword lists
If results are too broad, add a language or path filter instead of more keywords
Run multiple focused queries rather than one overloaded query

Concurrent Query Sessions

When sending multiple searches in sequence (e.g., exploring a codebase):

Set refresh_index=false after the first query -- the index only needs refreshing once per session
The daemon has a known concurrency issue where simultaneous refresh_index=true requests can cause ComponentContext errors
CLI equivalent: queries are sequential by nature, so this only applies to MCP usage
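One way to apply the refresh rule is a small session object that flips refresh_index off after the first query. This is a sketch; `SearchSession` is a hypothetical helper, and the dictionaries it returns are meant to be passed as arguments to the MCP search tool.

```python
class SearchSession:
    """Tracks whether the index has already been refreshed in this session."""

    def __init__(self):
        self._refreshed = False

    def arguments(self, query, **filters):
        """Build search arguments: refresh on the first call only."""
        args = {"query": query, "refresh_index": not self._refreshed, **filters}
        self._refreshed = True
        return args


session = SearchSession()
first = session.arguments("retry logic", languages=["python"])
second = session.arguments("authentication middleware")
```

Because follow-up queries never request a refresh, this pattern also avoids sending simultaneous refresh_index=true requests.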

❌ NEVER

NEVER do these:

NEVER assume semantic search is 100% accurate
- Vector similarity is approximate, not exact
- Always cross-reference results with Grep for confirmation
NEVER run ccc reset without user confirmation
- This destroys the entire index
- Rebuilding requires a full reindex which takes time
NEVER use semantic search for exact string matching
- Use Grep for known tokens, function names, or exact patterns
- Semantic search is for conceptual or intent-based queries
NEVER skip the index status check before first search
- An empty or missing index returns no results silently
- Always verify with ccc status first
NEVER ignore low similarity scores
- Results below 0.3 are likely noise
- Focus on results above 0.5 for actionable matches
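The two score thresholds above can be turned into a simple triage step. The 0.3 and 0.5 cut-offs come from this document; the bucket names, including the middle "review" band, are assumptions made for illustration.

```python
def triage(score: float) -> str:
    """Bucket a similarity score using the 0.3 and 0.5 thresholds."""
    if score > 0.5:
        return "actionable"  # still confirm with Read before acting
    if score < 0.3:
        return "noise"
    return "review"  # hypothetical middle band: inspect manually


buckets = {score: triage(score) for score in (0.72, 0.41, 0.18)}
```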

⚠️ ESCALATE IF

Ask user when:

ESCALATE IF index is empty or missing
- Guide user through ccc index to build the initial index
- Check if the binary is installed with command -v ccc
ESCALATE IF search returns no results for reasonable queries
- Index may be stale - suggest ccc index
- Query may need rephrasing with different terminology
ESCALATE IF binary not found
- Run the install script: bash .opencode/skill/mcp-coco-index/scripts/install.sh
- Verify Python 3.11+ is available
ESCALATE IF index build fails
- Check Python version (requires 3.11+)
- Check disk space for SQLite database
- Review error output for missing dependencies
ESCALATE IF results seem irrelevant despite high scores
- The embedding model may not capture domain-specific terminology well
- Suggest combining semantic search with Grep for better coverage
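Two of the escalation checks (binary on PATH, Python 3.11+) can be run before asking the user anything. This is a minimal sketch; `preflight` is a hypothetical helper and does not replace doctor.sh or ensure_ready.sh.

```python
import shutil
import sys

MIN_PYTHON = (3, 11)


def preflight(binary="ccc", python_version=None):
    """Return a list of human-readable problems; an empty list means both checks passed."""
    if python_version is None:
        python_version = tuple(sys.version_info[:2])
    problems = []
    if shutil.which(binary) is None:
        problems.append(f"{binary} not found on PATH")
    if python_version < MIN_PYTHON:
        problems.append("Python 3.11+ is required")
    return problems
```

An empty result only means the prerequisites are present; index state still needs ccc status.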

5. REFERENCES

Essential CLI Commands

bash

# Search
ccc search "query text"                    # Basic search
ccc search "query" --limit 10             # Limit results
ccc search "query" --lang typescript      # Filter by language
ccc search "query" --path "src/**"        # Filter by path

# Index management
ccc status                                 # Check index status
ccc index                                  # Build/update index
ccc reset                                  # Reset databases (destructive)

# Helper scripts
bash .opencode/skill/mcp-coco-index/scripts/doctor.sh [--json] [--strict] [--require-config] [--require-daemon] [--expect-config <path>]
bash .opencode/skill/mcp-coco-index/scripts/ensure_ready.sh [--json] [--refresh-index] [--strict] [--require-config] [--expect-config <path>]

MCP Tool Summary

The CocoIndex MCP server exposes search as its only tool. Three additional management tools are available via the Spec Kit Memory MCP server's code graph module:

Tool	Server	Description	Key Parameters
`search`	CocoIndex	Semantic search across code	`query` (str), `languages` (list\|null), `paths` (list\|null), `limit` (int, default 5), `offset` (int, default 0), `refresh_index` (bool, default true)
`ccc_status`	Spec Kit Memory	Check CocoIndex availability and index stats	none
`ccc_reindex`	Spec Kit Memory	Trigger incremental or full re-indexing	`full` (bool, default false)
`ccc_feedback`	Spec Kit Memory	Submit search result quality feedback	`query` (str), `rating` (helpful\|not_helpful\|partial), `comment` (str, optional)

Note: refresh_index defaults to true. Use the default on the first query in a session, then switch follow-up queries to false when the codebase has not changed to avoid ComponentContext errors.

Companion recovery surface: In the integrated Spec Kit workflow, hookless runtimes should use session_bootstrap as the first recovery call, then use session_resume when they need the fuller merged recovery payload that includes direct CocoIndex availability fields.

Supported Languages

CocoIndex Code supports 28+ languages with language-aware chunk splitting:

Language	Extension	Language	Extension
TypeScript	.ts, .tsx	Python	.py
JavaScript	.js, .jsx	Go	.go
Rust	.rs	Java	.java
C	.c, .h	C++	.cpp, .hpp
C#	.cs	Ruby	.rb
PHP	.php	Swift	.swift
Kotlin	.kt	Shell	.sh
CSS	.css	DTD	.dtd
Fortran	.f, .f90	HTML	.html
JSON	.json	Lua	.lua
Pascal	.pas	R	.r, .R
Scala	.scala	Solidity	.sol
TOML	.toml	XML	.xml
YAML	.yml, .yaml	SQL	.sql

Decision Tree: Which Search Tool?

text

Need to find code?
  |
  +-- Know the exact text/token?
  |     YES --> Use Grep
  |
  +-- Know the file name?
  |     YES --> Use Glob
  |
  +-- Searching by concept/intent?
        YES --> Use ccc search
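The same decision tree, written as a function for agents that route programmatically. The function name and boolean parameters are illustrative assumptions; the branch order matches the tree above.

```python
def choose_search_tool(knows_exact_text: bool, knows_file_name: bool) -> str:
    """Walk the decision tree top to bottom and return the tool to use."""
    if knows_exact_text:
        return "Grep"
    if knows_file_name:
        return "Glob"
    return "ccc search"  # concept or intent based lookup


tool = choose_search_tool(knows_exact_text=False, knows_file_name=False)
```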

6. SUCCESS CRITERIA

Semantic Search Completion Checklist

Search workflow complete when:

Index status verified (ccc status shows files indexed)
Search returns relevant results for the natural language query
Results verified via Read tool match the semantic intent
Language and path filters applied where appropriate
False positives identified and filtered out
User receives actionable file paths and code locations

Quality Targets

Search relevance: Top 3 results contain at least one true match
Response time: Search returns results within 5 seconds
Filter accuracy: Language/path filters narrow results to target scope
Verification rate: All suggested matches confirmed via Read tool

7. INTEGRATION POINTS

Framework Integration

This skill operates within the behavioral framework defined in AGENTS.md.

Key integrations:

Gate 2: Skill routing via skill_advisor.py
Tool Routing: Per AGENTS.md Section 6 decision tree
Memory: Context preserved via Spec Kit Memory MCP

Complements Grep and Glob

Semantic search fills the gap between exact pattern matching and conceptual code discovery:

Tool	Best For	Limitation
Grep	Exact text, regex patterns	Cannot find conceptual matches
Glob	File names, path patterns	Cannot search file contents
ccc	Intent-based, conceptual search	Approximate - needs verification

Combined workflow example:

bash

# Step 1: Semantic search to find candidate files
ccc search "rate limiting middleware" --lang typescript --limit 5

# Step 2: Grep to verify exact patterns in candidates
grep -rn "rateLimit" src/middleware/

# Step 3: Read to confirm implementation details
# Use Read tool on the matched files

Related Skills

mcp-code-mode: For external tool integration via MCP

CocoIndex Code handles code search; Code Mode handles external APIs

system-spec-kit: For context preservation

Search results and decisions can be saved as memory for future sessions

Tool Usage Guidelines

Bash: All ccc commands, index management, status checks
Read: Verify search results by reading matched files
Grep: Confirm exact patterns after semantic search narrows candidates
Glob: Locate files by name when semantic search identifies a module

External Tools

CocoIndex Code (ccc):

Installation: bash .opencode/skill/mcp-coco-index/scripts/install.sh
Update: bash .opencode/skill/mcp-coco-index/scripts/update.sh
Purpose: Semantic code search via vector embeddings
Requires: Python 3.11+, SQLite with sqlite-vec extension

8. RELATED RESOURCES

scripts/

Script	Purpose	Usage
install.sh	Install CocoIndex	`bash .opencode/skill/mcp-coco-index/scripts/install.sh`
update.sh	Update to latest	`bash .opencode/skill/mcp-coco-index/scripts/update.sh`
doctor.sh	Read-only health check	`bash .opencode/skill/mcp-coco-index/scripts/doctor.sh [--json] [--strict] [--require-config] [--require-daemon] [--expect-config <path>]`
ensure_ready.sh	Idempotent bootstrap	`bash .opencode/skill/mcp-coco-index/scripts/ensure_ready.sh [--json] [--refresh-index] [--strict] [--require-config] [--expect-config <path>]`

references/

Document	Purpose	Key Insight
tool_reference.md	Complete CLI/MCP docs	All commands, parameters, and options
search_patterns.md	Search query patterns	Effective query formulation and filter usage
cross_cli_playbook.md	Cross-CLI usage recipe	Safe defaults for repeated searches and troubleshooting
downstream_adoption_checklist.md	Downstream rollout checklist	Minimum bundle for sibling-repo adoption
settings_reference.md	Global settings config	Embedding model switching, daemon settings

assets/

Asset	Purpose
config_templates.md	MCP server config examples

Guides

INSTALL_GUIDE.md - Installation and initial setup
README.md - Skill overview and quick start

Related Skills

mcp-code-mode - MCP orchestration for external tools
system-spec-kit - Context preservation and memory
