Agent skill

agentdb-vector-search-optimization

Optimize AgentDB vector search performance using quantization for 4-32x memory reduction, HNSW indexing for 150x faster search, caching, and batch operations for scaling to millions of vectors.

Stars 232
Forks 15

Install this agent skill to your Project

npx add-skill https://github.com/aiskillstore/marketplace/tree/main/skills/dnyoussef/agentdb-vector-search-optimization

Metadata

Additional technical details for this skill

tags
agentdb optimization quantization hnsw-indexing performance
author
claude-flow
created
1761782400

SKILL.md

AgentDB Vector Search Optimization

Overview

Optimize AgentDB performance with quantization (4-32x memory reduction), HNSW indexing (150x faster search), caching, and batch operations for scaling to millions of vectors.

SOP Framework: 5-Phase Optimization

Phase 1: Baseline Performance (1 hour)

  • Measure current metrics (latency, throughput, memory)
  • Identify bottlenecks
  • Set optimization targets

Phase 2: Apply Quantization (1-2 hours)

  • Configure product quantization
  • Train codebooks
  • Apply compression
  • Validate accuracy

Phase 3: Implement HNSW Indexing (1-2 hours)

  • Build HNSW index
  • Tune parameters (M, efConstruction, efSearch)
  • Benchmark speedup

Phase 4: Configure Caching (1 hour)

  • Implement query cache
  • Set TTL and eviction policies
  • Monitor hit rates

Phase 5: Benchmark Results (1-2 hours)

  • Run comprehensive benchmarks
  • Compare before/after
  • Validate improvements

Quick Start

typescript
import { AgentDB, Quantization, QueryCache } from 'agentdb-optimization';

const db = new AgentDB({ name: 'optimized-db', dimensions: 1536 });

// Quantization (4x memory reduction)
const quantizer = new Quantization({
  method: 'product-quantization',
  compressionRatio: 4
});
await db.applyQuantization(quantizer);

// HNSW indexing (150x speedup)
await db.createIndex({
  type: 'hnsw',
  params: { M: 16, efConstruction: 200 }
});

// Caching
db.setCache(new QueryCache({
  maxSize: 10000,
  ttl: 3600000
}));

Optimization Techniques

Quantization

  • Product Quantization: 4-8x compression
  • Scalar Quantization: 2-4x compression
  • Binary Quantization: 32x compression

Indexing

  • HNSW: 150x faster, high accuracy
  • IVF: Fast, partitioned search
  • LSH: Approximate search

Caching

  • Query Cache: LRU eviction
  • Result Cache: TTL-based
  • Embedding Cache: Reuse embeddings

Success Metrics

  • Memory reduction: 4-32x
  • Search speedup: 150x
  • Accuracy maintained: > 95%
  • Cache hit rate: > 70%

Additional Resources

Expand your agent's capabilities with these related and highly-rated skills.

aiskillstore/marketplace

perigon-backend

Perigon ASP.NET Core + EF Core + Aspire conventions

232 15
Explore
aiskillstore/marketplace

perigon-agent

Pointers for Copilot/agents to apply Perigon conventions

232 15
Explore
aiskillstore/marketplace

perigon-angular

Angular 21+ standalone/Material/signal conventions for Perigon WebApp

232 15
Explore
aiskillstore/marketplace

fastapi-mastery

Comprehensive FastAPI development skill covering REST API creation, routing, request/response handling, validation, authentication, database integration, middleware, and deployment. Use when working with FastAPI projects, building APIs, implementing CRUD operations, setting up authentication/authorization, integrating databases (SQL/NoSQL), adding middleware, handling WebSockets, or deploying FastAPI applications. Triggered by requests involving .py files with FastAPI code, API endpoint creation, Pydantic models, or FastAPI-specific features.

232 15
Explore
aiskillstore/marketplace

context7-efficient

Token-efficient library documentation fetcher using Context7 MCP with 86.8% token savings through intelligent shell pipeline filtering. Fetches code examples, API references, and best practices for JavaScript, Python, Go, Rust, and other libraries. Use when users ask about library documentation, need code examples, want API usage patterns, are learning a new framework, need syntax reference, or troubleshooting with library-specific information. Triggers include questions like "Show me React hooks", "How do I use Prisma", "What's the Next.js routing syntax", or any request for library/framework documentation.

232 15
Explore
aiskillstore/marketplace

browser-use

Browser automation using Playwright MCP. Navigate websites, fill forms, click elements, take screenshots, and extract data. Use when tasks require web browsing, form submission, web scraping, UI testing, or any browser interaction.

232 15
Explore

Didn't find tool you were looking for?

Be as detailed as possible for better results