Agent skill

memora

Stars 386
Forks 46

Install this agent skill to your Project

npx add-skill https://github.com/agentic-box/memora/tree/main/skills/memora

SKILL.md

Memora — Persistent memory and knowledge management

Memora is an MCP memory server. Use it at session start to load context, when the user asks about past work or stored knowledge, and when saving important information for future sessions.

When to invoke

  • Session start: load relevant memories for the current task
  • User asks about past work, decisions, or stored knowledge
  • Saving important findings, decisions, architecture notes, or research
  • User explicitly asks to remember, recall, or search memories

Tool usage guidelines

Retrieval — use follow for clean results

Always use follow="active" when browsing or listing memories. This excludes superseded memories, so you only see current information — not outdated versions.

memory_list(tags_any=["memora/todos"], follow="active")
memory_semantic_search(query="cloud backend", follow="active")
memory_hybrid_search(query="swap loss research", follow="active")

Use follow="latest" when fetching a specific memory that may have been updated. This resolves through supersession chains to the current version automatically.

memory_get(memory_id=157, follow="latest")  # returns #170 if 157 was superseded
memory_semantic_search(query="roadmap", follow="latest")  # deduplicates versions

Use follow="full_history" to see the evolution of a topic. Returns all versions in the supersession chain, root to leaf.

memory_get(memory_id=170, follow="full_history")  # returns history: [#157, #170]

When to skip follow: When you need raw results regardless of supersession state (e.g., debugging, auditing, or examining the full unfiltered store).

Saving knowledge — use memory_absorb

Prefer memory_absorb over memory_create when saving knowledge. Absorb automatically checks for duplicates, supersedes outdated memories, links related ones, and consolidates related new facts into single richer memories.

Write detailed, context-rich facts — not tiny one-liners. Each fact should be a full sentence or short paragraph with enough context to be useful on its own. Related facts passed together are automatically merged into a single consolidated memory via LLM synthesis.

memory_absorb(
    facts=[
        "clmux v0.4.16.1 fixes hidden pane text leakage by restricting tmux allow-passthrough to only the visible TUI window instead of globally, preventing hidden reviewer and parking windows from leaking escape sequences",
        "clmux sidebar now filters out _reviewers and parking windows from list-panes queries, and the ! notification indicator persists across workspace switches until the user responds"
    ],
    source="manual",
    tags=["clmux", "bugfix"]
)

Absorb handles dedup and consolidation automatically:

  • Duplicate → skipped (no new memory created)
  • Update → creates new memory + supersedes the old one
  • Contradiction → creates new memory + links with contradicts edge
  • Related → creates new memory + links with related_to edge
  • New → creates new memory (no matches found)
  • Consolidated → related new facts merged into a single richer memory

Use dry_run=True to preview what absorb would do without writing.

Use memory_create directly only for:

  • Raw/unprocessed content you want stored verbatim
  • Structured entries (TODOs, issues, sections) via memory_create_todo, memory_create_issue, memory_create_section

Updating memories

Use memory_update only for corrections (typos, metadata fixes) — not for evolving knowledge. For evolving knowledge, use memory_absorb — it handles supersession automatically.

Linking

Use typed edges to express relationships:

  • supersedes — new version replaces old (enables lineage walking)
  • contradicts — conflicting information (flag for resolution)
  • implements — concrete implementation of a plan/design
  • extends — builds upon existing knowledge
  • references — general reference/citation
  • related_to — loose association

Search strategy

  1. Start with memory_semantic_search for conceptual queries (follow="active")
  2. Use memory_hybrid_search when you need both keyword and semantic matching
  3. Use memory_list with tag/metadata filters for structured browsing
  4. Use fields parameter to reduce response size: fields=["id", "content_preview", "tags"]
  5. Use content_mode="preview" (default) for scanning, content_mode="full" only when you need complete content

Context efficiency

  • Default limit=20 on list. Use limit=-1 only when you truly need everything.
  • Use fields projection to fetch only what you need.
  • Prefer content_mode="preview" (default) over full content for scanning.
  • Use memory_get with specific IDs after finding relevant results via search.

Expand your agent's capabilities with these related and highly-rated skills.

agentic-box/memora

memora

Use when working with persistent memory across sessions, storing/retrieving knowledge, managing TODOs/issues, or when context from previous sessions would be helpful.

386 46
Explore
davila7/claude-code-templates

verl-rl-training

Provides guidance for training LLMs with reinforcement learning using verl (Volcano Engine RL). Use when implementing RLHF, GRPO, PPO, or other RL algorithms for LLM post-training at scale with flexible infrastructure backends.

23,776 2,298
Explore
davila7/claude-code-templates

openrlhf-training

High-performance RLHF framework with Ray+vLLM acceleration. Use for PPO, GRPO, RLOO, DPO training of large models (7B-70B+). Built on Ray, vLLM, ZeRO-3. 2× faster than DeepSpeedChat with distributed architecture and GPU resource sharing.

23,776 2,298
Explore
davila7/claude-code-templates

gguf-quantization

GGUF format and llama.cpp quantization for efficient CPU/GPU inference. Use when deploying models on consumer hardware, Apple Silicon, or when needing flexible quantization from 2-8 bit without GPU requirements.

23,776 2,298
Explore
davila7/claude-code-templates

Claude Code Guide

Master guide for using Claude Code effectively. Includes configuration templates, prompting strategies "Thinking" keywords, debugging techniques, and best practices for interacting with the agent.

23,776 2,298
Explore
davila7/claude-code-templates

qdrant-vector-search

High-performance vector similarity search engine for RAG and semantic search. Use when building production RAG systems requiring fast nearest neighbor search, hybrid search with filtering, or scalable vector storage with Rust-powered performance.

23,776 2,298
Explore

Didn't find tool you were looking for?

Be as detailed as possible for better results