Project Overview

Agent Memory

What This Is

A local, append-only conversational memory system for AI agents (Claude Code, OpenCode, Gemini CLI, GitHub Copilot CLI) that supports agentic search via a permanent hierarchical Table of Contents (TOC), grounded in time-based navigation. The TOC acts as a Progressive Disclosure Architecture: the agent always starts with summaries and navigates downward only when needed. Indexes (vector/BM25) are accelerators, not dependencies.

Core Value

An agent can answer "what were we talking about last week?" without scanning everything.

Time-based TOC navigation beats brute-force search. If everything else fails, the TOC + time hierarchy must work.

Progressive Disclosure Architecture (PDA)

The TOC implements Progressive Disclosure Architecture — the same pattern used in well-designed Agentic Skills. Just as a skill reveals complexity progressively, Agent Memory reveals conversation detail progressively:

Agentic Skills	Agent Memory
Start simple, reveal options as needed	Start with summaries, reveal events as needed
Agent discovers capabilities through exploration	Agent discovers answers through navigation
Complexity hidden until required	Raw events hidden until required

The key insight: Agentic search beats brute-force scanning.

Instead of loading thousands of events into context, an agent navigates:

Year → "2024: heavy focus on authentication" → drill down
Week → "Week 3: JWT implementation" → drill down
Day → "Thursday: token expiration debugging" → drill down
Segment → Summary bullets with grip links → expand grip
Grip → Raw event excerpt with full context → answer verified

This mirrors how humans search email: filter by date, scan subjects, open the relevant thread. The agent never reads everything — it uses summaries to navigate to exactly what it needs.

Requirements

Validated (v1.0.0 - Shipped 2026-01-30)

Storage & Foundation

Append-only event storage in RocksDB with time-prefixed keys
6 column families: events, toc_nodes, toc_latest, grips, outbox, checkpoints
Checkpoint-based crash recovery for background jobs
Per-project RocksDB instances
Configurable multi-agent mode (unified store with tags OR separate stores)

TOC Hierarchy

Time-based TOC hierarchy (Year → Month → Week → Day → Segment)
TOC nodes store title, bullets, keywords, child_node_ids
Segment creation on time threshold (30 min) or token threshold (4K)
Segment overlap for context continuity (5 min or 500 tokens)
Day/Week/Month rollup jobs with checkpointing
Versioned TOC nodes (append new version, don't mutate)

Grips (Provenance)

Grip struct with excerpt, event_id_start, event_id_end, timestamp, source
TOC node bullets link to supporting grips
Grips stored in dedicated column family
ExpandGrip returns context events around excerpt

Summarization

Pluggable Summarizer trait (async, supports API and local LLM)
Summarizer generates title, bullets, keywords from events
Summarizer extracts grips as evidence for bullets
Rollup summarizer aggregates child node summaries

gRPC Service & Query

gRPC IngestEvent RPC accepts Event message
GetTocRoot, GetNode, BrowseToc RPCs for TOC navigation
GetEvents, ExpandGrip RPCs for event retrieval
Health check and reflection endpoints

Integration

Hook handlers call daemon's IngestEvent RPC
CCH integration via memory-ingest binary (fail-open)
Claude Code plugin with 3 commands and memory-navigator agent
Query CLI for manual TOC navigation
Admin CLI for rebuild-toc, compact, status

Active (v2.0 Planning)

Teleport (Indexes as Accelerators)

BM25 teleport via Tantivy (embedded)
Vector teleport via local HNSW
Outbox-driven index ingestion (rebuildable)
Teleports return TOC node IDs or grip pointers, never content

Additional Hook Adapters

OpenCode hook adapter
Gemini CLI hook adapter

Production Hardening

Automated E2E tests in CI
RebuildToc admin command full implementation
Performance benchmarks

Out of Scope

Graph database — TOC is a tree stored as records, no graph DB needed
Multi-tenant concerns — single agent, local deployment
Deletes / mutable history — append-only truth
"Search everything all the time" — agentic navigation, not brute-force
Premature optimization — teleports come in Phase 2
HTTP server — gRPC only
MCP integration — hooks are passive listeners, no token overhead

Context

Ingestion via Hooks (Passive Capture)

Conversations are captured via agent hooks (Claude Code, OpenCode, Gemini CLI, GitHub Copilot CLI). Hook handlers send events to the daemon via gRPC. This is zero-token-overhead passive listening.

Event types (1:1 from hooks):

Hook Event	Memory Event
SessionStart	session_start
UserPromptSubmit	user_message
PostToolUse	tool_result
Stop	assistant_stop
SubagentStart	subagent_start
SubagentStop	subagent_stop
SessionEnd	session_end

Query Path

CLI client and agent skill query the daemon. Agent receives TOC navigation tools:

get_toc_root — top-level time periods
get_node(node_id) — drill into specific period
get_events(time_range) — raw events (last resort)
expand_grip(grip_id) — context around excerpt
teleport_query(query) — Phase 2+ index jump

Related Work

code_agent_context_hooks repo contains working hook handlers for Claude Code. This memory system is the backend those hooks feed into.

Constraints

Language: Rust — single binary, fast scans, predictable memory
API: gRPC only (tonic/prost) — no HTTP server
Storage: RocksDB — embedded, fast range scans, column families
Deployment: Standalone daemon, per-project stores
Platforms: macOS, Linux, Windows (cross-compile)
Multi-agent: Configurable — unified store (events tagged) or separate stores
Summarizer: Pluggable trait — API (Claude/GPT) or local inference
Config: Layered — defaults → config file (~/.config/agent-memory/) → env vars → CLI flags
Testing: Unit + Integration + Property-based + IQ/OQ

Key Decisions

Decision	Rationale	Outcome
TOC as primary navigation	Agentic search beats brute-force; indexes are disposable	✓ Validated in v1.0
Append-only storage	Immutable truth, no deletion complexity	✓ Validated in v1.0
Hooks for ingestion	Zero token overhead, works across agents	✓ Validated in v1.0
Per-project stores first	Simpler mental model, namespace for unified later	✓ Validated in v1.0
Time-only TOC for MVP	Topics deferred to Phase 4, time is sufficient for v1	✓ Validated in v1.0
gRPC only (no HTTP)	Clean contract, no framework churn	✓ Validated in v1.0
Pluggable summarizer	Start with API, swap to local later	✓ Validated in v1.0
Fail-open CCH integration	Never block Claude if memory is down	✓ Validated in v1.0

Last updated: 2026-01-30 after v1.0.0 milestone completion

Project Overview

Agent Memory

What This Is

Core Value

Progressive Disclosure Architecture (PDA)

Requirements

Validated (v1.0.0 - Shipped 2026-01-30)

Active (v2.0 Planning)

Out of Scope

Context

Constraints

Key Decisions

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Clone this wiki locally