dokoro — Agentic memory, engineered

§ 00 — Get started

Install per project

On npm as dokoro. One command — no clone, no build. Each project gets its own isolated memory in ./dokoro, so sessions, entities and tool-trust never leak between repos.

add to a project · run from the project directory

# add dokoro to the current project (Claude Code, or any MCP client)
claude mcp add dokoro -- npx -y dokoro

# CLI subcommands
npx dokoro init        # scaffold the .dokoro workspace
npx dokoro migrate     # run DB migrations
npx dokoro browse      # interactive memory browser (TUI)

# lean install — skip the ~100MB native vector deps (lazy-loaded)
npm install --omit=optional

§ 01 — The Five Layers

Memory, separated by function

Most memory plugins dump everything into one vector store and retrieve by fuzzy similarity. dokoro follows the CoALA taxonomy used by Letta, Zep and Mem0: each layer owns a different question, a storage home, and its own tools — so the agent fetches the right kind of memory, not the most textually similar one. Each layer is also retained for a different span — from a single task to all-time.

TOGETHER · one turn of work — read in, act, write back

reads — each layer answers its own question writes back — what was learned persists

§ 02 — The Lineage

How agent memory evolved

Agent memory has moved through four generations — from no memory at all, to one big similarity bucket, to tiered OS-like memory, to temporal graphs. dokoro sits at the current edge: function-separated layers, bi-temporal facts, and an affective layer that learns which tools to trust.

Gen 0 · 2022–23 — the bare LLM

The context window was the memory

An agent's only memory was its prompt — wiped at the end of every session. It re-learned the codebase, re-discovered decisions, and repeated tools that had already failed. Nothing persisted.

Gen 1 · 2023 — Mem0, early RAG

One vector store, retrieve by similarity

"Memory" became a single embedding index: dump everything in, pull back the most textually similar chunks. Useful, but undifferentiated — a stale plan, a one-off fact and a failed-tool note all compete in the same fuzzy ranking.

Gen 2 · 2023–24 — MemGPT / Letta

Tiered, OS-like memory

Borrowing from virtual memory: a small in-context "core" plus a large archival store the agent pages in and self-edits. Structure arrives — but the tiers are about size and recency, not about what kind of thing is being remembered.

Gen 3 · 2024 — Zep / Graphiti

The temporal knowledge graph

Facts became first-class and time-aware: entities and relations with validity windows, so you can ask what the graph believed at a past moment instead of overwriting history. Memory gains a timeline.

Gen 4 · 2025 — dokoro

Function-separated + affective

dokoro keeps the temporal graph (bi-temporal relations) and adds two moves no popular OSS lib makes together: memory split into five purpose-built layers (CoALA), and an affective layer that records every tool outcome and turns it into a routing policy — so the agent learns which tools and models to trust, not just what it once saw.

§ 04 — The Distinguishing Trait

Affective memory — learning what to trust

Every tool outcome is recorded; the agent asks dokoro_feedback_route for a ranked track record and biases itself accordingly. No other popular OSS memory library — Mem0, Letta, Zep, Cognee, LangMem — does this natively.

PIPELINE · feedback_record → ranked feedback_route

—Outcomes become a routing policy

Outcome & latency recorded per call; confidence when provided

dokoro_feedback_route · ranked scores

# MCP tools/call — ranked routing for this agent
{
  "name": "dokoro_feedback_route",
  "arguments": {
    "agent_id": "claude-code",
    "half_life_days": 14
  }
}

dokoro_session_recall
  n=89  decayed_rate=1.000
  wilson_lower=0.9583  confident=true
dokoro_entity_extract_deep
  n=142 success=125 timeout=15
  wilson_lower=0.8213  confident=true

Wilson lower bound rank

Ranking uses a Wilson lower bound on the success rate — a single lucky success can't outrank a long, proven record. The agent prefers the higher wilson_lower.

Recency decay half-life

Stale failures fade via half_life_days — an outcome from three months ago counts for less than one from yesterday.

Confidence gate sample size

A confident flag flips true only once a tool clears the minimum sample size. Raw aggregates stay in dokoro_feedback_query.

§ 05 — Time Travel

Bi-temporal facts

Every entity_relations row carries valid_from / valid_to (Zep / Graphiti-style). Facts are never destructively overwritten — a superseded fact has its window closed and a new slice opens. Drag as_of to query the graph at any point in time.

ENTITY · auth/session.ts —[uses]→ ?

uses → stateful-sessions

uses → jwt-stateless-tokens

Jan '26MarAprJunnow

2026-05-12

Window-closing on supersession is active for single-valued relations; genuinely many-valued relations like depends_on accumulate concurrent open facts instead of evicting each other. History is never deleted — it only stops surfacing in the default "now" view.

§ 06 — Persistence

Three backends, each to its strength

Structured data in SQLite, vectors in LanceDB, human-readable state on the filesystem. Hybrid search fuses FTS5 + vectors by Reciprocal Rank Fusion; an optional local LLM (Ollama) adds embeddings and deep extraction — the server runs fine without it.

SQLite · structured

Drizzle ORM

docs · entities · entity_relations
sessions · time_entries
tags · doc_tags · doc_entities
conversation_summaries
agent_feedback (affective)
FTS5 full-text index

LanceDB · vectors

semantic recall

doc_vectors + chunks
512-token windows, 128 overlap
nomic-embed-text embeddings
cosine similarity recall
RRF-fused with FTS5

Filesystem · readable

working state

current-workspace.md
daily/*.md session logs
plans/*.json (procedural)
questions.json
assets/* · lock.json

Ollama · optional

graceful fallback

nomic-embed-text → embeddings
llama3.2 → deep extraction
without it: regex extraction
incremental SHA-256 indexing

§ 07 — The Companion

The tachibot bridge

tachibot-mcp does the thinking; dokoro remembers it. Three opt-in bridge tools (DOKORO_ENABLE_TACHIBOT_BRIDGE=true) land each model's stateless output in the right layer — and feed it back into the next decision. Want the pair wired together out of the box? tachi-agent is the local-first orchestrator that fuses dokoro memory with the tachibot council.

LOOP · tachibot reasons → dokoro remembers → context flows back

—Stop re-researching what you already looked up

→ semantic (LanceDB) → procedural (plans) context back to tachibot

§ 08 — The Dashboard

Supervise agents live

npx dokoro browse is a terminal dashboard over the whole memory folder — ten categories, from the live workspace and plans to file claims, agent presence, open questions and the affective feedback ledger. File watchers and pollers keep every list live, changed preview lines flash, / fuzzy-filters, s runs hybrid FTS5+vector search, and ? overlays every keybinding.

dokoro browse · file claims — live

dokoro › File claims
──────────────────────────────────────────
▸ src/auth/session.ts    alice · live · 4m left
  src/cli/browse-ui.tsx  bob · stale · expired
──────────────────────────────────────────
↑/↓ move · enter open · r release · ? help · 1/2
⚑ holder is live — not releasing

Observe-first council-ruled

Read everywhere, write only where gated. A multi-model council rejected command palettes, multi-select and inline editing — a coordination dashboard must never race the agents it watches.

Liveness gate r · claims

r releases a stuck file claim only when the holder's heartbeat is stale past the 900s TTL or the lease expired. A live holder is refused — no force flag exists.

Fresh-read, drift abort p · plans

p advances a plan one legal step (draft→active→completed). The plan is re-read before writing; if its status drifted since you confirmed, the write aborts.

GATED MUTATION · key → confirm → fresh read → gate → write / refuse

—Press replay to trace an action through the gates

write lands, race-guarded in SQL refused with a toast — no force path

§ 10 — The Survey

How it compares

Four capabilities set dokoro apart — bi-temporal facts, per-agent affective feedback, workspace lock coordination, and WAL concurrency — all queryable as plain MCP tool calls or visible directly in the SQLite schema.

Project	Architecture	Native temporal	Native affective	Native multi-agent	Concurrent access
dokoro	SQLite + LanceDB + entity graph	✓ bi-temporal	✓ agent_feedback	✓ shared editable blocks + handoff + advisory file claims	✓ WAL + busy_timeout=5000
Mem0	Vector + optional graph	—	—	—	—
Letta (MemGPT)	Tiered, OS-like, self-editing	◐ via metadata	◐ via metadata	◐ shared blocks	—
Zep / Graphiti	Temporal knowledge graph	✓ bi-temporal	—	—	—
Cognee	Graph + vector poly-store	◐ partial	—	—	—
LangMem	Modular over LangGraph	—	—	—	—

§ 11 — Under the Hood

How it holds together

Layer-driven, gracefully degrading, and built on the MCP TypeScript SDK.

i · TAXONOMY

Function-separated memory

Five layers along the CoALA taxonomy — the agent retrieves the right kind of memory, not the most similar text.

ii · TEMPORAL

Non-destructive facts

Supersession closes a fact's valid_to window and opens a new slice. as_of replays the graph at any moment.

iii · AFFECTIVE

Outcomes as policy

Wilson lower bound + recency decay turn per-tool records into a routing signal the agent can trust.

iv · STORAGE

Right tool per job

SQLite for structure, LanceDB for vectors, files for human-readable state — fused by Reciprocal Rank Fusion.

v · DEGRADE

Runs without Ollama

No local LLM? Regex extraction takes over and recall falls back to recency. Nothing hard-fails.

vi · BRIDGE

Zero cost when off

Three opt-in tools wire tachibot's multi-model output into memory. Disabled by default.

vii · CLAIMS

Multi-agent file claims

Advisory per-file leases (dokoro_file_claim, default 300s) so agents sharing a worktree see who is editing what. Conflicts warn — they never block — and expired or dead-holder claims are taken over.

viii · BROWSE

dokoro browse

A live terminal dashboard over all ten memory categories — questions and feedback included — with ? help, fuzzy filter, hybrid search, and gated actions: release a stale claim, advance a plan. Falls back to a static summary when not a TTY.

ix · SCOPE

Strictly per-project

Isolation is a ruled invariant, not a default: one memory per project, cross-project writes forbidden, no registry, no --all. Sandboxes and CI get an explicit path — memory never leaks between repos.

An agent thatremembers.