
Memory Segmentation

An agent-agnostic memory architecture that separates session, semantic, episodic, and procedural memory for reliable long-running workflows.

Memory segmentation is a second axis on top of Memory Types. Types describe the kind of memory (rule, decision, fact, note, skill). Segmentation describes where memory lives across the lifecycle (session, semantic, episodic, procedural).

Why Segmentation Matters

LLM agents are stateless between calls. If all memory is treated as one flat transcript, quality degrades as context grows.

Segmentation fixes this by giving each memory lane one clear job:

  • Session memory: active working context
  • Semantic memory: durable truths and preferences
  • Episodic memory: chronological history and boundary snapshots
  • Procedural memory: reusable workflows and playbooks

This model is agent-agnostic. It works whether you use CLI tools, MCP clients, SDK-based apps, or custom harnesses.
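The four lanes can be expressed as a small data model. This is a minimal sketch, not a memories.sh API: `Lane`, `MemoryEntry`, and the example entry are hypothetical names chosen for illustration.

```python
from dataclasses import dataclass, field
from datetime import datetime, timezone
from enum import Enum

class Lane(Enum):
    SESSION = "session"        # active working context
    SEMANTIC = "semantic"      # durable truths and preferences
    EPISODIC = "episodic"      # chronological history and snapshots
    PROCEDURAL = "procedural"  # reusable workflows and playbooks

@dataclass
class MemoryEntry:
    lane: Lane
    kind: str       # memory type: rule, decision, fact, note, skill
    content: str
    created_at: datetime = field(
        default_factory=lambda: datetime.now(timezone.utc)
    )

# A durable preference lands in the semantic lane as a "rule" type.
entry = MemoryEntry(Lane.SEMANTIC, "rule", "Always run tests before merging.")
```

Keeping the lane and the memory type as separate fields is what makes segmentation a second axis on top of types, rather than a replacement for them.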

The Four Memory Lanes

1) Session Memory (working set)

Use for the current task: recent turns, active goals, and boundary checkpoints.

Properties:

  • high read frequency during an active task
  • bounded size (subject to compaction)
  • explicitly checkpointed before destructive boundaries

2) Semantic Memory (durable truths)

Use for stable, reusable truth: standards, constraints, architecture decisions, durable preferences.

Properties:

  • concise and current
  • overwrite/supersede when truth changes
  • injected consistently in retrieval

3) Episodic Memory (timeline fidelity)

Use for history: what happened, when it happened, and task boundary snapshots.

Properties:

  • append-oriented
  • optimized for chronology and handoffs
  • can preserve raw high-fidelity slices when needed

4) Procedural Memory (reusable workflows)

Use for repeatable execution patterns: checklists, runbooks, escalation flows, and successful routines.

Properties:

  • intent-matched retrieval
  • promoted from successful repeated episodes
  • improves first-action quality on recurring work

Compaction and Lifecycle Triggers

Compaction compresses session context so work can continue inside finite context windows. Good compaction is checkpoint-first, not loss-first.

| Trigger | When it fires | Typical mechanism |
| --- | --- | --- |
| Count-based | Token/turn budget near limit | Checkpoint + compact |
| Time-based | Session inactive past threshold | Background compaction worker |
| Event-based | Task boundary (reset/handoff/complete) | Boundary snapshot + close/rollover |

Reference flow:

  1. Retrieve context with session hints (token/turn/inactivity state).
  2. If compaction is required, write a checkpoint first.
  3. Compact session context.
  4. Continue with refreshed context.
  5. On boundary events, persist a snapshot.
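The reference flow above can be sketched as a checkpoint-first compaction step. Everything here is illustrative: the `summarize` callback and the list-based stores stand in for whatever summarizer and storage your stack uses.

```python
def compact_with_checkpoint(session: list, episodic: list, summarize) -> list:
    """Checkpoint-first compaction: persist before discarding detail."""
    # Step 2: write a checkpoint first, capturing the full working set.
    checkpoint = {"turns": list(session), "summary": summarize(session)}
    episodic.append(checkpoint)
    # Step 3: compact the session context.
    session.clear()
    # Step 4: continue with refreshed (summarized) context.
    session.append(checkpoint["summary"])
    return session
```

Because the episodic write happens before `session.clear()`, a failure mid-compaction loses compression, not history, which is the point of "checkpoint-first, not loss-first".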

Routing Guide: What Goes Where

| Memory content | Primary lane | Why |
| --- | --- | --- |
| Active task steps | Session | Immediate continuity |
| Stable rule or preference | Semantic | Durable and always relevant |
| Timeline event/milestone | Episodic | Chronological audit and handoff fidelity |
| Repeatable checklist/runbook | Procedural | Reuse on intent match |
| Contradictory old truth | Semantic (superseded) | Preserve one current truth |
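The routing table above reduces to a lookup. The content-kind keys below are hypothetical labels, not a fixed taxonomy; real systems usually classify with heuristics or a model rather than exact strings.

```python
def route(content_kind: str) -> str:
    """Map a content kind to its primary memory lane (illustrative)."""
    routes = {
        "active_task_step": "session",     # immediate continuity
        "stable_rule": "semantic",         # durable, always relevant
        "timeline_event": "episodic",      # chronology and handoffs
        "runbook": "procedural",           # reuse on intent match
        "superseded_truth": "semantic",    # kept, but marked superseded
    }
    return routes.get(content_kind, "session")  # default to the working set
```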

Update and Consolidation Rules

Segmentation only works if memory can evolve.

  • Filter before writing: not every turn is memory-worthy.
  • Consolidate duplicates: merge equivalent entries.
  • Overwrite/supersede stale semantic truth: avoid stacking contradictions.
  • Promote repeated successful patterns into procedural memory.

Agent-Agnostic Implementation Pattern

You can implement this model with many stacks (database-first, file-first, hybrid). The core contract is the same:

  1. Define lane boundaries (session, semantic, episodic, procedural).
  2. Define write triggers (count/time/event/explicit user intent).
  3. Define retrieval policy (which lanes are always injected vs query-driven).
  4. Define consolidation policy (merge, supersede, conflict handling).
  5. Define observability (compaction rate, contradiction trend, retrieval quality).
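Step 3 of the contract, the retrieval policy, is often just a declarative table. A sketch with hypothetical policy names (`always`, `on_query`, `on_intent`):

```python
RETRIEVAL_POLICY = {
    "session":    "always",     # working set is always in context
    "semantic":   "always",     # durable truths are injected consistently
    "episodic":   "on_query",   # pulled in for history/handoff questions
    "procedural": "on_intent",  # matched against the current task intent
}

def lanes_to_inject(query_driven: bool, intent_matched: bool) -> list[str]:
    """Decide which lanes enter the prompt for this call."""
    selected = []
    for lane, policy in RETRIEVAL_POLICY.items():
        if (policy == "always"
                or (policy == "on_query" and query_driven)
                or (policy == "on_intent" and intent_matched)):
            selected.append(lane)
    return selected
```

Making the policy data rather than code is what keeps the contract portable across CLI tools, MCP clients, and SDK-based apps.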

Optional File-Mode Mapping

If you prefer human-readable files (for git workflows), map the same lanes to files:

# semantic
memory.md

# episodic daily timeline
memory/daily/YYYY-MM-DD.md

# boundary snapshots
memory/snapshots/YYYY-MM-DD/<slug>.md
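The file-mode mapping above can be generated with two small path builders, so every writer agrees on layout. A sketch; the function names and the `memory/` base are taken from the mapping, everything else is illustrative:

```python
from datetime import date
from pathlib import Path

def episodic_path(day: date, base: Path = Path("memory")) -> Path:
    """Daily timeline file: memory/daily/YYYY-MM-DD.md"""
    return base / "daily" / f"{day.isoformat()}.md"

def snapshot_path(day: date, slug: str, base: Path = Path("memory")) -> Path:
    """Boundary snapshot: memory/snapshots/YYYY-MM-DD/<slug>.md"""
    return base / "snapshots" / day.isoformat() / f"{slug}.md"
```

Date-keyed filenames keep the episodic lane append-oriented and diff-friendly in git, while one file per snapshot slug keeps boundary events independently reviewable.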

Anti-patterns

  • Storing full raw transcripts in semantic memory
  • Treating all memory as one flat list
  • Letting conflicting semantic entries stack without consolidation
  • Skipping checkpoint writes before compaction
  • Using only one trigger type (count/time/event) for all workloads
