Four explicit banks.
Procedural (always-on guidance), working (current session), recent episodic, durable semantic. Each bank has its own budget, its own retrieval rules, its own freshness contract.
The pipeline classifies it as trivial, active follow-up, recall, or substantive default. Each class has a different memory budget.
Stable guidance, operator preferences, agent directives, manager feedback, loads from curated files without ranking. Always small, always present.
Session state plus the checkpoint summary from prior compaction. This is the in-flight conversation's compact form.
Recent episodic chunks query the episodic bank; durable semantic memory uses vector-first retrieval, then reranks by recency, entity intent, purpose, and quality.
Selected sections inject into the prompt under a hard budget. A per-turn memory trace records what was retrieved and why, attached to the debug snapshot.
Episodic chunks distill into durable knowledge. The bank gets cleaner over time, not noisier.