Capability · Knowledge layer

Five-layer context budget. Four-stage compaction. The model never runs out of room.

Every prompt is assembled from a fixed budget: system prompt, tool schemas, memory, conversation, output headroom. As conversations grow, four stages of compaction kick in: flush extraction, microcompaction, LLM summarization, and a death-spiral fallback. Identity always survives truncation.

5 · Budget layers
4 · Compaction stages
60% · Flush threshold
Last 10 · Messages persisted
What it actually does

The parts that make this work.

Five layers, fixed budget.

System prompt, tool schemas, memory block, conversation, output headroom. Each layer competes for room, but identity always survives.
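
To make the five layers concrete, here's a minimal Python sketch of how they might be declared, with higher priority meaning later truncation. The names and priority values are illustrative, not the shipped defaults.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class Layer:
    name: str
    priority: int  # higher = truncated later

LAYERS = [
    Layer("system_prompt", priority=3),  # identity: always truncated last
    Layer("tool_schemas", priority=2),
    Layer("memory", priority=2),
    Layer("conversation", priority=1),   # first to give up room
]
# The fifth layer, output headroom, is reserved off the top
# and never competes with the other four.
```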

Flush extraction at 60%.

Before any compression runs, valuable facts get written to the daily log. Compression can lose detail; the log keeps it retrievable.

Microcompaction trims tool noise.

Verbose tool results get truncated first. Context-cheap and content-cheap: it doesn't lose what the model actually needs.

LLM summarization preserves meaning.

When microcompaction isn't enough, older messages compress through a summarization pass with pre-sanitize and post-scan injection gates.

Death-spiral fallback always succeeds.

If everything else fails, the system resets to summary-plus-last-message. The model always gets room to generate. Graceful degradation, not crashes.

Tool-pair truncation never breaks a call.

tool_use and tool_result blocks stay bonded through every truncation pass. The model never sees half a tool exchange.
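
A minimal sketch of bonded truncation, assuming each message carries a `type` field in the style of Anthropic tool blocks; the real message shape may differ.

```python
def truncate_bonded(messages: list[dict], drop: int) -> list[dict]:
    """Drop the `drop` oldest messages, extending the cut so a
    tool_result never survives without its matching tool_use."""
    cut = min(drop, len(messages))
    while cut < len(messages) and messages[cut].get("type") == "tool_result":
        cut += 1  # its tool_use was just dropped; drop the orphan too
    return messages[cut:]
```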

How it works

The path through context.

01 · Budget computes per turn.

The model's context window minus output headroom is the total budget. Each layer (system, tools, memory, conversation) gets a slice with priority weights.
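
The arithmetic, with illustrative numbers rather than real defaults:

```python
CONTEXT_WINDOW = 200_000   # model context window, in tokens
OUTPUT_HEADROOM = 8_000    # reserved so the model can always generate

total_budget = CONTEXT_WINDOW - OUTPUT_HEADROOM  # 192_000 prompt tokens

weights = {"system": 0.10, "tools": 0.15, "memory": 0.15, "conversation": 0.60}
slices = {name: int(total_budget * w) for name, w in weights.items()}
# {'system': 19200, 'tools': 28800, 'memory': 28800, 'conversation': 115200}
```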

02 · Identity-priority survives first.

If layers compete, system prompt and self-identity sections truncate last. The agent's sense of who-it-is stays intact.
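
A sketch of priority-ordered shrinking with hypothetical layer sizes; the lowest-priority layer gives up room first, so the system prompt is the last thing touched.

```python
def fit_to_budget(layers: list[tuple[str, int, int]], budget: int) -> dict[str, int]:
    """layers: (name, priority, tokens_used). Shrink lowest priority first."""
    used = {name: tokens for name, _, tokens in layers}
    overflow = sum(used.values()) - budget
    for name, _, _ in sorted(layers, key=lambda l: l[1]):  # low priority first
        if overflow <= 0:
            break
        cut = min(used[name], overflow)
        used[name] -= cut  # stand-in for actually dropping content
        overflow -= cut
    return used

# Conversation absorbs the whole 20k overflow; identity is untouched.
fit_to_budget(
    [("system", 3, 20_000), ("memory", 2, 30_000), ("conversation", 1, 120_000)],
    budget=150_000,
)  # {'system': 20000, 'memory': 30000, 'conversation': 100000}
```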

03 · Stage 0: flush extraction.

At 60% budget, the system extracts valuable content (decisions, facts, file paths) and writes it to the daily log. Compression loses detail; the log preserves it.
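
A minimal sketch of the flush pass, assuming a crude keyword heuristic and a local log file; the real extractor is presumably smarter, but the shape is the same: copy before you compress.

```python
import datetime
import re

FLUSH_THRESHOLD = 0.60
# Crude stand-in for "valuable": decisions, facts, file paths.
VALUABLE = re.compile(r"(decided|fact:|/[\w./-]+\.\w+)", re.I)

def maybe_flush(messages: list[str], used_tokens: int, budget: int) -> None:
    if used_tokens < FLUSH_THRESHOLD * budget:
        return
    log_path = f"daily-{datetime.date.today():%Y-%m-%d}.log"
    with open(log_path, "a") as log:
        for message in messages:
            if VALUABLE.search(message):
                log.write(message.strip() + "\n")  # retrievable after compression
```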

04 · Stage 1: microcompaction.

Verbose tool results trim to summaries. Cheap, fast, and the model rarely notices. tool_use/tool_result pairs stay bonded.
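
One way stage 1 can look, assuming flat message dicts and a hypothetical 500-character cap. Because it shrinks content instead of deleting messages, tool pairs stay bonded for free.

```python
MAX_TOOL_RESULT = 500  # illustrative cap, not a real default

def microcompact(messages: list[dict]) -> list[dict]:
    out = []
    for m in messages:
        if m.get("type") == "tool_result" and len(m["content"]) > MAX_TOOL_RESULT:
            kept = m["content"][:MAX_TOOL_RESULT]
            m = {**m, "content": kept + "\n[... truncated by microcompaction]"}
        out.append(m)
    return out
```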

05 · Stage 2: LLM summarization.

Older messages compress through a summarization model call. The summary passes through injection gates so jailbreaks in old messages can't survive into the new context.
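
A sketch of the two gates around the summarizer. The regex is a stand-in for the real pre-sanitize and post-scan filters, and `call_summarizer` is a placeholder for whatever model endpoint does the compression.

```python
import re

SUSPECT = re.compile(r"ignore (all|previous) instructions|system prompt", re.I)

def sanitize(text: str) -> str:
    # Pre-gate: neutralize likely injection phrasing before summarization.
    return SUSPECT.sub("[redacted]", text)

def summarize_older(older: list[str], call_summarizer) -> str:
    prompt = "Summarize this conversation:\n" + "\n".join(sanitize(m) for m in older)
    summary = call_summarizer(prompt)
    # Post-gate: re-scan the output so nothing survives into the new context.
    return sanitize(summary) if SUSPECT.search(summary) else summary
```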

06 · Stage 3: death-spiral fallback.

If summarization still doesn't fit, the system resets to summary-plus-last-message. The model always has room to generate. The trace records that fallback fired.
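
The fallback itself is deliberately small; that's why it always succeeds. Names here are illustrative.

```python
import logging

def death_spiral_fallback(summary: str, messages: list[dict]) -> list[dict]:
    logging.warning("compaction: death-spiral fallback fired")  # trace record
    return [
        {"role": "user", "content": f"[conversation summary]\n{summary}"},
        messages[-1],  # the turn the model still has to answer
    ]
```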

Common questions

Things engineers actually ask.

Why compact at all instead of just running with a long context?

Long contexts cost more, run slower, and are noisier: recall quality in the middle of a long context is worse than in a focused one. Compaction keeps prompts focused without losing facts you'll need later (the daily log handles that).

Source: docs/CONTEXT_MANAGEMENT.md

See it in your workspace.

Closed-beta cohorts are small. Tell us what you'd want this capability to handle for your team.

Request beta access