Capability · Observability layer

A flame graph for agent execution. Failure taxonomy. Replay.

Application monitoring is built around requests, logs, errors, and latency. Agent monitoring needs more. PACKWOLF preserves the decision surface (model intent vs. tool execution vs. handler outcome) so a human can understand why an agent did what it did, not just what happened.

Per-span flame graph · Versioned, diffable prompts · Replayable run events · Keyboard nav (j/k/Enter)

[Screenshot: packwolf.app · Observability]
The activity trace. Every span, every input, every output, every failure category. The full execution graph of an agent run.
What it actually does

The parts that make this work.

Trace, not just log.

Each agent run is a trace: a coherent set of observations grouped by runId / conversationId / sessionId. You see the chain of decisions as one timeline.
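The grouping idea can be sketched in a few lines of Python. Field names like runId and startedAt are illustrative, not PACKWOLF's actual schema:

```python
from collections import defaultdict

def group_into_traces(observations):
    """Group flat observation records into per-run traces,
    ordered by start time, so one run reads as one timeline."""
    traces = defaultdict(list)
    for obs in observations:
        # runId is the grouping key here; conversationId or sessionId
        # could serve the same role at a coarser grain.
        traces[obs["runId"]].append(obs)
    for spans in traces.values():
        spans.sort(key=lambda o: o["startedAt"])
    return dict(traces)

runs = group_into_traces([
    {"runId": "r1", "startedAt": 2, "type": "tool:search"},
    {"runId": "r1", "startedAt": 1, "type": "model:generation"},
])
# runs["r1"] now reads in order: generation first, then the tool span
```

The point is that ordering happens at read time: the store stays flat and append-friendly, while the UI sees a coherent timeline.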

Generation vs. tool observation.

Generations record what the model intended (prompt, tools, output, tool calls). Tool observations record what happened after. Confusing the two ruins debugging; PACKWOLF separates them.

Failure taxonomy, not just errors.

Did the model fail to form the call? Did the gate deny it? Did the handler error? Did the tool return malformed output? Each is a different category, classified at observation time.
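A classifier over those boundaries might look like this sketch (flag and category names are invented for illustration):

```python
def classify_failure(obs):
    """Map an observation (illustrative dict shape) to a failure
    category, instead of collapsing everything into 'error'."""
    if obs.get("malformed_call"):
        return "model:malformed_tool_call"  # model failed to form the call
    if obs.get("gate_denied"):
        return "gate:denied"                # policy gate refused it
    if obs.get("handler_error"):
        return "handler:error"              # handler raised or timed out
    if obs.get("malformed_output"):
        return "tool:malformed_output"      # tool ran, output didn't parse
    return "ok"
```

Because the category is attached at observation time, the trace browser can filter on it later without re-deriving anything.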

Prompt versioning is built in.

Every system prompt is fingerprinted. You can diff the prompt between runs to find the change that broke (or fixed) behavior.
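One plausible way to fingerprint and diff prompts, using a content hash and a standard unified diff (a sketch, not PACKWOLF's actual mechanism):

```python
import difflib
import hashlib

def fingerprint_prompt(system_prompt: str) -> str:
    """Stable content hash: two runs with the same fingerprint
    ran the same prompt text."""
    return hashlib.sha256(system_prompt.encode("utf-8")).hexdigest()[:12]

def diff_prompts(a: str, b: str) -> str:
    """Line-level diff between two prompt versions."""
    return "\n".join(
        difflib.unified_diff(a.splitlines(), b.splitlines(), lineterm=""))
```

Fingerprints make "did the prompt change between these two runs?" a single equality check; the diff answers "what changed?"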

URL-driven state.

Every trace, span, filter, and time range is encoded in the URL. Share a debugging session by sharing the link. No 'click these five things to reproduce.'
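The round-trip is conceptually simple; this sketch uses invented parameter names (trace, span, q, range) rather than PACKWOLF's real URL scheme:

```python
from urllib.parse import parse_qs, urlencode, urlparse

def encode_view(base, trace=None, span=None, q=None, time_range=None):
    """Serialize the current browser view into a shareable URL."""
    params = {k: v for k, v in
              {"trace": trace, "span": span,
               "q": q, "range": time_range}.items()
              if v is not None}
    return f"{base}?{urlencode(params)}"

def decode_view(url):
    """Recover the view state from a shared URL."""
    return {k: v[0] for k, v in parse_qs(urlparse(url).query).items()}
```

If every piece of view state survives the round-trip, a pasted link reproduces the exact debugging session.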

Master-detail with keyboard nav.

j/k to move through traces, Enter to open a span, / to filter, ? for help. Built for the engineer who lives in the trace browser, not just the operator who visits occasionally.

How it works

The path through observability.

  1. Pipeline emits run events.

    Every agent turn writes events to the run_events table: model:generation observations, tool:* observations, errors, denials. Append-only, ordered, replayable.

  2. Audit log captures policy.

    Tool calls also fire audit events: who, what, input, output, duration, approval state. The audit log is the source of truth for compliance.

  3. Cost events fire alongside.

    Each model call and each tool call fires a cost event with token estimates and provider billing. Rolls up per-agent and per-tool.

  4. Trace UI groups observations.

    The browser groups observations by trace ID into a flame graph timeline. Failed agent behavior becomes inspectable, not buried.

  5. Span detail shows the boundary.

    Click a span: input, output, metadata, diagnosis. Find the exact moment the model emitted a malformed tool call, or the gate denied a request, or a handler timed out.

  6. Diff prompts across runs.

    Two trace IDs, side by side. The prompt diff shows what changed: system prompt, tool schemas, memory block, conversation history. Find the regression.
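The emit-and-roll-up path above can be sketched end to end. Class, table, and payload names are illustrative, not PACKWOLF's actual schema:

```python
import itertools

class RunEventLog:
    """Append-only, ordered event log per run; replay is just
    re-reading the sequence in order."""
    def __init__(self):
        self._seq = itertools.count()
        self._events = []

    def append(self, run_id, kind, payload):
        self._events.append({"seq": next(self._seq), "runId": run_id,
                             "kind": kind, "payload": payload})

    def replay(self, run_id):
        """All events for one run, in emission order."""
        return [e for e in self._events if e["runId"] == run_id]

def cost_rollup(events):
    """Sum cost events per agent (illustrative payload shape)."""
    totals = {}
    for e in events:
        if e["kind"] == "cost":
            agent = e["payload"]["agent"]
            totals[agent] = totals.get(agent, 0.0) + e["payload"]["usd"]
    return totals
```

Append-only ordering is what makes replay trivial: nothing is mutated in place, so re-reading the log reconstructs the run exactly.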

Common questions

Things engineers actually ask.

How is this different from standard monitoring tools?

Standard tools center on requests and errors. Agent runtimes have a different failure shape: a model can emit a tool name but no arguments, a gate can deny a call before it runs, a handler can succeed but return malformed output. PACKWOLF observes those boundaries directly.

Source: docs/monitoring.md

See it in your workspace.

Closed-beta cohorts are small. Tell us what you'd want this capability to handle for your team.

Request beta access