Cost events on every call.
Model calls fire token-estimate events with provider billing. Tool calls fire duration + handler-cost events. The trail is real-time, not end-of-month.
Model call → token-estimate event with the provider's per-token rate. Tool call → duration + handler cost (zero for built-ins, configurable for MCP).
Per-agent and per-tool aggregations update in real time. Daily, weekly, monthly windows are queryable directly from the cost dashboard.
When an aggregation crosses a warning threshold, an alert lands in the operator queue. Hard caps are not silent.
Once an agent hits its hard cap, the budget gate (point three of the safety gate) starts denying calls. The agent reports the budget block in the trace.
Operator can raise the cap, allow specific calls, or pause the agent until the next billing window. Each action records to audit.
Compliance reviews can ask 'what did we spend on agent X last quarter?' and get an answer from the audit + cost log together.