Question 1

Are costs estimated or actual?

Accepted Answer

Both. Token counts are exact for Claude (Anthropic tokenizer) and OpenAI (tiktoken). For local LLMs, we use a heuristic calibrated against the model's tokenizer where one is available. Estimates are then reconciled against the actual provider bill: the dashboard shows estimated vs. invoiced spend for the prior month.
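To make the local-LLM case concrete, here is a minimal sketch of a calibrated heuristic and an estimate-vs-invoiced comparison. It assumes a simple characters-per-token ratio fitted on samples with known token counts; the `Price` type, the function names, and the prices in the usage note are hypothetical and not the product's API.

```python
import math
from dataclasses import dataclass


@dataclass(frozen=True)
class Price:
    """Hypothetical per-million-token prices in USD."""
    input_per_mtok: float
    output_per_mtok: float


def calibrate_chars_per_token(samples: list[tuple[str, int]]) -> float:
    """Fit a chars-per-token ratio from (text, true_token_count) pairs."""
    total_chars = sum(len(text) for text, _ in samples)
    total_tokens = sum(count for _, count in samples)
    if total_tokens == 0:
        raise ValueError("calibration set must contain at least one token")
    return total_chars / total_tokens


def estimate_tokens(text: str, chars_per_token: float) -> int:
    """Heuristic token estimate; rounds up to err toward over-counting."""
    return math.ceil(len(text) / chars_per_token)


def estimate_cost(input_tokens: int, output_tokens: int, price: Price) -> float:
    """Estimated cost in USD for one call."""
    total = input_tokens * price.input_per_mtok + output_tokens * price.output_per_mtok
    return total / 1_000_000


def reconciliation_delta(estimated_usd: float, invoiced_usd: float) -> float:
    """Relative gap between the invoice and the estimate (0.05 means 5% under)."""
    return (invoiced_usd - estimated_usd) / invoiced_usd
```

For example, calibrating on `[("abcdefgh", 2), ("abcd", 1)]` gives a ratio of 4 characters per token, so a 12-character prompt is estimated at 3 tokens.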

Question 2

What happens if an agent exceeds its budget mid-task?

Accepted Answer

The budget gate denies further calls. The current call completes if it's already in flight; the next one fails closed. The trace shows the budget-block reason. Operators can raise the cap or grant a one-off bypass with an audit entry.
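The behaviour above can be sketched as a small gate that is checked before each call and charged after it. This is an illustration under stated assumptions, not the product's implementation: `BudgetGate`, `BudgetExceeded`, and the audit-log format are hypothetical names.

```python
from dataclasses import dataclass, field


class BudgetExceeded(Exception):
    """Raised when the gate denies a call; the message is the budget-block reason."""


@dataclass
class BudgetGate:
    cap_usd: float
    spent_usd: float = 0.0
    bypass_calls: int = 0
    audit_log: list[str] = field(default_factory=list)

    def check(self) -> None:
        """Run before dispatching a call. Fails closed once the cap is reached."""
        if self.spent_usd < self.cap_usd:
            return
        if self.bypass_calls > 0:
            self.bypass_calls -= 1
            self.audit_log.append("bypass consumed")
            return
        raise BudgetExceeded(
            f"budget-block: spent {self.spent_usd:.2f} of {self.cap_usd:.2f} USD"
        )

    def record(self, cost_usd: float) -> None:
        """Run after a call completes. An in-flight call is always charged,
        even when it pushes spend past the cap."""
        self.spent_usd += cost_usd

    def raise_cap(self, new_cap_usd: float, operator: str) -> None:
        """Operator action: lift the cap and leave an audit entry."""
        self.audit_log.append(f"{operator} raised cap to {new_cap_usd:.2f} USD")
        self.cap_usd = new_cap_usd

    def grant_bypass(self, operator: str, reason: str) -> None:
        """Operator action: allow exactly one more call, with an audit entry."""
        self.audit_log.append(f"{operator} granted one-off bypass: {reason}")
        self.bypass_calls += 1
```

The check happens only before dispatch, which is why a call that is already running finishes and the denial lands on the next one.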

Question 3

Can budgets be cumulative or just monthly?

Accepted Answer

Per-agent budgets default to monthly with rollover off, so unspent budget does not carry into the next month. The period is configurable: daily, weekly, monthly, or per-task. Pick the one that fits your spend pattern.
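One way to picture the period setting is as a key that decides which bucket a cost event lands in. The sketch below assumes ISO weeks and UTC timestamps; `Period`, `window_key`, and `remaining` are hypothetical names used only for illustration.

```python
from datetime import datetime, timezone
from enum import Enum
from typing import Optional


class Period(Enum):
    DAILY = "daily"
    WEEKLY = "weekly"
    MONTHLY = "monthly"
    PER_TASK = "per-task"


def window_key(period: Period, when: datetime, task_id: Optional[str] = None) -> str:
    """Return the bucket a cost event at `when` belongs to."""
    if period is Period.DAILY:
        return when.strftime("%Y-%m-%d")
    if period is Period.WEEKLY:
        iso = when.isocalendar()  # (ISO year, ISO week, weekday)
        return f"{iso[0]}-W{iso[1]:02d}"
    if period is Period.MONTHLY:
        return when.strftime("%Y-%m")
    if task_id is None:
        raise ValueError("per-task budgets need a task_id")
    return f"task:{task_id}"


def remaining(cap_usd: float, spend_by_window: dict[str, float], key: str) -> float:
    """With rollover off, each window starts from the full cap;
    leftover budget from earlier windows is ignored."""
    return cap_usd - spend_by_window.get(key, 0.0)


when = datetime(2024, 3, 15, tzinfo=timezone.utc)
monthly_key = window_key(Period.MONTHLY, when)
```

With a 100 USD monthly cap and 40 USD spent in February, `remaining` for the March key is still the full 100 USD, because rollover is off.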

Question 4

Do tool budgets count separately from model budgets?

Accepted Answer

Yes. A tool with its own per-call cost (e.g., a paid web-search API) bills against its tool budget. Model calls bill against the agent's model budget. Both can fire alerts independently.
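As a rough illustration of the split, the sketch below keeps one budget for model calls and one per tool, each with its own alert. The 80% alert threshold, the `Budget` and `AgentLedger` classes, and the alert message format are assumptions made for the example.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Budget:
    name: str
    cap_usd: float
    alert_at: float = 0.8  # hypothetical threshold, as a fraction of the cap
    spent_usd: float = 0.0
    alerted: bool = False

    def charge(self, cost_usd: float) -> Optional[str]:
        """Add spend; return an alert message the first time the threshold is crossed."""
        self.spent_usd += cost_usd
        if not self.alerted and self.spent_usd >= self.alert_at * self.cap_usd:
            self.alerted = True
            return f"{self.name}: {self.spent_usd:.2f} of {self.cap_usd:.2f} USD used"
        return None


@dataclass
class AgentLedger:
    model: Budget
    tools: dict[str, Budget]

    def record(self, kind: str, cost_usd: float, tool: Optional[str] = None) -> Optional[str]:
        """Route a cost event to the model budget or to the named tool budget."""
        if kind == "model":
            return self.model.charge(cost_usd)
        if kind == "tool" and tool is not None:
            return self.tools[tool].charge(cost_usd)
        raise ValueError(f"unknown cost event: kind={kind!r}, tool={tool!r}")
```

Charging a paid search tool never touches the model budget here, so either one can alert while the other stays quiet.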

Question 5

How do I see total spend across the workspace?

Accepted Answer

The cost dashboard rolls up spend at the workspace, agent, and tool levels. Filter by time window, provider, or tool type. Export to CSV for finance reconciliation.
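For readers who want to reproduce the roll-up from raw cost events, here is a minimal sketch. The `CostEvent` fields and the CSV column names are hypothetical, and only the provider filter is shown; a time-window filter would work the same way.

```python
import csv
import io
from collections import defaultdict
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class CostEvent:
    agent: str
    provider: str
    tool: Optional[str]  # None for plain model calls
    cost_usd: float


def rollup(events: list[CostEvent], provider: Optional[str] = None):
    """Sum spend at workspace, agent, and tool level, optionally for one provider."""
    workspace = 0.0
    by_agent: dict[str, float] = defaultdict(float)
    by_tool: dict[str, float] = defaultdict(float)
    for event in events:
        if provider is not None and event.provider != provider:
            continue
        workspace += event.cost_usd
        by_agent[event.agent] += event.cost_usd
        if event.tool is not None:
            by_tool[event.tool] += event.cost_usd
    return workspace, dict(by_agent), dict(by_tool)


def to_csv(workspace: float, by_agent: dict[str, float], by_tool: dict[str, float]) -> str:
    """Render the roll-up as CSV text with one row per level and name."""
    buf = io.StringIO()
    writer = csv.writer(buf, lineterminator="\n")
    writer.writerow(["level", "name", "cost_usd"])
    writer.writerow(["workspace", "all", f"{workspace:.4f}"])
    for name in sorted(by_agent):
        writer.writerow(["agent", name, f"{by_agent[name]:.4f}"])
    for name in sorted(by_tool):
        writer.writerow(["tool", name, f"{by_tool[name]:.4f}"])
    return buf.getvalue()
```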

Per-agent and per-tool spend. Threshold alerts. No surprises.

The parts that make this work.

Cost events on every call.

Per-agent budgets.

Per-tool budgets.

Threshold alerts.

Cost trail attaches to traces.

Token counts are exact where the provider's tokenizer is available.

The path through costs & budgets.

Cost event fires per call.

Events roll into budgets.

Threshold alerts surface.

Hard cap denies further calls.

Operator clears the cap.

Cost trail merges with audit.
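The six steps above can be sketched as a single pass over a sequence of calls and operator actions that produces one ordered trail. The 50% and 80% thresholds, the `TrailEntry` type, and the step tuples are assumptions for illustration only.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class TrailEntry:
    seq: int
    kind: str    # "cost", "alert", "deny", or "audit"
    detail: str


def walk_path(steps, cap_usd: float, thresholds=(0.5, 0.8)) -> list[TrailEntry]:
    """Replay steps of the form ("call", cost_usd) or ("raise_cap", new_cap_usd)
    and return the merged cost-and-audit trail."""
    trail: list[TrailEntry] = []
    spent = 0.0
    fired: set[float] = set()
    for seq, (action, value) in enumerate(steps):
        if action == "raise_cap":
            # Operator clears the cap; this lands in the same trail as cost events.
            cap_usd = value
            trail.append(TrailEntry(seq, "audit", f"cap raised to {value:.2f} USD"))
            continue
        if spent >= cap_usd:
            # Hard cap: the call is denied before it is dispatched.
            trail.append(TrailEntry(seq, "deny", "budget-block"))
            continue
        # A cost event fires per call and rolls into the budget.
        spent += value
        trail.append(TrailEntry(seq, "cost", f"{value:.2f} USD"))
        for threshold in thresholds:
            if threshold not in fired and spent >= threshold * cap_usd:
                fired.add(threshold)
                trail.append(TrailEntry(seq, "alert", f"{threshold:.0%} of cap reached"))
    return trail
```

In this sketch a threshold fires once and is not re-armed after the cap is raised; whether alerts should re-arm is a design choice the sketch leaves open.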
