Question 1

How often should heartbeats run?

Accepted Answer

Configurable per agent. Common patterns: every 15 minutes for active workflows, hourly for monitoring agents, daily for review agents. Cost is dominated by execution, not assessment, so a quiet queue is cheap.

Question 2

What stops a heartbeat from running away?

Accepted Answer

Each beat is one pipeline turn, assessment plus a single execution. Multi-turn tool sequences within that execution still respect tool-loop detection and budget caps. A runaway can't span beats.

Question 3

What's the difference between a heartbeat and a chat turn?

Accepted Answer

Heartbeats start with an assessment over a queue; chat turns start with an operator message. After the start, the same pipeline runs. Same memory, same tools, same trace.

Question 4

Can I see why a heartbeat skipped?

Accepted Answer

Yes. The ledger captures the assessment payload, the queue snapshot, the priority scores, the model's reasoning, and the guardrail override (if any). Skips are inspectable, not silent.

Question 5

What happens to a heartbeat that fails mid-execution?

Accepted Answer

The checkpoint captures its state. The recovery API lets the operator resume from the checkpoint, accept partial side effects as final, or fail and discard. Nothing dangles in an undefined state.

Question 6

Can heartbeats coordinate across agents?

Accepted Answer

Yes, through the workflow + A2A surfaces. A heartbeat in one agent can hand off to another via agent-to-agent messages, and the receiver's next beat picks it up.

Assess. Then execute. Recoverable, inspectable, on schedule.

The parts that make this work.

Two phases, not one.

Priority scoring is deterministic.

Anti-skip guardrail.

Recoverable on failure.

Every beat has a ledger.

Schedules survive restarts.

The path through heartbeats.

Schedule fires.

Assessment phase.

Guardrail checks.

Execution phase.

Ledger captures everything.

Run is replayable.

Things engineers actually ask.

See it in your workspace.