Capability · Execution layer

Native SDK per provider. Switch per agent, per task, or globally.

PACKWOLF talks to model providers through native SDKs, not a single OpenAI-shaped wrapper. Claude keeps prompt caching and extended thinking. OpenAI keeps vision and function calling. Local LLMs go through the priority queue. The in-house model is included on Basic, Pro, and Max plans.

5 · Provider types
Native · SDK per provider
3-failure · Health threshold
60s · Cooldown
packwolf.app · Multi-provider
AI Models settings. Configure providers, set defaults per agent, test connections, watch health.
What it actually does

The parts that make this work.

Claude keeps Claude.

Native @anthropic-ai/sdk. Prompt caching, extended thinking, the full tool surface. Not flattened into a generic chat-completions shape.
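As an illustration of what "not flattened" means, here is the shape of a prompt-cached request in the native Anthropic Messages format. This is a payload sketch only (the model id and prompt text are placeholders); the point is the `cache_control` marker on the system block, a field a generic chat-completions wrapper has nowhere to put:

```typescript
// Sketch of a native Anthropic Messages payload (placeholder model id and text).
// The stable system preamble is marked ephemeral so the provider can cache it
// across calls; an OpenAI-shaped wrapper would drop this field entirely.
const request = {
  model: "claude-sonnet-4-20250514", // placeholder model id
  max_tokens: 1024,
  system: [
    {
      type: "text",
      text: "You are Agent A of the pack...", // long shared preamble, cached
      cache_control: { type: "ephemeral" },   // native prompt-caching marker
    },
  ],
  messages: [{ role: "user", content: "Summarize the latest trace." }],
};
```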

OpenAI keeps OpenAI.

Native openai SDK. Vision, function calling, and structured output stay first-class. Every feature of the API surface maps through directly.

Local LLMs are first-class.

Ollama and LM Studio through an OpenAI-compatible adapter, but with the priority queue and model affinity guards on top. Air-gappable.

MCP servers extend the surface.

Model Context Protocol servers add tools. Some providers (Anthropic) consume MCP directly. PACKWOLF normalizes the rest.

Health monitor + cooldown.

Three failures within the window → a 60-second provider cooldown. Automatic failover to a backup provider if one is configured.
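The failure-window logic above can be sketched in a few lines. This is a minimal illustration of the described behavior, not PACKWOLF's implementation; the class and method names are hypothetical:

```typescript
// Minimal sketch: three failures inside a sliding window trip a 60s cooldown.
// Names (HealthMonitor, recordFailure, healthy) are illustrative only.
class HealthMonitor {
  private failures: number[] = []; // timestamps (ms) of recent failures
  private cooldownUntil = 0;

  constructor(
    private threshold = 3,       // failures that trip the breaker
    private windowMs = 60_000,   // sliding window for counting failures
    private cooldownMs = 60_000, // provider sits out for 60 seconds
  ) {}

  recordFailure(now = Date.now()): void {
    // Keep only failures still inside the window, then add this one.
    this.failures = this.failures.filter((t) => now - t < this.windowMs);
    this.failures.push(now);
    if (this.failures.length >= this.threshold) {
      this.cooldownUntil = now + this.cooldownMs; // trip: start cooldown
      this.failures = [];
    }
  }

  recordSuccess(): void {
    this.failures = []; // a success clears the failure streak
  }

  healthy(now = Date.now()): boolean {
    return now >= this.cooldownUntil;
  }
}
```

Injecting the clock (`now` parameters) keeps the breaker deterministic and testable; real adapters would call it with the wall clock.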

Per-agent provider routing.

Agent A uses Claude; Agent B uses OpenAI; Agent C uses the in-house model with local fallback. The pack mixes providers per role.
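A routing table for that Agent A / B / C example could look like the sketch below. The field names and the `resolveProvider` helper are assumptions for illustration, not PACKWOLF's actual config schema:

```typescript
// Hypothetical per-agent routing table; field names are illustrative.
type ProviderType = "claude" | "openai" | "local" | "in-house" | "mcp";

interface AgentRoute {
  primary: ProviderType;
  fallback?: ProviderType; // used when the primary is in cooldown
}

const routes: Record<string, AgentRoute> = {
  "agent-a": { primary: "claude" },
  "agent-b": { primary: "openai" },
  "agent-c": { primary: "in-house", fallback: "local" },
};

// Resolve which provider a given agent should call right now,
// given a health check (e.g. backed by the cooldown monitor).
function resolveProvider(
  agent: string,
  isHealthy: (p: ProviderType) => boolean,
): ProviderType {
  const route = routes[agent];
  if (isHealthy(route.primary)) return route.primary;
  if (route.fallback && isHealthy(route.fallback)) return route.fallback;
  return route.primary; // nothing healthy: use primary and surface the error
}
```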

How it works

The path through multi-provider.

  1. Provider config picks the SDK.

    Each agent has a provider (or several in priority order). The provider type (claude / openai / local / in-house / mcp) picks which native adapter to load.

  2. Native call goes out.

    The adapter formats the call in that provider's exact shape. No translation layer flattening features. Claude's thinking blocks stay thinking blocks.

  3. Health monitor watches.

    Each adapter records success/failure events. Three failures in the window → cooldown. The trace shows which provider was used and whether it was healthy.

  4. Failover (if configured).

    If the primary is in cooldown, the request routes to the backup provider (say, Claude as the fallback for an unavailable local model). The cost event tags which provider actually ran.

  5. Response normalizes upward.

    Provider-specific outputs (Claude tool_use blocks, OpenAI tool_calls, local content blocks) normalize into PACKWOLF's internal shape for memory writes and audit logs. The provider's distinctive features still surface where they matter.
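The path ends in that normalization step. A minimal sketch of folding Claude `tool_use` blocks and OpenAI `tool_calls` into one internal record follows; the `ToolCall` shape and function names are assumptions for illustration, not PACKWOLF's actual schema. The provider types are real: Claude tool calls arrive as content blocks with a structured `input` object, while OpenAI tool calls carry their arguments as a JSON string.

```typescript
// Assumed internal shape for a normalized tool invocation (illustrative).
interface ToolCall {
  name: string;
  args: Record<string, unknown>;
}

// Claude responses carry tool_use content blocks with structured input.
type ClaudeBlock =
  | { type: "text"; text: string }
  | { type: "tool_use"; name: string; input: Record<string, unknown> };

// OpenAI responses carry tool_calls whose arguments are a JSON string.
interface OpenAIToolCall {
  function: { name: string; arguments: string };
}

function fromClaude(blocks: ClaudeBlock[]): ToolCall[] {
  return blocks
    .filter((b): b is Extract<ClaudeBlock, { type: "tool_use" }> => b.type === "tool_use")
    .map((b) => ({ name: b.name, args: b.input }));
}

function fromOpenAI(calls: OpenAIToolCall[]): ToolCall[] {
  return calls.map((c) => ({
    name: c.function.name,
    args: JSON.parse(c.function.arguments), // decode the JSON-string arguments
  }));
}
```

Either path yields the same `ToolCall[]`, so memory writes and audit logs stay provider-agnostic while the raw response keeps its native shape upstream.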

Common questions

Things engineers actually ask.

Why not one OpenAI-compatible wrapper for everything?

OpenAI-compatible wrappers flatten provider features. Claude's prompt caching and extended thinking, OpenAI's structured output, differences in vision handling: all of it loses fidelity. We pay the cost of native adapters because the lost features cost more downstream.

Source: docs/CLAUDE.md (Provider System)

See it in your workspace.

Closed-beta cohorts are small. Tell us what you'd want this capability to handle for your team.

Request beta access