Which models can I use as fallbacks?

Edgee hosts Gemma 4 26B, GLM-5, Qwen3 Coder 480B, Kimi K2.5, MiniMax M2.5, and Qwen3 Coder Next out of the box. You can also bring your own keys for any provider in our catalog (OpenAI, Anthropic, Mistral, DeepSeek, xAI, and more) or connect AWS Bedrock, Google Vertex AI, and Azure OpenAI for private-cloud fallback.

Does my Claude Code setup change when fallback activates?

No. Fallback is transparent. Claude Code receives a valid response just as if Anthropic had answered directly. The only visible difference: a brief indicator in your terminal telling you which model handled the request.

What happens if all fallback models also fail?

Edgee retries up to the end of your configured chain. If every fallback returns an error, we surface the original error to your client. You can configure as many fallback steps as your use case needs.

How is fallback usage billed?

Fallback models are billed at their individual per-token rate, which is typically lower than Anthropic's. You see fallback usage tracked separately in your dashboard so you always know the split.

Can I fall back to my own cloud account?

Yes. Connect your AWS Bedrock, Google Vertex AI, or Azure OpenAI credentials in the BYOK settings. Fallback traffic routes through your cloud account so data never leaves your trust boundary.

Is fallback included on the Free plan?

Fallback Models and rerouting are only available on the Team plan and above. The Free plan includes token compression and auto-retry on transient errors, but not cross-provider model fallback.

Fallback Models

Claude Code that never stops, on any model

Name: Edgee AI Gateway
Author: Edgee

When Anthropic goes down, your plan limit hits, or new credit policies kick in, Edgee keeps your Claude Code session running by routing to a fallback model automatically. Same prompt, same flow, zero code changes.

Start free See how it works

Works with Claude Code, Codex, OpenCode. Team plan required for full fallback.

edgee

Backed by founders and leaders of

Three failure scenarios every Claude Code user has experienced

You cannot ship when your model stops responding. Here is what happens in practice.

Anthropic outage mid-task

You are deep in a refactor and Claude Code stops responding. Status page says degraded. Your flow is broken, deadline looms.

Weekly plan limit reached

You hit your Opus cap on Tuesday. Four days of Sonnet or nothing until the reset. Your sprint plan does not care about the calendar.

Credit policy changes

Anthropic is moving to credit-based billing starting June 15, 2026. This shifts usage limits and introduces new quota mechanics. Teams will need a Plan B.

How it works in your dashboard

Configure once. Failover automatically.

In your Edgee dashboard, you set a priority-ordered model chain. Edgee handles the rest.

Fallback on outage

When the primary model returns a 429 or 5xx, Edgee instantly retries your request through the next configured fallback in the chain. Your Claude Code session keeps running.

Fallback on rate limit

Hit a weekly plan cap or a hard rate limit? Edgee detects the exhausted quota and transparently routes to a fallback model that is available and fast.

Always-on smart routing

Reroute models lets you always send requests to a specific model, regardless of what the client asked for. Use it for cost optimization or standardizing on a single provider fleet-wide.

Claude Code request
alex-chen · implement-feature.ts

Primary: Claude Opus
Weekly limit reached — 0 tokens remaining

Fallback 1: Mistral Large
Routed · 312ms to first token

Coding continues
Transparent to the dev

Supported models

6 Edgee-hosted models, ready to fall back

All available out of the box. No API keys needed. New models added regularly.

Gemma 4 26B

Google

Provided by Edgee

GLM-5

ZAI

Provided by Edgee

Qwen3 Coder 480B

Qwen

Provided by Edgee

Kimi K2.5

Moonshot AI

Provided by Edgee

MiniMax M2.5

MiniMax

Provided by Edgee

Qwen3 Coder Next

Qwen

Provided by Edgee

Plus your BYOK models

OpenAI, Anthropic, Mistral, DeepSeek, xAI, and more

Bring Your Own Cloud

One-click fallback to Bedrock, Vertex, or Azure

Same Claude Code setup. Zero code changes. Your cloud account, your data. You paste credentials once. Edgee routes your fallback traffic through your own cloud provider with the same latency and transparency as our hosted models.

AWS Bedrock credentials keyed by region
Google Vertex AI service account JSON
Azure OpenAI endpoint + API key
Credentials tested live before going active

Read the BYOK docs

AWS Bedrock

Multi-region credentials via access key. Edgee routes to Bedrock models in your AWS account.

Google Vertex AI

Service account JSON from Google Cloud Console. One paste, Edgee resolves OAuth2 tokens at runtime.

Azure OpenAI

Endpoint URL plus API key. Edgee derives the models endpoint from your deployment automatically.

How it works

Three steps. No config files. No proxy setup. No code changes.

1
Claude Code calls Anthropic through Edgee
Install the Edgee CLI once. Claude Code sends requests through the Edgee Agent Gateway instead of calling Anthropic directly. No code changes.
2
Edgee detects failure or threshold
If Anthropic returns a 429, 5xx, or your plan cap triggers, Edgee flags the request within milliseconds without showing an error to the user.
3
Edgee routes to your configured fallback
You set a priority-ordered model chain in the dashboard. Edgee routes the same request to the next available model. Same Claude Code session, zero interruption.

Read the technical deep dive

Claude Code requests Claude Opus

Anthropic returns 429

Rate limit reached

Edgee routes to GLM-5

Fallback model from your chain

Why this matters now

Anthropic credit-based billing shift takes effect June 15, 2026. This introduces credit quotas, new rate-limiting mechanics, and a different relationship between your spend and your access. If you depend on Claude Code daily, you need a Plan B that works without touching your workflow.

Edgee Fallback Models is not anti-Anthropic. It is a rational layer of resilience. We built this because we use Claude Code ourselves and could not afford downtime.

Without fallback vs with Edgee

What a Claude Code outage looks like before and after.

Comparison of Claude Code with and without Edgee Fallback Models
	Claude Code alone	Claude Code + Edgee Fallback
Downtime handling	Manual restart	Automatic fallback within ~300ms
Rate limit recovery	Wait for reset	Instant failover to next model
Model choice	One provider only	6+ Edgee-hosted models + BYOK
Setup time	—	< 2 minutes in dashboard
Fallback cost visibility	None	Tracked separately, billed at lower rates

Pricing

Fallback Models is included in Team

Automatic fallback and rerouting is a Team plan feature.

Team plan

For teams that cannot afford to stop coding when a provider goes down.

$29

per developer / month

Everything in Free, plus:

Fallback Models

Keep coding when a provider fails or rate-limits

Reroute Models

Forced routing to specific models to save costs

Team observability: manage developers and squads usage

Control team-wide usage, coding agent access, and squad organization

Spending cap per seat

Prevent unexpected costs and get alerts when you're close to your limit

GitHub integration (per repo / per PR attribution)

Attribution helps you track usage and costs

Upgrade to Team See all plans

Never lose your coding flow again.

Install in 30 seconds. Works with your existing Claude Code setup. No credit card.

Start free View pricing

Questions devs actually ask

Part of the Edgee Agent Gateway

Routing is one of three pillars.

Edgee also compresses tokens at the edge (up to 50% savings) and observes every session at team level with per-repo attribution.

See the full Agent Gateway →

Claude Code that never stops, on any model

Backed by founders and leaders of

Three failure scenarios every Claude Code user has experienced

Anthropic outage mid-task

Weekly plan limit reached

Credit policy changes

Configure once. Failover automatically.

Fallback on outage

Fallback on rate limit

Always-on smart routing

6 Edgee-hosted models, ready to fall back

One-click fallback to Bedrock, Vertex, or Azure

AWS Bedrock

Google Vertex AI

Azure OpenAI

How it works

Claude Code calls Anthropic through Edgee

Edgee detects failure or threshold

Edgee routes to your configured fallback

Why this matters now

Without fallback vs with Edgee

Fallback Models is included in Team

Team plan

Never lose your coding flow again.

Questions devs actually ask

Which models can I use as fallbacks?

Does my Claude Code setup change when fallback activates?

What happens if all fallback models also fail?

How is fallback usage billed?

Can I fall back to my own cloud account?

Is fallback included on the Free plan?

Routing is one of three pillars.