Directing each request to the right LLM provider at the right time, with automatic fallback when something goes wrong. Per-request retry and provider fallback is one of two routing techniques in the Edgee Agent Gateway; Plan-cap continuity is the other. When a provider request fails, Edgee automatically retries and falls back to the next available provider — transparently, without any changes to your code.Documentation Index
Fetch the complete documentation index at: https://www.edgee.ai/docs/llms.txt
Use this file to discover all available pages before exploring further.
How it works
Every request goes through an ordered list of providers. Edgee tries each one in sequence, retrying transient failures before moving on. If all providers are exhausted without success, the error from the last attempt is returned to the caller.Provider ordering
Fallback order is determined automatically by each provider’s success rate, computed from recent request history. Providers with higher success rates are tried first. When multiple providers have the same score, they are shuffled randomly for load distribution. If you use BYOK keys, only your own provider keys are eligible — Edgee’s shared providers are not used as fallbacks. If no BYOK key is available for a model, shared providers are used instead.Retry behavior
Edgee distinguishes three categories of error:| Category | Errors | Behavior |
|---|---|---|
| Retry then fallback | Rate limit (429), Service unavailable (5xx) | Retry the same provider once, then fall back |
| Immediate fallback | Timeout (408, 504), Credential not found, Stream parse error | Skip retry, move to next provider immediately |
| Terminal | Invalid token (401), Configuration error | Return error immediately — no retry, no fallback |