
Retry and Fallback: How Edgee Keeps Your LLM Requests Reliable
LLM providers go down, hit rate limits, and time out. Here's how Edgee handles retries, fallbacks, and provider scoring to keep your requests succeeding, transparently.
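The retry-then-fallback pattern the summary describes can be sketched in a few lines. This is an illustrative sketch of the general technique, not Edgee's actual implementation or API: the function and exception names here are hypothetical, and provider scoring is reduced to a fixed preference order.

```python
import time
import random


class TransientError(Exception):
    """Hypothetical marker for retryable failures: rate limits, timeouts, 5xx."""


def call_with_retry_and_fallback(providers, prompt, max_retries=3, base_delay=0.5):
    """Try each provider in preference order. Retry transient failures
    with exponential backoff, then fall back to the next provider."""
    last_error = None
    for provider in providers:
        for attempt in range(max_retries):
            try:
                return provider(prompt)
            except TransientError as err:
                last_error = err
                # Exponential backoff with a little jitter before retrying.
                time.sleep(base_delay * (2 ** attempt) + random.uniform(0, 0.1))
        # Retries exhausted for this provider: fall back to the next one.
    raise last_error
```

A caller sees a single request that simply succeeds, even when the first provider is failing:

```python
def flaky_provider(prompt):
    raise TransientError("429 rate limited")

def stable_provider(prompt):
    return "completion"

result = call_with_retry_and_fallback(
    [flaky_provider, stable_provider], "hello", base_delay=0
)
```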

