Token Compression Gateway for your agents
Edgee compresses prompts before they reach LLM providers.
Same code, fewer tokens, lower bills.
How to use Edgee
Whether you’re using a coding agent or building an app, Edgee compresses your LLM traffic in minutes.
For coding agents
Start saving tokens in 1 minute
Install Edgee CLI and connect it to your coding agent. No code changes required.
- No code changes: works as a transparent proxy for your agent
- Instant savings: token compression kicks in on the first request
- Works with any agent: Claude Code, Codex, Cursor and more
Configure your coding agent
Connect Edgee to your AI coding assistant and start saving tokens in 1 minute.
Why Edgee AI Gateway?
An edge intelligence layer for your AI traffic
Edgee sits between your application and LLM providers behind a single OpenAI-compatible API. It adds edge-level intelligence, including token compression, routing policies, cost controls, private models, and tools, so you can ship AI features faster and with confidence.
Token compression
Reduce prompt size without losing intent to lower costs and latency, especially for long contexts, RAG pipelines, and multi-turn agents.
Learn moreEdge Tools
Invoke shared tools managed by Edgee, or deploy your own private tools at the edge, closer to users and providers for lower latency and tighter control.
Learn moreBring Your Own Keys
Use Edgee’s keys for convenience, or plug in your own provider keys for billing control and custom models.
Learn moreObservability
Monitor latency, errors, usage, and cost per model, per app, and per environment.
Learn morePrivate Models
Deploy serverless open-source LLMs on demand, where you need them, and expose them through the same gateway API alongside public providers.
Learn moreThe vision behind Edgee
Every technological shift creates a new foundation: the web had bandwidth, the cloud had compute, and AI has tokens. In a world powered by models, intelligence has a cost: tokens flow through every interaction, decision, and response.
At Edgee, we believe intelligence should move efficiently, closer to users, intent, and action. It should be compressed, routed, and optimized so decisions happen instantly. Hear from Sacha, Edgee’s co-founder, on how AI scales by mastering how intelligence moves.