Route models from OpenAI, Anthropic, Gemini, DeepSeek, Kimi, Qwen, and more (200+ in all) through one OpenAI-compatible endpoint. We buy tokens at wholesale and pass the price through. Five-nines uptime backed by automatic provider failover.
flatkey speaks the OpenAI Chat Completions API. Use any OpenAI SDK — just point its base_url at us.
# works with the official openai SDK by changing base_url
curl https://api.flatkey.ai/v1/chat/completions \
  -H "Authorization: Bearer $FLATKEY_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "claude-sonnet-4-6",
    "messages": [{"role":"user","content":"Hello"}]
  }'
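The same request can be assembled in Python. This is a minimal sketch using only the standard library. The helper name `build_chat_request` is illustrative and not part of any flatkey SDK. The endpoint, model name, and `FLATKEY_KEY` variable are taken from the curl example above. The sketch builds the request without sending it, so it runs offline.

```python
import json
import os
import urllib.request

BASE_URL = "https://api.flatkey.ai/v1"  # endpoint from the curl example


def build_chat_request(api_key: str, model: str, prompt: str) -> urllib.request.Request:
    """Assemble (but do not send) a Chat Completions request."""
    payload = {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
    }
    return urllib.request.Request(
        url=f"{BASE_URL}/chat/completions",
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


if __name__ == "__main__":
    key = os.environ.get("FLATKEY_KEY", "")
    req = build_chat_request(key, "claude-sonnet-4-6", "Hello")
    # Sending is left commented out so the sketch runs without network access:
    # with urllib.request.urlopen(req) as resp:
    #     print(json.load(resp))
    print(req.get_method(), req.full_url)
```

If you already use the official openai Python SDK, the equivalent change is to pass `base_url="https://api.flatkey.ai/v1"` and your flatkey key when constructing the client.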
We negotiate volume contracts with every upstream and pass the price through. You pay the same rate every month.
Prices per 1M tokens (input / output). Rates updated weekly. See full price list →
We negotiate bulk contracts with every upstream and pass through cost plus a thin margin. No reseller markup. No surge windows. The price you sign up at is the price you pay next month.
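Because prices are quoted per 1M tokens with separate input and output rates, the cost of a single request is simple arithmetic. The sketch below shows the calculation. The rates in it are hypothetical placeholders chosen for illustration, not flatkey's price list, and `request_cost` is an illustrative helper name.

```python
from decimal import Decimal

TOKENS_PER_UNIT = Decimal(1_000_000)  # prices are quoted per 1M tokens


def request_cost(input_tokens: int, output_tokens: int,
                 input_price: Decimal, output_price: Decimal) -> Decimal:
    """Cost in dollars of one request, given per-1M-token prices."""
    total = Decimal(input_tokens) * input_price + Decimal(output_tokens) * output_price
    return total / TOKENS_PER_UNIT


# Hypothetical rates for illustration only, NOT flatkey's actual prices.
example_input_price = Decimal("3.00")    # dollars per 1M input tokens
example_output_price = Decimal("15.00")  # dollars per 1M output tokens

cost = request_cost(2_000, 500, example_input_price, example_output_price)
print(f"${cost}")
```

At those placeholder rates, a request with 2,000 input tokens and 500 output tokens costs (2,000 × 3.00 + 500 × 15.00) / 1,000,000 = $0.0135. `Decimal` is used instead of floats so that small per-request amounts add up without rounding drift.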
When OpenAI 503s or Anthropic rate-limits you, we route the same request through a backup provider automatically. Your app sees one endpoint. We handle the rest.
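To make the failover idea concrete, here is a minimal sketch of the general pattern: try providers in order and move on when one returns a 503 or a 429. It illustrates the technique only and is not flatkey's actual routing code. `UpstreamError`, `complete_with_failover`, and the two stand-in providers are hypothetical names.

```python
from typing import Callable, Optional, Sequence

RETRYABLE_STATUSES = {429, 503}  # rate limited / service unavailable


class UpstreamError(Exception):
    """Raised by a provider call that failed with an HTTP status."""

    def __init__(self, status: int) -> None:
        super().__init__(f"upstream returned {status}")
        self.status = status


def complete_with_failover(
    providers: Sequence[Callable[[str], str]], prompt: str
) -> str:
    """Try each provider in order; fall through on retryable errors."""
    last_error: Optional[UpstreamError] = None
    for call in providers:
        try:
            return call(prompt)
        except UpstreamError as err:
            if err.status not in RETRYABLE_STATUSES:
                raise  # e.g. a 400 is a bad request; another provider won't help
            last_error = err
    raise RuntimeError("all providers failed") from last_error


# Stand-in providers for the demo (no network calls are made).
def primary(prompt: str) -> str:
    raise UpstreamError(503)  # simulate an upstream outage


def backup(prompt: str) -> str:
    return f"backup answered: {prompt}"


print(complete_with_failover([primary, backup], "Hello"))  # backup answered: Hello
```

The point of doing this behind the endpoint is that your application never has to carry this loop itself.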
OpenAI-compatible Chat Completions, Embeddings, and Vision. Use the OpenAI SDK you already have — change base_url to https://api.flatkey.ai/v1 and you're live.
200+ models from 12+ providers. New models added within 24 hours of release.
Every request goes through a 4-region edge with health checks every 30 seconds. We re-route around upstream incidents before your app's retry kicks in.
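Health-check-driven routing can be sketched as a table of recent probe results that the router consults before picking an upstream. The code below is an illustration under stated assumptions, not a description of flatkey's internals: `HealthTable` is a hypothetical name, and treating a probe result older than two intervals as stale is an assumption made for the example. Only the 30-second interval comes from the text above.

```python
from dataclasses import dataclass, field
from typing import Optional

CHECK_INTERVAL_S = 30.0  # probe cadence stated above


@dataclass
class HealthTable:
    """Tracks the most recent probe result per upstream."""

    # upstream name -> (healthy?, timestamp of the probe in seconds)
    status: dict = field(default_factory=dict)

    def record(self, upstream: str, healthy: bool, now: float) -> None:
        self.status[upstream] = (healthy, now)

    def is_routable(self, upstream: str, now: float) -> bool:
        healthy, checked_at = self.status.get(upstream, (False, float("-inf")))
        # Assumption: a result older than two intervals is stale, so not trusted.
        return healthy and (now - checked_at) <= 2 * CHECK_INTERVAL_S

    def pick(self, preferred: list, now: float) -> Optional[str]:
        """Return the first routable upstream in preference order, if any."""
        return next((u for u in preferred if self.is_routable(u, now)), None)


table = HealthTable()
table.record("provider-a", healthy=False, now=100.0)  # failed its last probe
table.record("provider-b", healthy=True, now=100.0)

print(table.pick(["provider-a", "provider-b"], now=110.0))  # provider-b
```

Because the unhealthy upstream is skipped at selection time, the request never reaches it, which is why the re-route can happen before a client-side retry would.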
No credit card to start. $5 trial credit on signup. Pay as you go after.
Get an API key →