One API.Every frontiermodel.
Nimbus routes OpenAI, Anthropic, Google and 12 more frontier providers behind one OpenAI-compatible endpoint. One key, one bill, up to 50% below market.
Six small superpowers that compound.
One key. One bill. Zero retention. Ship the AI features today, not next quarter.
SDK compatible
Drop-in replacement for OpenAI and Anthropic SDKs. Change the base URL, keep your code.
Learn morePrepaid balance
Top up any amount. No subscriptions, no monthly fees. Unused balance never expires.
Learn moreCost analytics
Per-model, per-key dashboards. Export CSV. Set hard spend caps before scaling.
Learn moreNo hard limits
Burst as hard as you need. Capacity scales with your balance, not a plan tier.
Learn moreZero retention
Prompts stream through and vanish. Only token counts persist — for your invoice.
Learn moreAll providers
OpenAI, Anthropic, Google, DeepSeek, Qwen, Z-AI, Moonshot. One key, one bill, one endpoint.
Learn moreFrontier models. One bill. Up to 50% cheaper.
Claude Opus 4.8
anthropic/claude-opus-4.8
Claude Opus 4.7
anthropic/claude-opus-4.7
Top-tier reasoning. Best for complex agents and long-context work.
Claude Opus 4.6
anthropic/claude-opus-4.6
Previous-gen Opus. Proven reasoning at the same price.
Claude Sonnet 4.6
anthropic/claude-sonnet-4.6
Balanced speed and intelligence. Great default for chat and tools.
Claude Haiku 4.5
anthropic/claude-haiku-4.5
Lowest latency. Built for high-throughput, real-time tasks.
GPT-5.5
openai/gpt-5.5
Most-used flagship. Strong general intelligence with 1M context.
GPT-5.4
openai/gpt-5.4
Best price-to-quality. Workhorse for high-volume pipelines.
GPT-5.4 mini
openai/gpt-5.4-mini
Compact GPT-5.4. Cheaper variant with the same skills, lower latency.
GPT-5.3 Codex
openai/gpt-5.3-codex
Code-tuned GPT-5.3. Best for agents, refactors and toolcalls.
GPT-5.1 Codex Max
openai/gpt-5.1-codex-max
Long-context code model. Built for repo-scale refactors.
GPT-5.1 Codex mini
openai/gpt-5.1-codex-mini
Tiny code model. Cheap autocomplete and quick agent loops.
Gemini 3.1 Pro Preview
google/gemini-3.1-pro-preview
Gemini Flash 3.5
google/gemini-3.5-flash
Fast, cheap Gemini. Great default for high-volume Google workloads.
Gemini 3 Flash Preview
google/gemini-3-flash-preview
DeepSeek V4 Pro
deepseek/deepseek-v4-pro
Flagship DeepSeek. 1M context, strong reasoning at a fraction of the price.
Qwen3 Coder
qwen/qwen3-coder
Code-specialized Qwen. 1M context, built for agentic coding workflows.
Qwen3.7 Max
qwen/qwen3.7-max
GLM-5
z-ai/glm-5
Latest Z.AI flagship. Strong reasoning and tool use for agents.
GLM 5.1
z-ai/glm-5.1
GLM 5.2
z-ai/glm-5.2
Kimi K2.6
moonshotai/kimi-k2.6
Moonshot agentic model with vision. Long context, great for tools.
Kimi K2.7 Code
moonshotai/kimi-k2.7-code
Gemini 2.5 Flash Image
google/gemini-2.5-flash-image
Gemini 3 Pro Image
google/gemini-3-pro-image
Gemini 3.1 Flash Image
google/gemini-3.1-flash-image
GPT-5 Image
openai/gpt-5-image
GPT-5 Image Mini
openai/gpt-5-image-mini
GPT-5.4 Image 2
openai/gpt-5.4-image-2
Teams burning over $2K/mo qualify for committed-use discounts up to 35%.
Ship in 60 seconds.
Top up
Add any amount — card, crypto, or wire. Funds land instantly.
Grab key
One key for every model and provider. Drop it into Claude Code, OpenCode, Cursor or any OpenAI/Anthropic SDK.
Ship it
Point your tool at our base URL. Pick a model — balance debits per token.
Answers to the basics.
Ship with every frontier model.
Top up any amount. No card on file, no surprises. Pay only for the tokens you actually burn.