Changelog
Every release of the Nimbus API and the nimbusapi.net site. Reverse-chronological.
We follow semver at a product level: major for breaking API or billing changes, minor for new models or features, patch for pricing corrections and bug fixes. Pricing corrections that under-charged customers are absorbed by Nimbus; pricing corrections that over-charged customers refund automatically to account balance.
2026-07-01v2.0.0major
Full rewrite of nimbusapi.net
- New landing page with Three.js WebGL hero animation and cloud-parallax scroll.
- 28-model catalog surfaced across /docs/pricing and /docs/models with strikethrough retail prices.
- Prepaid balance model replaces the old subscription tiers. No seat licenses, no commit.
- Session-cookie auth replaces bearer-token portal auth. API keys still bearer.
- Docs section rebuilt: 7 new billing/resource pages under /docs/*, real content top to bottom.
- Admin console at /dashboard/admin gets card grid for users, invoices, refunds, cost caps.
2026-06-30v1.5.0minor
SpiderSense provider + Azure fallback chain
- SpiderSense (China lab) added as a wholesale provider behind DeepSeek and Qwen models.
- Azure fallback wired for Anthropic + OpenAI: when the direct upstream returns 5xx or 429, the request is retried against Azure Model Router before returning to the caller.
- Fallback adds ~120ms p50 latency in the failure path; zero-cost on the success path.
- New response header x-nimbus-upstream-route echoes which upstream served the request.
2026-06-24v1.4.2patch
DeepSeek V4 Pro added
- DeepSeek V4 Pro live at $0.215 input / $0.435 output per 1M — 50% off the vendor list of $0.43 / $0.87.
- 1M context, strong tool calling, competitive with Claude Sonnet on public code benchmarks.
- Available as deepseek/deepseek-v4-pro on the OpenAI-compatible endpoint.
2026-06-20v1.4.1patch
Gemini 3 Flash Preview pricing corrected
- Gemini 3 Flash Preview was mispriced at $0.50 / $3.00 per 1M (the retail rate). Corrected to $0.25 / $1.50 (50% off retail).
- Affected customers received an automatic balance credit for the delta of any usage from 2026-06-15 through 2026-06-20. Notification emails sent 2026-06-20 09:00 UTC.
- Deployed a regression test that verifies every model in the catalog charges exactly retail ÷ 2.
2026-06-15v1.4.0minor
Claude Opus 4.8 launch
- Anthropic released Claude Opus 4.8 at 12:00 UTC. Nimbus enabled it in production at 12:47 UTC.
- Priced at $2.50 input / $12.50 output per 1M (retail: $5.00 / $25.00).
- 1M context window, extended thinking supported, tool use supported, vision supported.
- Registered as anthropic/claude-opus-4.8. The 4.7 and 4.6 remain available at the same price.
2026-06-01v1.3.0minor
OpenCode + Continue integration guides
- Published /docs/opencode with a copy-paste opencode.json config that routes to Nimbus.
- Published /docs/continue with a config.json snippet for Continue.dev users.
- Both providers verified to work with streaming, tool calling, and cache-control blocks.
2026-05-15v1.2.0minor
Cost caps per API key
- Daily and monthly USD spend caps can now be set per API key from the dashboard.
- Hard-cap mode returns 402 insufficient_balance the moment the cap is hit — no upstream call, zero tokens billed.
- Soft-cap mode continues to serve but fires alert emails every 5 minutes until raised or reset.
- Alert thresholds default to 50%, 80%, 95% of the cap. Emails + Discord webhook.
- See /docs/cost-caps for the full walkthrough.
2026-05-01v1.1.0minor
Multi-provider fallback chains
- Each API key can now register an ordered fallback chain of model IDs.
- When the primary returns 429 or 5xx after one internal retry, the gateway automatically tries the next model in the chain.
- Response header x-nimbus-model-used shows which model actually served the request.
- Chain is only invoked on transient failures — 4xx from your side (bad request, cap exceeded) never falls through.
2026-04-01v1.0.0major
Initial launch with 12 models
- Nimbus API launched publicly. OpenAI-compatible endpoint at https://llm.nimbusapi.net/v1.
- Launch catalog: 3 Claude models, 4 GPT models, 2 Gemini models, DeepSeek V3, Qwen 2.5, Kimi K2.
- Every price fixed at 50% off the vendor list rate.
- Prepaid balance model, card + crypto top-up, 30-day refund policy.
Subscribe
The changelog is the source of truth. To get notified when a new model launches or when pricing changes, subscribe to release emails in /dashboard/settings or watch /docs for updates. RSS feed available at /docs/changelog.rss.