Changelog

Every release of the Nimbus API and the nimbusapi.net site. Reverse-chronological.

We follow semver at a product level: major for breaking API or billing changes, minor for new models or features, patch for pricing corrections and bug fixes. Pricing corrections that under-charged customers are absorbed by Nimbus; pricing corrections that over-charged customers refund automatically to account balance.

2026-07-01v2.0.0major

Full rewrite of nimbusapi.net

New landing page with Three.js WebGL hero animation and cloud-parallax scroll.
28-model catalog surfaced across /docs/pricing and /docs/models with strikethrough retail prices.
Prepaid balance model replaces the old subscription tiers. No seat licenses, no commit.
Session-cookie auth replaces bearer-token portal auth. API keys still bearer.
Docs section rebuilt: 7 new billing/resource pages under /docs/*, real content top to bottom.
Admin console at /dashboard/admin gets card grid for users, invoices, refunds, cost caps.

2026-06-30v1.5.0minor

SpiderSense provider + Azure fallback chain

SpiderSense (China lab) added as a wholesale provider behind DeepSeek and Qwen models.
Azure fallback wired for Anthropic + OpenAI: when the direct upstream returns 5xx or 429, the request is retried against Azure Model Router before returning to the caller.
Fallback adds ~120ms p50 latency in the failure path; zero-cost on the success path.
New response header x-nimbus-upstream-route echoes which upstream served the request.

2026-06-24v1.4.2patch

DeepSeek V4 Pro added

DeepSeek V4 Pro live at $0.215 input / $0.435 output per 1M — 50% off the vendor list of $0.43 / $0.87.
1M context, strong tool calling, competitive with Claude Sonnet on public code benchmarks.
Available as deepseek/deepseek-v4-pro on the OpenAI-compatible endpoint.

2026-06-20v1.4.1patch

Gemini 3 Flash Preview pricing corrected

Gemini 3 Flash Preview was mispriced at $0.50 / $3.00 per 1M (the retail rate). Corrected to $0.25 / $1.50 (50% off retail).
Affected customers received an automatic balance credit for the delta of any usage from 2026-06-15 through 2026-06-20. Notification emails sent 2026-06-20 09:00 UTC.
Deployed a regression test that verifies every model in the catalog charges exactly retail ÷ 2.

2026-06-15v1.4.0minor

Claude Opus 4.8 launch

Anthropic released Claude Opus 4.8 at 12:00 UTC. Nimbus enabled it in production at 12:47 UTC.
Priced at $2.50 input / $12.50 output per 1M (retail: $5.00 / $25.00).
1M context window, extended thinking supported, tool use supported, vision supported.
Registered as anthropic/claude-opus-4.8. The 4.7 and 4.6 remain available at the same price.

2026-06-01v1.3.0minor

OpenCode + Continue integration guides

Published /docs/opencode with a copy-paste opencode.json config that routes to Nimbus.
Published /docs/continue with a config.json snippet for Continue.dev users.
Both providers verified to work with streaming, tool calling, and cache-control blocks.

2026-05-15v1.2.0minor

Cost caps per API key

Daily and monthly USD spend caps can now be set per API key from the dashboard.
Hard-cap mode returns 402 insufficient_balance the moment the cap is hit — no upstream call, zero tokens billed.
Soft-cap mode continues to serve but fires alert emails every 5 minutes until raised or reset.
Alert thresholds default to 50%, 80%, 95% of the cap. Emails + Discord webhook.
See /docs/cost-caps for the full walkthrough.

2026-05-01v1.1.0minor

Multi-provider fallback chains

Each API key can now register an ordered fallback chain of model IDs.
When the primary returns 429 or 5xx after one internal retry, the gateway automatically tries the next model in the chain.
Response header x-nimbus-model-used shows which model actually served the request.
Chain is only invoked on transient failures — 4xx from your side (bad request, cap exceeded) never falls through.

2026-04-01v1.0.0major

Initial launch with 12 models

Nimbus API launched publicly. OpenAI-compatible endpoint at https://llm.nimbusapi.net/v1.
Launch catalog: 3 Claude models, 4 GPT models, 2 Gemini models, DeepSeek V3, Qwen 2.5, Kimi K2.
Every price fixed at 50% off the vendor list rate.
Prepaid balance model, card + crypto top-up, 30-day refund policy.

The changelog is the source of truth. To get notified when a new model launches or when pricing changes, subscribe to release emails in /dashboard/settings or watch /docs for updates. RSS feed available at /docs/changelog.rss.