- Organizations
- Nvidia
- Nemotron 3 Nano (30B A3B)
Nemotron 3 Nano (30B A3B): Benchmarks, Pricing & Context Window
Nemotron 3 Nano (30B A3B) is a language model from Nvidia, released in December 2025.
Nemotron 3 Nano is a 31.6B hybrid MoE model optimized for fast, long‑context agentic reasoning. It mixes Mamba‑2 and Transformer layers with a sparse MoE router (~3.6B active params per token) to deliver up to 4× higher throughput than
Nemotron 3 Nano (30B A3B) pricing
Providers
Nemotron 3 Nano (30B A3B) starts at $0.0600 per million input tokens and $0.240 per million output tokens via DeepInfra.
| Provider | Input $/M | Output $/M | Max Input | Max Output | Latency s | Throughput | Quant | Input | Output |
|---|---|---|---|---|---|---|---|---|---|
| $0.0600 | $0.240 | 262.1K | 262.1K | 7.15 | 144 c/s | bfloat16 |
Nemotron 3 Nano (30B A3B) API
API access coming soon
Nemotron 3 Nano (30B A3B) will be available through our gateway shortly.
Nemotron 3 Nano (30B A3B) examples
Recent arena outputs from Nemotron 3 Nano (30B A3B), picked from the highest-ranked matchups.
Nemotron 3 Nano (30B A3B) license
Nemotron 3 Nano (30B A3B) is released under the NVIDIA Open Model License Agreement license, which permits commercial use, has 32.0B parameters, has a knowledge cutoff of November 2025.
- License
- NVIDIA Open Model License Agreement
- Commercial use allowed
- Parameters
- 32.0B
- Knowledge cutoff
- November 2025
NVIDIA Open Model License Agreement
FAQ
Common questions about Nemotron 3 Nano (30B A3B).