đ© Nemotron 3 Nano Launch, what you need to know

Nemotron 3 Nano is an open 30B model built for agent workflows. It mixes Mambaâ2 and Transformer layers with a MixtureâofâExperts router that activates a few experts per token (6 of 128), so you get stronger reasoning without running all 31.6B params every step (~3.6B active). (aka longer context while staying cheap).
Why it matters
- Itâs fast: up to 4x higher token throughput than Nemotron 2 Nano, and strong throughput versus similar open models in likeâforâlike tests.
- It handles long input: up to 1M tokens. Most people will use smaller windows due to VRAM, but training included a 512k stage to keep shortâtask accuracy steady.
- Itâs controllable: toggle reasoning on/off and set a thinking budget to cap âreasoningâ tokens. Helps keep multiâagent costs predictable.
Quality and training
- Good results in math and coding benchmarks (GSM8K, MATH, HumanEval, MBPP), plus solid longâcontext scores as seen in the scores for the RULER benchmark below, with scores from Nemotron 3 Nano highlighted on the right.
- Postâtraining uses supervised fineâtuning, reinforcement learning across multiple environments, and RLHF. NeMo Gym and - NeMo RL are open so you can train and evaluate in similar setups.

Open and deployable
- Open weights, recipes, and large slices of datasets are released. Licensed for commercial use under NVIDIAâs Open Model License, with derivatives allowed and no claim on outputs. (HuggingFace)
- Run it on H100/A100 with vLLM or SGLang, or use llama.cpp/LM Studio for local tests. Itâs available via Hugging Face and major inference providers.
TL;DR
If youâre building agents that need speed, long context, and cost controls, Nemotron 3 Nano is a good starter. Route harder tasks to a frontier model when needed, and keep routine work on Nano.
Nemotron 3 Super (~100B) and Ultra (~500B) are planned for early 2026, aimed at higherâend reasoning while keeping efficiency in mind, as per their news release.
Read more on our in-depth analysis: https://llm-stats.com/blog/research/nemotron-3-nano-launchUse today through our playground: https://llm-stats.com/playground