Nemotron 3 Nano (30B A3B)

Overview

Nemotron 3 Nano is a 31.6B-parameter hybrid mixture-of-experts (MoE) model optimized for fast, long-context agentic reasoning. It combines Mamba-2 and Transformer layers with a sparse MoE router that activates about 3.6B parameters per token, delivering up to 4× higher throughput than Nemotron 2 and strong accuracy across math, coding, and tool use. It supports a 1M-token context window, offers a Reasoning ON/OFF switch and a thinking budget to control costs, and ships with open weights, data, and RL tooling (NeMo Gym/RL). Released on December 15, 2025 under the NVIDIA Open Model License, it is built as an efficient backbone for multi-agent systems at scale.
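To make the sparse-activation figures concrete, the short sketch below computes the share of parameters used per token from the two numbers quoted in the overview (31.6B total, about 3.6B active). The function and constant names are illustrative only and do not come from NVIDIA tooling.

```python
# Figures taken from the overview text; names here are illustrative assumptions.
TOTAL_PARAMS_B = 31.6   # total parameters, in billions
ACTIVE_PARAMS_B = 3.6   # approximate active parameters per token, in billions


def active_fraction(active_b: float, total_b: float) -> float:
    """Return the share of parameters used per token, as a value in [0, 1]."""
    if total_b <= 0:
        raise ValueError("total_b must be positive")
    return active_b / total_b


fraction = active_fraction(ACTIVE_PARAMS_B, TOTAL_PARAMS_B)
print(f"Active per token: {fraction:.1%} of all parameters")
# prints "Active per token: 11.4% of all parameters"
```

Roughly one ninth of the weights take part in each token's forward pass, which is consistent with the "A3B" suffix in the model name.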

Nemotron 3 Nano (30B A3B) was released on December 15, 2025. API access is available through DeepInfra.

Performance

Timeline

Released: December 15, 2025
Knowledge Cutoff: November 2025

Specifications

Parameters: 32.0B
License: NVIDIA Open Model License Agreement
Training Data: Unknown

Benchmarks

Nemotron 3 Nano (30B A3B) Performance Across Datasets

Scores sourced from the model's scorecard, paper, or official blog posts

Pricing

Pricing, performance, and capabilities for Nemotron 3 Nano (30B A3B) across different providers:

Provider    Input ($/M tokens)   Output ($/M tokens)   Max Input   Max Output   Latency (s)   Throughput   Quantization
DeepInfra   $0.06                $0.24                 262.1K      262.1K       -             -            bfloat16
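As a rough illustration of how the per-million-token prices listed for DeepInfra translate into a per-request cost, the sketch below multiplies token counts by the listed rates. The function name and the example token counts are hypothetical; only the two prices come from the listing above.

```python
# Prices from the DeepInfra listing above, in USD per 1M tokens.
INPUT_PRICE_PER_M = 0.06
OUTPUT_PRICE_PER_M = 0.24
TOKENS_PER_MILLION = 1_000_000


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of one request (hypothetical helper)."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    input_cost = input_tokens / TOKENS_PER_MILLION * INPUT_PRICE_PER_M
    output_cost = output_tokens / TOKENS_PER_MILLION * OUTPUT_PRICE_PER_M
    return input_cost + output_cost


# Example: a 100,000-token prompt with a 2,000-token completion.
cost = estimate_cost(100_000, 2_000)
print(f"Estimated cost: ${cost:.5f}")
```

Output tokens cost four times as much as input tokens at these rates, so long reasoning traces affect the bill more than long prompts do.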

API Access

API Access Coming Soon

API access for Nemotron 3 Nano (30B A3B) will be available soon through our gateway.

FAQ

Common questions about Nemotron 3 Nano (30B A3B)

Nemotron 3 Nano (30B A3B) was released on December 15, 2025 by Nvidia.
Nemotron 3 Nano (30B A3B) was created by Nvidia.
Nemotron 3 Nano (30B A3B) has about 32.0 billion total parameters (31.6B, rounded), of which roughly 3.6B are active per token.
Nemotron 3 Nano (30B A3B) is released under the NVIDIA Open Model License Agreement, an open-weight license.
Nemotron 3 Nano (30B A3B) has a knowledge cutoff of November 2025. This means the model was trained on data up to this date and may not have information about events after this time.