Nemotron Nano 9B v2
Overview
NVIDIA-Nemotron-Nano-9B-v2 is a large language model (LLM) trained from scratch by NVIDIA, and designed as a unified model for both reasoning and non-reasoning tasks. It responds to user queries and tasks by first generating a reasoning trace and then concluding with a final response. The model's reasoning capabilities can be controlled via a system prompt. If the user prefers the model to provide its final answer without intermediate reasoning traces, it can be configured to do so, albeit with a slight decrease in accuracy for harder prompts that require reasoning. Conversely, allowing the model to generate reasoning traces first generally results in higher-quality final solutions to queries and tasks.
Nemotron Nano 9B v2 was released on August 18, 2025.
Performance
Timeline
Specifications
Benchmarks
Nemotron Nano 9B v2 Performance Across Datasets
Scores sourced from the model's scorecard, paper, or official blog posts
Pricing
Pricing, performance, and capabilities for Nemotron Nano 9B v2 across different providers:
API Access
API Access Coming Soon
API access for Nemotron Nano 9B v2 will be available soon through our gateway.
Recent Posts
Recent Reviews
FAQ
Common questions about Nemotron Nano 9B v2