NvidiaReleased on Apr 7, 2025

Llama 3.1 Nemotron Ultra 253B v1: Benchmarks, Pricing & Size

Llama 3.1 Nemotron Ultra 253B v1 is a language model from Nvidia, released in April 2025.

A 253B parameter derivative of Meta Llama 3.1 405B Instruct, developed by NVIDIA using Neural Architecture Search (NAS) and vertical compression. It underwent multi-phase post-training (SFT for Math, Code, Reasoning, Chat, Tool Calling; RL

Llama 3.1 Nemotron Ultra 253B v1 model size

Llama 3.1 Nemotron Ultra 253B v1 has 253 billion parameters. See how it compares to other models in the same parameter range.

Parameters
253B
Frontier (200B+)
253B
1B7B70B405B

Llama 3.1 Nemotron Ultra 253B v1 API

API access coming soon

Llama 3.1 Nemotron Ultra 253B v1 will be available through our gateway shortly.

Llama 3.1 Nemotron Ultra 253B v1 examples

Recent arena outputs from Llama 3.1 Nemotron Ultra 253B v1, picked from the highest-ranked matchups.

Llama 3.1 Nemotron Ultra 253B v1 license

Llama 3.1 Nemotron Ultra 253B v1 is released under the Llama 3.1 Community License license, which restricts commercial use, has 253.0B parameters, has a knowledge cutoff of December 2023.

License
Llama 3.1 Community License
Non-commercial
Parameters
253.0B
Knowledge cutoff
December 2023

FAQ

Common questions about Llama 3.1 Nemotron Ultra 253B v1.

When was Llama 3.1 Nemotron Ultra 253B v1 released?

Llama 3.1 Nemotron Ultra 253B v1 was released on April 7, 2025 by Nvidia. This is the official Llama 3.1 Nemotron Ultra 253B v1 release date tracked on LLM Stats.

How big is Llama 3.1 Nemotron Ultra 253B v1?

Llama 3.1 Nemotron Ultra 253B v1 has 253 billion parameters. It ships as an open-weight model, so you can download and run it on your own hardware.

Who created Llama 3.1 Nemotron Ultra 253B v1?

Llama 3.1 Nemotron Ultra 253B v1 was created by Nvidia.

What is the license for Llama 3.1 Nemotron Ultra 253B v1?

Llama 3.1 Nemotron Ultra 253B v1 is released under the Llama 3.1 Community License license. This is an open-source / open-weight license that permits self-hosting.

What is the knowledge cutoff date for Llama 3.1 Nemotron Ultra 253B v1?

Llama 3.1 Nemotron Ultra 253B v1 has a knowledge cutoff of December 2023, meaning it was trained on data up to that point and may not know about events after it.

Where is the Llama 3.1 Nemotron Ultra 253B v1 paper or technical report?

Llama 3.1 Nemotron Ultra 253B v1 has a paper or technical report available at https://arxiv.org/abs/2502.00203. Use that source for architecture, training, release and evaluation details.