Model Comparison

GPT OSS 120B High vs Nemotron 3 Super (120B A12B)

Nemotron 3 Super (120B A12B) scores higher on 3 of the 4 shared benchmarks. GPT OSS 120B High and Nemotron 3 Super (120B A12B) are priced identically at their lowest-priced providers.

Performance Benchmarks

Comparative analysis across standard metrics

4 benchmarks

GPT OSS 120B High leads on 1 benchmark (AIME 2025), while Nemotron 3 Super (120B A12B) leads on 3 benchmarks (GPQA, IFBench, MMLU-Pro).

Nemotron 3 Super (120B A12B) leads on GPQA, IFBench, and MMLU-Pro by 1.8 to 3.1 percentage points; GPT OSS 120B High leads on AIME 2025 by 2.3 points.
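
The win counts and margins can be reproduced from the four scores reported in the Key Takeaways section of this page. The following is a minimal sketch; the `SCORES` dictionary and the `tally` helper are names made up for illustration, not part of any llm-stats.com tooling.

```python
# Scores (percent) as reported in the Key Takeaways section of this page.
# Tuple order is an assumption of this sketch:
# (GPT OSS 120B High, Nemotron 3 Super (120B A12B)).
SCORES = {
    "AIME 2025": (92.5, 90.2),
    "GPQA": (80.9, 82.7),
    "IFBench": (69.5, 72.6),
    "MMLU-Pro": (80.7, 83.7),
}


def tally(scores):
    """Return (gpt_wins, nemotron_wins, margins).

    margins maps each benchmark to Nemotron's score minus GPT OSS's score,
    in percentage points, rounded to one decimal place.
    """
    gpt_wins = [name for name, (gpt, nem) in scores.items() if gpt > nem]
    nemotron_wins = [name for name, (gpt, nem) in scores.items() if nem > gpt]
    margins = {name: round(nem - gpt, 1) for name, (gpt, nem) in scores.items()}
    return gpt_wins, nemotron_wins, margins


gpt_wins, nemotron_wins, margins = tally(SCORES)
print("GPT OSS 120B High leads on:", gpt_wins)
print("Nemotron 3 Super leads on:", nemotron_wins)
print("Margins (Nemotron minus GPT OSS):", margins)
```

A positive margin favors Nemotron 3 Super (120B A12B); a negative one favors GPT OSS 120B High.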

Tue Jun 02 2026 • llm-stats.com

Pricing Analysis

Price comparison per million tokens

For input processing, GPT OSS 120B High ($0.10/1M tokens) costs the same as Nemotron 3 Super (120B A12B) ($0.10/1M tokens).

For output processing, GPT OSS 120B High ($0.50/1M tokens) costs the same as Nemotron 3 Super (120B A12B) ($0.50/1M tokens).

In conclusion, GPT OSS 120B High and Nemotron 3 Super (120B A12B) cost the same.*

* Using a 3:1 ratio of input to output tokens

Lowest available price from all providers
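
The 3:1 input-to-output ratio in the footnote implies a weighted average of the two listed prices. A minimal sketch of that arithmetic follows; the function name `blended_price` is an assumption for illustration, and the page does not state exactly how it computes its blended figure.

```python
from decimal import Decimal


def blended_price(input_per_m, output_per_m, input_parts=3, output_parts=1):
    """Weighted average price per 1M tokens for a given input:output mix."""
    total_parts = input_parts + output_parts
    return (input_per_m * input_parts + output_per_m * output_parts) / total_parts


# Listed prices in USD per 1M tokens (identical for both models on this page).
price = blended_price(Decimal("0.10"), Decimal("0.50"))
print(f"Blended price at 3:1: ${price:.2f} per 1M tokens")
```

With the listed prices this works out to (3 × $0.10 + 1 × $0.50) / 4 = $0.20 per million tokens for both models, which is why the page reports them as costing the same.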

GPT OSS 120B High

Input tokens: $0.10

Output tokens: $0.50

Best provider: OpenAI

Nemotron 3 Super (120B A12B)

Input tokens: $0.10

Output tokens: $0.50

Best provider: DeepInfra

Model Size

Parameter count comparison

3.2B diff

Nemotron 3 Super (120B A12B) has 3.2B more parameters than GPT OSS 120B High, making it 2.7% larger.
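
The percentage is measured against the smaller model. A short sketch of the calculation, using the parameter counts listed on this page (the helper name `percent_larger` is hypothetical):

```python
def percent_larger(larger_b, smaller_b):
    """Relative size difference, as a percentage of the smaller model."""
    return (larger_b - smaller_b) / smaller_b * 100


# Parameter counts in billions, as listed on this page.
gpt_oss_b = 116.8
nemotron_b = 120.0

diff = nemotron_b - gpt_oss_b
pct = percent_larger(nemotron_b, gpt_oss_b)
print(f"{diff:.1f}B difference, {pct:.1f}% larger")
```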

GPT OSS 120B High

116.8B parameters

Nemotron 3 Super (120B A12B)

120.0B parameters

Context Window

Maximum input and output token capacity

Nemotron 3 Super (120B A12B) accepts up to 262,144 input tokens, twice GPT OSS 120B High's 131,072. The same limits apply to output: Nemotron 3 Super (120B A12B) can generate up to 262,144 tokens, while GPT OSS 120B High is limited to 131,072.

GPT OSS 120B High

Input: 131,072 tokens

Output: 131,072 tokens

Nemotron 3 Super (120B A12B)

Input: 262,144 tokens

Output: 262,144 tokens
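
To see what these limits mean for a given request, the sketch below checks a prompt size and a requested output size against the listed numbers. The `LIMITS` table and the `fits` helper are hypothetical names for illustration. The sketch also assumes the input and output limits apply independently, which this page does not confirm; some deployments share a single window between prompt and completion.

```python
# Listed limits from this page, in tokens.
LIMITS = {
    "GPT OSS 120B High": {"input": 131_072, "output": 131_072},
    "Nemotron 3 Super (120B A12B)": {"input": 262_144, "output": 262_144},
}


def fits(model, prompt_tokens, max_output_tokens):
    """True if both sizes are within the model's listed limits.

    Assumption: input and output limits are checked independently.
    """
    limits = LIMITS[model]
    return (
        prompt_tokens <= limits["input"]
        and max_output_tokens <= limits["output"]
    )


# A 200,000-token prompt with a 4,000-token reply.
for model in LIMITS:
    print(model, fits(model, 200_000, 4_000))
```

Under these assumptions, a 200,000-token prompt exceeds GPT OSS 120B High's listed input limit but fits within Nemotron 3 Super (120B A12B)'s.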

License

Usage and distribution terms

GPT OSS 120B High is licensed under Apache 2.0, while Nemotron 3 Super (120B A12B) uses the NVIDIA Open Model License Agreement.

License differences may affect how you can use these models in commercial or open-source projects.

GPT OSS 120B High

Apache 2.0

Open weights

Nemotron 3 Super (120B A12B)

NVIDIA Open Model License Agreement

Open weights

Release Timeline

When each model was launched

GPT OSS 120B High was released on 2025-08-05, while Nemotron 3 Super (120B A12B) was released on 2026-03-11.

Nemotron 3 Super (120B A12B) is 7 months newer than GPT OSS 120B High.
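
The 7-month figure is the number of full calendar months between the two release dates. A minimal sketch of that calculation follows; `full_months_between` is a hypothetical helper, and the page may round differently.

```python
from datetime import date


def full_months_between(start, end):
    """Count whole calendar months elapsed from start to end."""
    months = (end.year - start.year) * 12 + (end.month - start.month)
    # If the end day-of-month is earlier than the start's, the last month is incomplete.
    if end.day < start.day:
        months -= 1
    return months


gpt_oss_release = date(2025, 8, 5)
nemotron_release = date(2026, 3, 11)

months = full_months_between(gpt_oss_release, nemotron_release)
days = (nemotron_release - gpt_oss_release).days
print(f"{months} full months ({days} days) apart")
```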

GPT OSS 120B High

Aug 5, 2025

10 months ago

Nemotron 3 Super (120B A12B)

Mar 11, 2026

2 months ago

7mo newer

Knowledge Cutoff

When training data ends

Nemotron 3 Super (120B A12B) has a documented knowledge cutoff of 2025-06-01, while GPT OSS 120B High's cutoff date is not specified.

We can confirm Nemotron 3 Super (120B A12B)'s training data extends to 2025-06-01, but cannot make a direct comparison without GPT OSS 120B High's cutoff date.

GPT OSS 120B High

Not specified

Nemotron 3 Super (120B A12B)

Jun 2025

Provider Availability

GPT OSS 120B High is available from OpenAI and Fireworks. Nemotron 3 Super (120B A12B) is available from DeepInfra.

GPT OSS 120B High

OpenAI

Input: $0.10/1M, Output: $0.50/1M

Fireworks

Input: $0.15/1M, Output: $0.60/1M

Nemotron 3 Super (120B A12B)

DeepInfra

Input: $0.10/1M, Output: $0.50/1M

* Prices shown are per million tokens
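
Picking the lowest-priced provider for each model can be sketched as below, using the provider prices listed above. The `PROVIDERS` table and `cheapest_provider` helper are hypothetical names, and ranking by a 3:1 blended price is an assumption carried over from the pricing footnote; the page does not say how it ranks providers.

```python
from decimal import Decimal

# (input, output) prices in USD per 1M tokens, as listed on this page.
PROVIDERS = {
    "GPT OSS 120B High": {
        "OpenAI": (Decimal("0.10"), Decimal("0.50")),
        "Fireworks": (Decimal("0.15"), Decimal("0.60")),
    },
    "Nemotron 3 Super (120B A12B)": {
        "DeepInfra": (Decimal("0.10"), Decimal("0.50")),
    },
}


def cheapest_provider(model, input_parts=3, output_parts=1):
    """Return (provider, blended_price) with the lowest blended price.

    Assumption: providers are ranked by a weighted input:output average.
    """
    def blended(prices):
        input_price, output_price = prices
        total_parts = input_parts + output_parts
        return (input_price * input_parts + output_price * output_parts) / total_parts

    offers = PROVIDERS[model]
    name = min(offers, key=lambda provider: blended(offers[provider]))
    return name, blended(offers[name])


for model in PROVIDERS:
    provider, price = cheapest_provider(model)
    print(f"{model}: {provider} at ${price:.2f} per 1M tokens (blended)")
```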

Key Takeaways

GPT OSS 120B High

OpenAI

Higher AIME 2025 score (92.5% vs 90.2%)

Nemotron 3 Super (120B A12B)

NVIDIA

Larger context window (262,144 tokens)

Higher GPQA score (82.7% vs 80.9%)

Higher IFBench score (72.6% vs 69.5%)

Higher MMLU-Pro score (83.7% vs 80.7%)

FAQ

Common questions about GPT OSS 120B High vs Nemotron 3 Super (120B A12B).

Which is better, GPT OSS 120B High or Nemotron 3 Super (120B A12B)?

Nemotron 3 Super (120B A12B) scores higher on 3 of the 4 shared benchmarks (GPQA, IFBench, MMLU-Pro), while GPT OSS 120B High leads on AIME 2025. GPT OSS 120B High is made by OpenAI and Nemotron 3 Super (120B A12B) is made by NVIDIA. The best choice depends on your use case, so compare their benchmark scores, pricing, and capabilities above.

How does GPT OSS 120B High compare to Nemotron 3 Super (120B A12B) in benchmarks?

GPT OSS 120B High scores 92.5% on AIME 2025, 83.8% on MMMLU, 81.9% on LiveCodeBench v6, 80.9% on GPQA, and 80.7% on MMLU-Pro. Nemotron 3 Super (120B A12B) scores 94.7% on HMMT 2025, 91.8% on RULER, 90.2% on AIME 2025, 86.7% on WMT24++, and 83.7% on MMLU-Pro.

Is GPT OSS 120B High cheaper than Nemotron 3 Super (120B A12B)?

No. Both models cost $0.10 per million input tokens and $0.50 per million output tokens at their lowest-priced providers.

What are the context window sizes for GPT OSS 120B High and Nemotron 3 Super (120B A12B)?

GPT OSS 120B High supports 131K tokens and Nemotron 3 Super (120B A12B) supports 262K tokens. A larger context window lets you process longer documents, conversations, or codebases in a single request.

What are the main differences between GPT OSS 120B High and Nemotron 3 Super (120B A12B)?

Key differences include context window (131K vs 262K) and licensing (Apache 2.0 vs NVIDIA Open Model License Agreement). See the full comparison above for benchmark-by-benchmark results.

Who makes GPT OSS 120B High and Nemotron 3 Super (120B A12B)?

GPT OSS 120B High is developed by OpenAI and Nemotron 3 Super (120B A12B) is developed by NVIDIA.