Model Comparison

GPT OSS 120B vs Nemotron 3 Super (120B A12B)

Nemotron 3 Super (120B A12B) scores higher on both benchmarks reported for the two models (GPQA and Humanity's Last Exam). GPT OSS 120B is about 1.1x cheaper per token.

Performance Benchmarks

Comparative analysis across standard metrics

2 benchmarks

Of the 2 benchmarks reported for both models, Nemotron 3 Super (120B A12B) leads on both: GPQA (82.7% vs 80.1%) and Humanity's Last Exam (22.8% vs 14.9%). GPT OSS 120B leads on none.

Tue Jun 02 2026 • llm-stats.com

Pricing Analysis

Price comparison per million tokens

GPT OSS 120B costs less

For input processing, GPT OSS 120B ($0.09/1M tokens) is 1.1x cheaper than Nemotron 3 Super (120B A12B) ($0.10/1M tokens).

For output processing, GPT OSS 120B ($0.45/1M tokens) is 1.1x cheaper than Nemotron 3 Super (120B A12B) ($0.50/1M tokens).

In conclusion, Nemotron 3 Super (120B A12B) is more expensive than GPT OSS 120B.*

* Using a 3:1 ratio of input to output tokens

Lowest available price from all providers
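The blended comparison in the footnote can be sketched as follows. This is a minimal illustration that assumes the 3:1 ratio means a weighted average of three parts input price to one part output price; the page does not spell out its exact formula, and the function name `blended_price` is hypothetical.

```python
def blended_price(input_price: float, output_price: float,
                  input_parts: int = 3, output_parts: int = 1) -> float:
    """Weighted average price per 1M tokens for a given input:output mix.

    Assumption: the page's "3:1 ratio" is a simple weighted average.
    """
    total_parts = input_parts + output_parts
    return (input_parts * input_price + output_parts * output_price) / total_parts


# Lowest listed prices per 1M tokens, taken from the page.
gpt_oss = blended_price(0.09, 0.45)
nemotron = blended_price(0.10, 0.50)

print(f"GPT OSS 120B: ${gpt_oss:.2f} per 1M blended tokens")
print(f"Nemotron 3 Super (120B A12B): ${nemotron:.2f} per 1M blended tokens")
print(f"Nemotron / GPT OSS ratio: {nemotron / gpt_oss:.2f}x")
```

Under that assumption the blended prices come to about $0.18 and $0.20 per 1M tokens, which matches the roughly 1.1x gap quoted above.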

GPT OSS 120B

Input tokens: $0.09

Output tokens: $0.45

Best provider: DeepInfra

Nemotron 3 Super (120B A12B)

Input tokens: $0.10

Output tokens: $0.50

Best provider: DeepInfra

Model Size

Parameter count comparison

3.2B diff

Nemotron 3 Super (120B A12B) has 3.2B more parameters than GPT OSS 120B, making it 2.7% larger.
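The size gap above is plain arithmetic on the two listed parameter counts. A short sketch, with variable names chosen only for illustration:

```python
# Parameter counts in billions, as listed on the page.
gpt_oss_params_b = 116.8
nemotron_params_b = 120.0

# Absolute gap, and the gap relative to the smaller model.
diff_b = nemotron_params_b - gpt_oss_params_b
pct_larger = diff_b / gpt_oss_params_b * 100

print(f"{diff_b:.1f}B more parameters, {pct_larger:.1f}% larger")
```

The percentage is measured against GPT OSS 120B's count, which is why it reads "2.7% larger" rather than a share of Nemotron's total.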

GPT OSS 120B

116.8B parameters

Nemotron 3 Super (120B A12B)

120.0B parameters

Context Window

Maximum input and output token capacity

Nemotron 3 Super (120B A12B) accepts 262,144 input tokens compared to GPT OSS 120B's 131,072 tokens. Nemotron 3 Super (120B A12B) can generate longer responses up to 262,144 tokens, while GPT OSS 120B is limited to 131,072 tokens.
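To see what these limits mean in practice, the sketch below checks whether a request fits each model. It treats the input and output limits as independent caps, as the page lists them; a given provider may instead enforce a shared window or lower caps, and token counts depend on each model's tokenizer. The names `ContextLimits`, `LIMITS`, and `fits` are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ContextLimits:
    max_input_tokens: int
    max_output_tokens: int


# Limits as listed on the page.
LIMITS = {
    "GPT OSS 120B": ContextLimits(131_072, 131_072),
    "Nemotron 3 Super (120B A12B)": ContextLimits(262_144, 262_144),
}


def fits(model: str, prompt_tokens: int, max_new_tokens: int) -> bool:
    """Return True if both the prompt and the requested output fit the listed caps."""
    limits = LIMITS[model]
    return (prompt_tokens <= limits.max_input_tokens
            and max_new_tokens <= limits.max_output_tokens)


# Example: a 200,000-token prompt with up to 4,000 generated tokens.
models_that_fit = [m for m in LIMITS if fits(m, 200_000, 4_000)]
print(models_that_fit)
```

In this example only Nemotron 3 Super (120B A12B) accepts the request, since the prompt exceeds GPT OSS 120B's 131,072-token input limit.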

GPT OSS 120B

Input: 131,072 tokens

Output: 131,072 tokens

Nemotron 3 Super (120B A12B)

Input: 262,144 tokens

Output: 262,144 tokens

License

Usage and distribution terms

GPT OSS 120B is licensed under Apache 2.0, while Nemotron 3 Super (120B A12B) uses the NVIDIA Open Model License Agreement.

License differences may affect how you can use these models in commercial or open-source projects.

GPT OSS 120B

Apache 2.0

Open weights

Nemotron 3 Super (120B A12B)

NVIDIA Open Model License Agreement

Open weights

Release Timeline

When each model was launched

GPT OSS 120B was released on 2025-08-05, while Nemotron 3 Super (120B A12B) was released on 2026-03-11.

Nemotron 3 Super (120B A12B) is 7 months newer than GPT OSS 120B.
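The 7-month figure can be reproduced from the two release dates. The sketch below counts whole calendar months between them; this counting rule is an assumption, since the page does not state how it rounds, and `full_months_between` is a hypothetical helper.

```python
from datetime import date


def full_months_between(earlier: date, later: date) -> int:
    """Count whole calendar months elapsed from `earlier` to `later`."""
    months = (later.year - earlier.year) * 12 + (later.month - earlier.month)
    # Drop the final month if its day-of-month has not been reached yet.
    if later.day < earlier.day:
        months -= 1
    return months


gap = full_months_between(date(2025, 8, 5), date(2026, 3, 11))
print(f"Nemotron 3 Super (120B A12B) is {gap} months newer")
```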

GPT OSS 120B

Aug 5, 2025

10 months ago

Nemotron 3 Super (120B A12B)

Mar 11, 2026

2 months ago

7mo newer

Knowledge Cutoff

When training data ends

Nemotron 3 Super (120B A12B) has a documented knowledge cutoff of 2025-06-01, while GPT OSS 120B's cutoff date is not specified.

We can confirm Nemotron 3 Super (120B A12B)'s training data extends to 2025-06-01, but cannot make a direct comparison without GPT OSS 120B's cutoff date.

GPT OSS 120B

—

Nemotron 3 Super (120B A12B)

Jun 2025

Provider Availability

GPT OSS 120B is available from DeepInfra, Novita, OpenAI, Fireworks, and Groq. Nemotron 3 Super (120B A12B) is available only from DeepInfra.

GPT OSS 120B
Provider	Input price	Output price
DeepInfra	$0.09/1M	$0.45/1M
Novita	$0.10/1M	$0.50/1M
OpenAI	$0.10/1M	$0.50/1M
Fireworks	$0.15/1M	$0.60/1M
Groq	$0.15/1M	$0.60/1M

Nemotron 3 Super (120B A12B)
Provider	Input price	Output price
DeepInfra	$0.10/1M	$0.50/1M

* Prices shown are per million tokens
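Given the provider prices above, picking the cheapest option for a specific workload is a small calculation. The sketch below uses the listed prices and a hypothetical request size; `PROVIDERS`, `request_cost`, and `cheapest_provider` are illustrative names, not part of any provider's API.

```python
# (input price, output price) in USD per 1M tokens, as listed on the page.
PROVIDERS = {
    "GPT OSS 120B": {
        "DeepInfra": (0.09, 0.45),
        "Novita": (0.10, 0.50),
        "OpenAI": (0.10, 0.50),
        "Fireworks": (0.15, 0.60),
        "Groq": (0.15, 0.60),
    },
    "Nemotron 3 Super (120B A12B)": {
        "DeepInfra": (0.10, 0.50),
    },
}


def request_cost(prices: tuple[float, float],
                 input_tokens: int, output_tokens: int) -> float:
    """Cost in USD of one request at the given per-1M-token prices."""
    input_price, output_price = prices
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


def cheapest_provider(model: str, input_tokens: int,
                      output_tokens: int) -> tuple[str, float]:
    """Return the lowest-cost provider for a model and that request's cost."""
    offers = PROVIDERS[model]
    name = min(offers,
               key=lambda p: request_cost(offers[p], input_tokens, output_tokens))
    return name, request_cost(offers[name], input_tokens, output_tokens)


# Hypothetical workload: 30,000 input tokens and 10,000 output tokens.
name, cost = cheapest_provider("GPT OSS 120B", 30_000, 10_000)
print(f"{name}: ${cost:.4f}")
```

Price is only one factor; the listing does not cover latency, rate limits, or quantization, which can also differ between providers.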

Key Takeaways

GPT OSS 120B

OpenAI

Less expensive input tokens

Less expensive output tokens

Nemotron 3 Super (120B A12B)

NVIDIA

Larger context window (262,144 tokens)

Higher GPQA score (82.7% vs 80.1%)

Higher Humanity's Last Exam score (22.8% vs 14.9%)

Detailed Comparison

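AI Model Comparison Table
Feature	GPT OSS 120B	Nemotron 3 Super (120B A12B)
Developer	OpenAI	NVIDIA
Parameters	116.8B	120.0B
Context window (input / output)	131,072 / 131,072 tokens	262,144 / 262,144 tokens
Lowest input price	$0.09/1M tokens	$0.10/1M tokens
Lowest output price	$0.45/1M tokens	$0.50/1M tokens
License	Apache 2.0	NVIDIA Open Model License Agreement
Release date	Aug 5, 2025	Mar 11, 2026
Knowledge cutoff	Not specified	Jun 2025
GPQA	80.1%	82.7%
Humanity's Last Exam	14.9%	22.8%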

FAQ

Common questions about GPT OSS 120B vs Nemotron 3 Super (120B A12B).

Which is better, GPT OSS 120B or Nemotron 3 Super (120B A12B)?

Nemotron 3 Super (120B A12B) scores higher on both benchmarks reported for the two models (GPQA and Humanity's Last Exam), while GPT OSS 120B is slightly cheaper. GPT OSS 120B is made by OpenAI and Nemotron 3 Super (120B A12B) is made by NVIDIA. The best choice depends on your use case: compare their benchmark scores, pricing, and capabilities above.

How does GPT OSS 120B compare to Nemotron 3 Super (120B A12B) in benchmarks?

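GPT OSS 120B scores 90.0% on MMLU, 82.1% on CodeForces, 80.1% on GPQA, 67.8% on TAU-bench Retail, and 57.6% on HealthBench. Nemotron 3 Super (120B A12B) scores 94.7% on HMMT 2025, 91.8% on RULER, 90.2% on AIME 2025, 86.7% on WMT24++, and 83.7% on MMLU-Pro. On the two benchmarks reported for both models, Nemotron 3 Super (120B A12B) leads: GPQA (82.7% vs 80.1%) and Humanity's Last Exam (22.8% vs 14.9%).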

Is GPT OSS 120B cheaper than Nemotron 3 Super (120B A12B)?

Yes. GPT OSS 120B is about 1.1x cheaper for both input and output tokens. It costs $0.09/M input and $0.45/M output via DeepInfra, while Nemotron 3 Super (120B A12B) costs $0.10/M input and $0.50/M output, also via DeepInfra.

What are the context window sizes for GPT OSS 120B and Nemotron 3 Super (120B A12B)?

GPT OSS 120B supports 131K tokens and Nemotron 3 Super (120B A12B) supports 262K tokens. A larger context window lets you process longer documents, conversations, or codebases in a single request.

What are the main differences between GPT OSS 120B and Nemotron 3 Super (120B A12B)?

Key differences include context window (131K vs 262K), input pricing ($0.09 vs $0.10/M), and licensing (Apache 2.0 vs NVIDIA Open Model License Agreement). See the full comparison above for benchmark-by-benchmark results.

Who makes GPT OSS 120B and Nemotron 3 Super (120B A12B)?

GPT OSS 120B is developed by OpenAI and Nemotron 3 Super (120B A12B) is developed by NVIDIA.