OpenAI logo

GPT OSS 120B

Overview

GPT-OSS-120B is an open-weight, 116.8B-parameter Mixture-of-Experts (MoE) language model from OpenAI designed for high-reasoning, agentic, and general-purpose production use cases. It activates 5.1B parameters per forward pass and is optimized to run on a single H100 GPU with native MXFP4 quantization. The model supports configurable reasoning depth, full chain-of-thought access, and native tool use, including function calling, browsing, and structured output generation. It achieves near-parity with OpenAI o4-mini on core reasoning benchmarks. Note: While referred to as '120b' for simplicity, it technically has 116.8B parameters.

GPT OSS 120B was released on August 5, 2025. API access is available through 5 providers, including DeepInfra, Novita and others.

Performance

Timeline

ReleasedUnknown
Knowledge CutoffUnknown

Specifications

Parameters
116.8B
License
Apache 2.0
Training Data
Unknown

Benchmarks

GPT OSS 120B Performance Across Datasets

Scores sourced from the model's scorecard, paper, or official blog posts

LLM Stats Logollm-stats.com - Sat Dec 27 2025
Notice missing or incorrect data?Start an Issue discussion

Pricing

Pricing, performance, and capabilities for GPT OSS 120B across different providers:

ProviderInput ($/M)Output ($/M)Max InputMax OutputLatency (s)ThroughputQuantizationInputOutput
DeepInfra logo
DeepInfraint4
$0.09$0.45131.1K131.1Kint4
Text
Image
Audio
Video
Text
Image
Audio
Video
Novita logo
Novitabf16
$0.10$0.50131.1K131.1Kbf16
Text
Image
Audio
Video
Text
Image
Audio
Video
OpenAI logo
OpenAI
$0.10$0.50131.1K131.1K5.2115.0 tok/s
Text
Image
Audio
Video
Text
Image
Audio
Video
Fireworks logo
Fireworks
$0.15$0.60131.0K30.0K
Text
Image
Audio
Video
Text
Image
Audio
Video
Groq logo
Groq
$0.15$0.60131.0K30.0K0.5500 tok/s
Text
Image
Audio
Video
Text
Image
Audio
Video

Price Comparison for GPT OSS 120B

Price per 1M input tokens (USD), lower is better

LLM Stats Logollm-stats.com - Sat Dec 27 2025

Throughput Comparison for GPT OSS 120B

Tokens per second, higher is better

LLM Stats Logollm-stats.com - Sat Dec 27 2025

Latency Comparison for GPT OSS 120B

Time to first token (s), lower is better

LLM Stats Logollm-stats.com - Sat Dec 27 2025

GPT OSS 120B API Providers: Price vs Throughput

API Access

API Access Coming Soon

API access for GPT OSS 120B will be available soon through our gateway.

Recent Posts

Recent Reviews

FAQ

Common questions about GPT OSS 120B

GPT OSS 120B was released on August 5, 2025.
GPT OSS 120B has 116.8 billion parameters.