GPT OSS 120B High
Overview
GPT-OSS-120B High provides enhanced reasoning capabilities with high-effort thinking for complex problems. This variant offers deeper analysis and more thorough responses compared to the base model, making it ideal for challenging tasks that require extended reasoning. Activates 5.1B parameters per forward pass and is optimized to run on a single H100 GPU with native MXFP4 quantization.
GPT OSS 120B High was released on August 5, 2025. API access is available through OpenAI.
Performance
Timeline
Specifications
Benchmarks
GPT OSS 120B High Performance Across Datasets
Scores sourced from the model's scorecard, paper, or official blog posts
Pricing
Pricing, performance, and capabilities for GPT OSS 120B High across different providers:
| Provider | Input ($/M) | Output ($/M) | Max Input | Max Output | Latency (s) | Throughput | Quantization | Input | Output |
|---|---|---|---|---|---|---|---|---|---|
OpenAI | $0.10 | $0.50 | 131.1K | 131.1K | 6.5 | 100.0 c/s | — | Text Image Audio Video | Text Image Audio Video |
API Access
API Access Coming Soon
API access for GPT OSS 120B High will be available soon through our gateway.
Recent Posts
Recent Reviews
FAQ
Common questions about GPT OSS 120B High