- Organizations
- OpenAI
- GPT OSS 20B
GPT OSS 20B: Benchmarks, Pricing & Context Window
GPT OSS 20B is a language model from OpenAI, released in August 2025.
The gpt-oss-20b model (technically 20.9B parameters) achieves near-parity with OpenAI o4-mini on core reasoning benchmarks, while running efficiently on a single 80 GB GPU. The gpt-oss-20b model delivers similar results to OpenAI o3‑mini
GPT OSS 20B pricing
Providers
GPT OSS 20B starts at $0.0500 per million input tokens and $0.200 per million output tokens via Novita. See all 4 providers below with their per-token pricing, latency, throughput, and modality support.
| Provider | Input $/M | Output $/M | Max Input | Max Output | Latency s | Throughput | Quant | Input | Output |
|---|---|---|---|---|---|---|---|---|---|
| $0.0500 | $0.200 | 131.1K | 32.8K | — | — | bf16 | |||
| $0.100 | $0.500 | 131.0K | 30.0K | — | — | — | |||
| $0.100 | $0.500 | 131.0K | 30.0K | 0.38 | 1000 c/s | — | |||
| $0.100 | $0.500 | 131.1K | 131.1K | 5.20 | 115 c/s | — |
GPT OSS 20B API
API access coming soon
GPT OSS 20B will be available through our gateway shortly.
GPT OSS 20B examples
Recent arena outputs from GPT OSS 20B, picked from the highest-ranked matchups.
GPT OSS 20B license
GPT OSS 20B is released under the Apache 2.0 license, which permits commercial use, has 20.9B parameters.
- License
- Apache 2.0
- Commercial use allowed
- Parameters
- 20.9B
Apache License 2.0 - allows commercial use
FAQ
Common questions about GPT OSS 20B.