OpenAI logo

GPT OSS 20B

Overview

Overview

The gpt-oss-20b model (technically 20.9B parameters) achieves near-parity with OpenAI o4-mini on core reasoning benchmarks, while running efficiently on a single 80 GB GPU. The gpt-oss-20b model delivers similar results to OpenAI o3‑mini on common benchmarks and can run on edge devices with just 16 GB of memory, making it ideal for on-device use cases, local inference, or rapid iteration without costly infrastructure. Both models also perform strongly on tool use, few-shot function calling, CoT reasoning (as seen in results on the Tau-Bench agentic evaluation suite) and HealthBench (even outperforming proprietary models like OpenAI o1 and GPT‑4o). Note: While referred to as '20b' for simplicity, it technically has 20.9B parameters.

GPT OSS 20B was released on August 5, 2025. API access is available through 4 providers, including Novita, Fireworks and others.

Performance

Timeline

ReleasedUnknown
Knowledge CutoffUnknown

Specifications

Parameters
20.9B
License
Apache 2.0
Training Data
Unknown

Benchmarks

Benchmarks

GPT OSS 20B Performance Across Datasets

Scores sourced from the model's scorecard, paper, or official blog posts

LLM Stats Logollm-stats.com - Mon Feb 09 2026
Notice missing or incorrect data?Start an Issue discussion

Pricing

Pricing

Pricing, performance, and capabilities for GPT OSS 20B across different providers:

ProviderInput ($/M)Output ($/M)Max InputMax OutputLatency (s)ThroughputQuantizationInputOutput
Novita logo
Novitabf16
$0.05$0.20131.1K32.8K
bf16
Text
Image
Audio
Video
Text
Image
Audio
Video
Fireworks logo
Fireworks
$0.10$0.50131.0K30.0K
Text
Image
Audio
Video
Text
Image
Audio
Video
Groq logo
Groq
$0.10$0.50131.0K30.0K
0.38
1000 c/s
Text
Image
Audio
Video
Text
Image
Audio
Video
OpenAI logo
OpenAI
$0.10$0.50131.1K131.1K
5.2
115.0 c/s
Text
Image
Audio
Video
Text
Image
Audio
Video

Price Comparison for GPT OSS 20B

Price per 1M input tokens (USD), lower is better

LLM Stats Logollm-stats.com - Mon Feb 09 2026

Throughput Comparison for GPT OSS 20B

Tokens per second, higher is better

LLM Stats Logollm-stats.com - Mon Feb 09 2026

Latency Comparison for GPT OSS 20B

Time to first token (s), lower is better

LLM Stats Logollm-stats.com - Mon Feb 09 2026

API Access

API Access Coming Soon

API access for GPT OSS 20B will be available soon through our gateway.

Recent Posts

Recent Reviews

FAQ

Common questions about GPT OSS 20B

GPT OSS 20B was released on August 5, 2025 by OpenAI.
GPT OSS 20B was created by OpenAI.
GPT OSS 20B has 20.9 billion parameters.
GPT OSS 20B is released under the Apache 2.0 license. This is an open-source/open-weight license.