- Organizations
- OpenAI
- o3-mini
o3-mini: Benchmarks, Pricing & Context Window
o3-mini is a language model from OpenAI, released in January 2025.
A smaller variant of O3, expected to offer enhanced multimodal capabilities, improved reasoning, and more efficient resource utilization compared to previous models while maintaining strong performance on core tasks.
o3-mini pricing
Providers
o3-mini starts at $1.10 per million input tokens and $4.40 per million output tokens via Azure. See all 2 providers below with their per-token pricing, latency, throughput, and modality support.
| Provider | Input $/M | Output $/M | Max Input | Max Output | Latency s | Throughput | Quant | Input | Output |
|---|---|---|---|---|---|---|---|---|---|
| $1.10 | $4.40 | 200.0K | 100.0K | 5.20 | 115 c/s | — | |||
| $1.10 | $4.40 | 200.0K | 100.0K | 5.20 | 115 c/s | — |
o3-mini API
API access coming soon
o3-mini will be available through our gateway shortly.
o3-mini examples
Recent arena outputs from o3-mini, picked from the highest-ranked matchups.
o3-mini license
o3-mini is released under the Proprietary license, which restricts commercial use, has a knowledge cutoff of September 2023.
- License
- Proprietary
- Non-commercial
- Knowledge cutoff
- September 2023
Proprietary license - usage restrictions apply
FAQ
Common questions about o3-mini.