- Organizations
- OpenAI
- GPT-4.1 mini
GPT-4.1 mini: Benchmarks, Pricing & Context Window
GPT-4.1 mini is a language model from OpenAI, released in April 2025, with multimodal input.
GPT-4.1 mini provides a balance between intelligence, speed, and cost. It's a significant leap in small model performance, even beating GPT-4o in many benchmarks while reducing latency and cost.
GPT-4.1 mini pricing
Providers
GPT-4.1 mini starts at $0.400 per million input tokens and $1.60 per million output tokens via OpenAI.
| Provider | Input $/M | Output $/M | Max Input | Max Output | Latency p95 s | Throughput P95 | Quant | Input | Output |
|---|---|---|---|---|---|---|---|---|---|
| $0.400 | $1.60 | 1.0M | 32.8K | 2.56 | 347 c/s | — |
GPT-4.1 mini API
API access coming soon
GPT-4.1 mini will be available through our gateway shortly.
GPT-4.1 mini examples
Recent arena outputs from GPT-4.1 mini, picked from the highest-ranked matchups.
GPT-4.1 mini license
GPT-4.1 mini is released under the Proprietary license, which restricts commercial use, has a knowledge cutoff of May 2024.
- License
- Proprietary
- Non-commercial
- Knowledge cutoff
- May 2024
Proprietary license - usage restrictions apply
FAQ
Common questions about GPT-4.1 mini.