- Organizations
- Gemini 2.5 Flash
Gemini 2.5 Flash: Benchmarks, Pricing & Context Window
Gemini 2.5 Flash is a language model from Google, released in May 2025, with multimodal input.
A thinking model designed for a balance between price and performance. It builds upon Gemini 2.0 Flash with upgraded reasoning, hybrid thinking control, multimodal capabilities (text, image, video, audio input), and a 1M token input
Gemini 2.5 Flash pricing
Providers
Gemini 2.5 Flash starts at $0.300 per million input tokens and $2.50 per million output tokens via Google.
| Provider | Input $/M | Output $/M | Max Input | Max Output | Latency s | Throughput | Quant | Input | Output |
|---|---|---|---|---|---|---|---|---|---|
| $0.300 | $2.50 | 1.0M | 65.5K | 3.83 | 35 c/s | — |
Gemini 2.5 Flash API
API access coming soon
Gemini 2.5 Flash will be available through our gateway shortly.
Gemini 2.5 Flash examples
Recent arena outputs from Gemini 2.5 Flash, picked from the highest-ranked matchups.
Gemini 2.5 Flash license
Gemini 2.5 Flash is released under the Proprietary license, which restricts commercial use, has a knowledge cutoff of January 2025.
- License
- Proprietary
- Non-commercial
- Knowledge cutoff
- January 2025
Proprietary license - usage restrictions apply
FAQ
Common questions about Gemini 2.5 Flash.