Gemini 2.5 Flash-Lite: Benchmarks, Pricing & Context Window
Gemini 2.5 Flash-Lite is a language model from Google, released in June 2025, with multimodal input.
Gemini 2.5 Flash-Lite is a model developed by Google DeepMind, designed to handle various tasks including reasoning, science, mathematics, code generation, and more. It features advanced capabilities in multilingual performance and long
Gemini 2.5 Flash-Lite pricing
Providers
Gemini 2.5 Flash-Lite starts at $0.100 per million input tokens and $0.400 per million output tokens via Google.
| Provider | Input $/M | Output $/M | Max Input | Max Output | Latency p95 (s) | Throughput p95 | Quant |
|---|---|---|---|---|---|---|---|
| Google | $0.100 | $0.400 | 1.0M | 65.5K | 0.44 | 6 c/s | — |
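The per-million-token rates listed above translate into a per-request cost by simple proportion. The following is a minimal sketch of that arithmetic, assuming only the listed Google rates ($0.100 input, $0.400 output). The helper name `estimate_cost_usd` and the sample token counts are hypothetical and not part of any official SDK, and actual billing may differ (for example through caching or tiered pricing).

```python
from decimal import Decimal

# Listed rates from the pricing table, in USD per million tokens.
INPUT_USD_PER_M = Decimal("0.100")
OUTPUT_USD_PER_M = Decimal("0.400")
TOKENS_PER_M = Decimal(1_000_000)


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the cost of one request at the listed rates.

    Hypothetical helper for illustration only; it ignores any
    discounts, caching, or surcharges a provider may apply.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    input_cost = Decimal(input_tokens) * INPUT_USD_PER_M
    output_cost = Decimal(output_tokens) * OUTPUT_USD_PER_M
    return (input_cost + output_cost) / TOKENS_PER_M


# Example: a request with 200,000 input tokens and 10,000 output tokens.
print(estimate_cost_usd(200_000, 10_000))
```

`Decimal` is used instead of `float` so that small per-request amounts add up without binary rounding drift when summed over many requests.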
Gemini 2.5 Flash-Lite API
API access coming soon
Gemini 2.5 Flash-Lite will be available through our gateway shortly.
Gemini 2.5 Flash-Lite examples
Recent arena outputs from Gemini 2.5 Flash-Lite, picked from the highest-ranked matchups.
Gemini 2.5 Flash-Lite license
Gemini 2.5 Flash-Lite is listed under the Creative Commons Attribution 4.0 License and has a knowledge cutoff of January 2025. The listing flags the license as non-commercial, although CC BY 4.0 itself permits commercial use with attribution.
- License: Creative Commons Attribution 4.0 License (listed as non-commercial)
- Knowledge cutoff: January 2025
FAQ
Common questions about Gemini 2.5 Flash-Lite.