- Organizations
- Gemini 3.1 Flash-Lite
Gemini 3.1 Flash-Lite: Benchmarks, Pricing & Context Window
Gemini 3.1 Flash-Lite is a language model from Google, released in March 2026, with multimodal input.
Gemini 3.1 Flash-Lite is the first Flash-Lite model in the Gemini 3 series. It is optimized for high-volume, latency-sensitive tasks like translation, content moderation, and classification. It delivers enhanced performance at a fraction
Gemini 3.1 Flash-Lite pricing
Providers
Gemini 3.1 Flash-Lite starts at $0.250 per million input tokens and $1.50 per million output tokens via Google.
| Provider | Input $/M | Output $/M | Max Input | Max Output | Latency s | Throughput | Quant | Input | Output |
|---|---|---|---|---|---|---|---|---|---|
| $0.250 | $1.50 | 1.0M | 65.5K | — | — | — |
Gemini 3.1 Flash-Lite API
API access coming soon
Gemini 3.1 Flash-Lite will be available through our gateway shortly.
Gemini 3.1 Flash-Lite examples
Recent arena outputs from Gemini 3.1 Flash-Lite, picked from the highest-ranked matchups.
Gemini 3.1 Flash-Lite license
Gemini 3.1 Flash-Lite is released under the Proprietary license, which restricts commercial use, has a knowledge cutoff of January 2025.
- License
- Proprietary
- Non-commercial
- Knowledge cutoff
- January 2025
Proprietary license - usage restrictions apply
FAQ
Common questions about Gemini 3.1 Flash-Lite.