GoogleReleased on Mar 3, 2026

Gemini 3.1 Flash-Lite: Benchmarks, Pricing & Context Window

Gemini 3.1 Flash-Lite is a language model from Google, released in March 2026, with multimodal input.

Gemini 3.1 Flash-Lite is the first Flash-Lite model in the Gemini 3 series. It is optimized for high-volume, latency-sensitive tasks like translation, content moderation, and classification. It delivers enhanced performance at a fraction

Input
TextImageAudioVideo
Output
Text

Gemini 3.1 Flash-Lite pricing

Providers

Gemini 3.1 Flash-Lite starts at $0.250 per million input tokens and $1.50 per million output tokens via Google.

ProviderInput $/MOutput $/MMax InputMax OutputLatency sThroughputQuantInputOutput
Google logoGoogle
$0.250$1.501.0M65.5K

Gemini 3.1 Flash-Lite API

API access coming soon

Gemini 3.1 Flash-Lite will be available through our gateway shortly.

Gemini 3.1 Flash-Lite examples

Recent arena outputs from Gemini 3.1 Flash-Lite, picked from the highest-ranked matchups.

Gemini 3.1 Flash-Lite license

Gemini 3.1 Flash-Lite is released under the Proprietary license, which restricts commercial use, has a knowledge cutoff of January 2025.

License
Proprietary
Non-commercial
Knowledge cutoff
January 2025

Proprietary license - usage restrictions apply

FAQ

Common questions about Gemini 3.1 Flash-Lite.

What is the Gemini 3.1 Flash-Lite release date?

Gemini 3.1 Flash-Lite was released on March 3, 2026 by Google.

Who created Gemini 3.1 Flash-Lite?

Gemini 3.1 Flash-Lite was created by Google.

What is the license for Gemini 3.1 Flash-Lite?

Gemini 3.1 Flash-Lite is released under the Proprietary license.

What is the knowledge cutoff date for Gemini 3.1 Flash-Lite?

Gemini 3.1 Flash-Lite has a knowledge cutoff of January 2025. This means the model was trained on data up to this date and may not have information about events after this time.

Is Gemini 3.1 Flash-Lite multimodal?

Yes, Gemini 3.1 Flash-Lite is a multimodal model that can process both text and images as input.