Google logo

Gemini 2.5 Flash-Lite

Overview

Overview

Gemini 2.5 Flash-Lite is a model developed by Google DeepMind, designed to handle various tasks including reasoning, science, mathematics, code generation, and more. It features advanced capabilities in multilingual performance and long context understanding. It is optimized for low latency use cases, supporting multimodal input with a 1 million-token context length.

Gemini 2.5 Flash-Lite was released on June 17, 2025. API access is available through Google.

Performance

Timeline

ReleasedUnknown
Knowledge CutoffUnknown

Specifications

Parameters
Unknown
License
Creative Commons Attribution 4.0 License
Training Data
Unknown

Benchmarks

Benchmarks

Gemini 2.5 Flash-Lite Performance Across Datasets

Scores sourced from the model's scorecard, paper, or official blog posts

LLM Stats Logollm-stats.com - Thu Feb 19 2026
Notice missing or incorrect data?Start an Issue discussion

Pricing

Pricing

Pricing, performance, and capabilities for Gemini 2.5 Flash-Lite across different providers:

ProviderInput ($/M)Output ($/M)Max InputMax OutputLatency (s)ThroughputQuantizationInputOutput
Google logo
Google
$0.10$0.401.0M65.5K
0.44
5.69 c/s
Text
Image
Audio
Video
Text
Image
Audio
Video

API Access

API Access Coming Soon

API access for Gemini 2.5 Flash-Lite will be available soon through our gateway.

Recent Posts

Recent Reviews

FAQ

Common questions about Gemini 2.5 Flash-Lite

Gemini 2.5 Flash-Lite was released on June 17, 2025 by Google.
Gemini 2.5 Flash-Lite was created by Google.
Gemini 2.5 Flash-Lite is released under the Creative Commons Attribution 4.0 License license.
Gemini 2.5 Flash-Lite has a knowledge cutoff of January 2025. This means the model was trained on data up to this date and may not have information about events after this time.
Yes, Gemini 2.5 Flash-Lite is a multimodal model that can process both text and images as input.