Gemini 2.5 Flash
Overview
Gemini 2.5 Flash is a thinking model designed to balance price and performance. It builds on Gemini 2.0 Flash with upgraded reasoning, hybrid thinking control, multimodal input (text, image, video, audio), and a 1M-token input context window.
Gemini 2.5 Flash was released on May 20, 2025. API access is available through Google.
Performance
Timeline
Released: May 20, 2025
Knowledge Cutoff: January 2025
Specifications
Parameters: Unknown
License: Proprietary
Training Data: Unknown
Benchmarks
Gemini 2.5 Flash Performance Across Datasets
Scores sourced from the model's scorecard, paper, or official blog posts
Pricing
Pricing, performance, and capabilities for Gemini 2.5 Flash across different providers:
| Provider | Input ($/M tokens) | Output ($/M tokens) | Max Input (tokens) | Max Output (tokens) | Latency (s) | Throughput | Quantization | Input Modalities | Output Modalities |
|---|---|---|---|---|---|---|---|---|---|
| Google | $0.30 | $2.50 | 1.0M | 65.5K | 0.7 | 85.0 c/s | N/A | Text, Image, Audio, Video | Text, Image, Audio, Video |
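To make the per-million-token prices concrete, here is a minimal Python sketch that estimates the cost of a single request from the Google row above. The function name `estimate_cost` and the example token counts are hypothetical, chosen only for illustration. The sketch assumes flat pricing and does not model any billing tiers or discounts that the table does not list.

```python
# Prices copied from the Google row of the pricing table (USD per million tokens).
INPUT_PRICE_PER_M = 0.30
OUTPUT_PRICE_PER_M = 2.50


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Return the estimated request cost in USD, assuming flat per-token pricing."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    input_cost = input_tokens * INPUT_PRICE_PER_M / 1_000_000
    output_cost = output_tokens * OUTPUT_PRICE_PER_M / 1_000_000
    return input_cost + output_cost


if __name__ == "__main__":
    # Hypothetical request: 100,000 input tokens and 2,000 output tokens.
    cost = estimate_cost(100_000, 2_000)
    print(f"Estimated cost: ${cost:.4f}")  # prints "Estimated cost: $0.0350"
```

Because output tokens cost roughly eight times as much as input tokens at these rates, long responses tend to dominate the bill even when the prompt is large.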
API Access
API access for Gemini 2.5 Flash will be available soon through our gateway.
FAQ
Common questions about Gemini 2.5 Flash
When was Gemini 2.5 Flash released?
Gemini 2.5 Flash was released on May 20, 2025 by Google.
Who created Gemini 2.5 Flash?
Gemini 2.5 Flash was created by Google.
What license is Gemini 2.5 Flash released under?
Gemini 2.5 Flash is released under a proprietary license.
What is the knowledge cutoff of Gemini 2.5 Flash?
Gemini 2.5 Flash has a knowledge cutoff of January 2025. The model was trained on data up to this date and may not have information about later events.
Is Gemini 2.5 Flash multimodal?
Yes, Gemini 2.5 Flash is a multimodal model that accepts text, image, audio, and video input.