Gemini 2.5 Flash
Overview
A thinking model designed for a balance between price and performance. It builds upon Gemini 2.0 Flash with upgraded reasoning, hybrid thinking control, multimodal capabilities (text, image, video, audio input), and a 1M token input context window.
Gemini 2.5 Flash was released on May 20, 2025. API access is available through Google.
Performance
Timeline
ReleasedUnknown
Knowledge CutoffUnknown
Specifications
Parameters
Unknown
License
Proprietary
Training Data
Unknown
Benchmarks
Gemini 2.5 Flash Performance Across Datasets
Scores sourced from the model's scorecard, paper, or official blog posts
Notice missing or incorrect data?Start an Issue discussion→
Pricing
Pricing, performance, and capabilities for Gemini 2.5 Flash across different providers:
| Provider | Input ($/M) | Output ($/M) | Max Input | Max Output | Latency (s) | Throughput | Quantization | Input | Output |
|---|---|---|---|---|---|---|---|---|---|
Google | $0.30 | $2.50 | 1.0M | 65.5K | 0.7 | 85.0 tok/s | — | Text Image Audio Video | Text Image Audio Video |
API Access
API Access Coming Soon
API access for Gemini 2.5 Flash will be available soon through our gateway.
Recent Posts
Recent Reviews
FAQ
Common questions about Gemini 2.5 Flash
Gemini 2.5 Flash was released on May 20, 2025.
