Gemini 2.5 Flash
Google
Overview
A thinking model designed for a balance between price and performance. It builds upon Gemini 2.0 Flash with upgraded reasoning, hybrid thinking control, multimodal capabilities (text, image, video, audio input), and a 1M token input context window.
Gemini 2.5 Flash was released on May 20, 2025. API access is available through Google.
Performance
Compare Gemini 2.5 Flash to other models by quality (GPQA score) vs cost. Higher scores and lower costs represent better value.
Benchmarks
Gemini 2.5 Flash Performance Across Datasets
Scores sourced from the model's scorecard, paper, or official blog posts
Pricing
Pricing, performance, and capabilities for Gemini 2.5 Flash across different providers:
| Provider | Input ($/M tokens) | Output ($/M tokens) | Max Input (tokens) | Max Output (tokens) | Latency (s) | Throughput (tok/s) | Quantization | Input Modalities | Output Modalities |
|---|---|---|---|---|---|---|---|---|---|
| Google | $0.30 | $2.50 | 1.0M | 65.5K | 0.7 | 85.0 | — | Text, Image, Audio, Video | Text, Image, Audio, Video |
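As a rough illustration of how to read the Google row, the sketch below turns the per-million-token prices into a per-request cost and combines latency with throughput into an approximate response time. The function names are hypothetical. It also assumes that the listed latency is time to first token and that throughput stays constant during generation, neither of which the table states.

```python
# Figures copied from the Google row of the pricing table.
INPUT_PRICE_PER_M = 0.30      # USD per 1M input tokens
OUTPUT_PRICE_PER_M = 2.50     # USD per 1M output tokens
LATENCY_S = 0.7               # assumed to mean time to first token
THROUGHPUT_TOK_PER_S = 85.0   # assumed constant while generating


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Return the approximate request cost in USD (hypothetical helper)."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    input_cost = input_tokens / 1_000_000 * INPUT_PRICE_PER_M
    output_cost = output_tokens / 1_000_000 * OUTPUT_PRICE_PER_M
    return input_cost + output_cost


def estimate_seconds(output_tokens: int) -> float:
    """Return the approximate wall-clock time to receive the full response."""
    if output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    return LATENCY_S + output_tokens / THROUGHPUT_TOK_PER_S


if __name__ == "__main__":
    # Example request: 10,000 input tokens and 1,000 output tokens.
    print(f"cost: ${estimate_cost_usd(10_000, 1_000):.4f}")
    print(f"time: {estimate_seconds(1_000):.1f} s")
```

Because output tokens cost roughly eight times as much as input tokens at these rates, long generations dominate the bill even when the prompt is large.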
API Access
API Access Coming Soon
API access for Gemini 2.5 Flash will be available soon through our gateway.
