Google logo

Gemini 2.5 Flash

Overview

Overview

A thinking model designed for a balance between price and performance. It builds upon Gemini 2.0 Flash with upgraded reasoning, hybrid thinking control, multimodal capabilities (text, image, video, audio input), and a 1M token input context window.

Gemini 2.5 Flash was released on May 20, 2025. API access is available through Google.

Performance

Timeline

ReleasedUnknown
Knowledge CutoffUnknown

Specifications

Parameters
Unknown
License
Proprietary
Training Data
Unknown

Benchmarks

Benchmarks

Gemini 2.5 Flash Performance Across Datasets

Scores sourced from the model's scorecard, paper, or official blog posts

LLM Stats Logollm-stats.com - Sun Jan 25 2026
Notice missing or incorrect data?Start an Issue discussion

Pricing

Pricing

Pricing, performance, and capabilities for Gemini 2.5 Flash across different providers:

ProviderInput ($/M)Output ($/M)Max InputMax OutputLatency (s)ThroughputQuantizationInputOutput
Google logo
Google
$0.30$2.501.0M65.5K
0.7
85.0 c/s
Text
Image
Audio
Video
Text
Image
Audio
Video

API Access

API Access Coming Soon

API access for Gemini 2.5 Flash will be available soon through our gateway.

Recent Posts

Recent Reviews

FAQ

Common questions about Gemini 2.5 Flash

Gemini 2.5 Flash was released on May 20, 2025 by Google.
Gemini 2.5 Flash was created by Google.
Gemini 2.5 Flash is released under the Proprietary license.
Gemini 2.5 Flash has a knowledge cutoff of January 2025. This means the model was trained on data up to this date and may not have information about events after this time.
Yes, Gemini 2.5 Flash is a multimodal model that can process both text and images as input.