Gemini 3 Flash
Overview
Overview
Gemini 3 Flash offers frontier intelligence built for speed at a fraction of the cost. It combines Gemini 3's Pro-grade reasoning with Flash-level latency, efficiency and cost. Features a 1 million-token input context window and is optimized for agentic workflows, coding, and complex analysis.
Gemini 3 Flash was released on December 17, 2025. API access is available through Google.
Performance
Timeline
ReleasedUnknown
Knowledge CutoffUnknown
Specifications
Parameters
Unknown
License
Proprietary
Training Data
Unknown
Benchmarks
Benchmarks
Gemini 3 Flash Performance Across Datasets
Scores sourced from the model's scorecard, paper, or official blog posts
Notice missing or incorrect data?Start an Issue discussion→
Pricing
Pricing
Pricing, performance, and capabilities for Gemini 3 Flash across different providers:
| Provider | Input ($/M) | Output ($/M) | Max Input | Max Output | Latency (s) | Throughput | Quantization | Input | Output |
|---|---|---|---|---|---|---|---|---|---|
Google | $0.50 | $3.00 | 1.0M | 65.5K | — | — | — | Text Image Audio Video | Text Image Audio Video |
API Access
API Access Coming Soon
API access for Gemini 3 Flash will be available soon through our gateway.
Recent Posts
Recent Reviews
Blog Posts
FAQ
Common questions about Gemini 3 Flash
Gemini 3 Flash was released on December 17, 2025 by Google.
Gemini 3 Flash was created by Google.
Gemini 3 Flash is released under the Proprietary license.
Gemini 3 Flash has a knowledge cutoff of January 2025. This means the model was trained on data up to this date and may not have information about events after this time.
Yes, Gemini 3 Flash is a multimodal model that can process both text and images as input.
