Gemini 3 Flash
Overview
Gemini 3 Flash offers frontier intelligence built for speed at a fraction of the cost. It combines Gemini 3's Pro-grade reasoning with Flash-level latency, efficiency and cost. Features a 1 million-token input context window and is optimized for agentic workflows, coding, and complex analysis.
Gemini 3 Flash was released on December 17, 2025. API access is available through Google.
Performance
Timeline
Other Details
Related Models
Compare Gemini 3 Flash to other models by quality (GPQA score) vs cost. Higher scores and lower costs represent better value.
Performance visualization loading...
Gathering benchmark data from similar models
Benchmarks
Gemini 3 Flash Performance Across Datasets
Scores sourced from the model's scorecard, paper, or official blog posts
Pricing
Pricing, performance, and capabilities for Gemini 3 Flash across different providers:
| Provider | Input ($/M) | Output ($/M) | Max Input | Max Output | Latency (s) | Throughput | Quantization | Input | Output |
|---|---|---|---|---|---|---|---|---|---|
Google | $0.50 | $3.00 | 1.0M | 65.5K | — | — | — | Text Image Audio Video | Text Image Audio Video |
Example Outputs
Recent Posts
Recent Reviews
API Access
API Access Coming Soon
API access for Gemini 3 Flash will be available soon through our gateway.
FAQ
Common questions about Gemini 3 Flash
