Model Comparison

Gemini 2.5 Pro Preview 06-05 vs o3

Gemini 2.5 Pro Preview 06-05 scores higher than o3 on 5 of the 7 benchmarks the two models share.

Performance Benchmarks

Comparative analysis across standard metrics

7 benchmarks

Gemini 2.5 Pro Preview 06-05 outperforms o3 on 5 benchmarks (Aider-Polyglot, AIME 2025, GPQA, Humanity's Last Exam, VideoMMMU), while o3 leads on 2 (MMMU, SWE-Bench Verified).

Sat May 02 2026 • llm-stats.com

Context Window

Maximum input and output token capacity

Only Gemini 2.5 Pro Preview 06-05 specifies input context (1,048,576 tokens). Only Gemini 2.5 Pro Preview 06-05 specifies output context (65,535 tokens).

Gemini 2.5 Pro Preview 06-05

Input: 1,048,576 tokens

Output: 65,535 tokens

o3

Input: not specified

Output: not specified
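As a rough illustration of what the input limit means in practice, the sketch below estimates whether a prompt fits within the 1,048,576-token (2^20) input window listed above. The 4-characters-per-token ratio and the function names are assumptions made for illustration only; a real check would use the provider's own tokenizer or token-counting endpoint.

```python
# Limit copied from the Context Window figures above.
GEMINI_INPUT_LIMIT = 1_048_576  # tokens (2**20)

# ASSUMPTION: ~4 characters per token is a common rule of thumb for English
# text. It is not a real tokenizer and can be far off for code or other languages.
CHARS_PER_TOKEN = 4


def estimate_tokens(text: str) -> int:
    """Estimate the token count of `text` using the character heuristic."""
    return -(-len(text) // CHARS_PER_TOKEN)  # ceiling division


def fits_input(text: str, limit: int = GEMINI_INPUT_LIMIT) -> bool:
    """Return True if the estimated token count is within `limit`."""
    return estimate_tokens(text) <= limit


if __name__ == "__main__":
    prompt = "x" * 4_000_000  # hypothetical 4-million-character document
    print(estimate_tokens(prompt), fits_input(prompt))
```

Because o3's limits are not specified on this page, the same check cannot be run for it without a figure from another source.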

Input Capabilities

Supported data types and modalities

Both Gemini 2.5 Pro Preview 06-05 and o3 are listed as supporting multimodal input: text, images, audio, and video.

Gemini 2.5 Pro Preview 06-05

Text

Images

Audio

Video

o3

Text

Images

Audio

Video

License

Usage and distribution terms

Both models are licensed under proprietary licenses.

Both models have usage restrictions defined by their respective organizations.

Gemini 2.5 Pro Preview 06-05

Proprietary

Closed source

o3

Proprietary

Closed source

Release Timeline

When each model was launched

Gemini 2.5 Pro Preview 06-05 was released on 2025-06-05, while o3 was released on 2025-04-16.

Gemini 2.5 Pro Preview 06-05 is about seven weeks (50 days) newer than o3.

Gemini 2.5 Pro Preview 06-05

Jun 5, 2025

11 months ago

1 mo newer

o3

Apr 16, 2025

1.0 years ago
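The gap between the two release dates can be checked with a few lines of date arithmetic. This is a minimal sketch using only the dates stated above; the helper name `whole_months` is illustrative, and it counts completed calendar months, which is one of several reasonable conventions.

```python
from datetime import date

# Release dates as stated in the Release Timeline above.
GEMINI_RELEASE = date(2025, 6, 5)
O3_RELEASE = date(2025, 4, 16)


def whole_months(start: date, end: date) -> int:
    """Number of completed calendar months between `start` and `end`."""
    months = (end.year - start.year) * 12 + (end.month - start.month)
    if end.day < start.day:
        months -= 1  # the final month is not yet complete
    return months


gap_days = (GEMINI_RELEASE - O3_RELEASE).days
gap_months = whole_months(O3_RELEASE, GEMINI_RELEASE)

print(f"{gap_days} days, {gap_months} completed month(s)")
# prints "50 days, 1 completed month(s)"
```

Under this convention the gap is one completed month, or roughly seven weeks, rather than two months.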

Knowledge Cutoff

When training data ends

Gemini 2.5 Pro Preview 06-05 has a knowledge cutoff of 2025-01-31, while o3 has a cutoff of 2024-05-31.

Its training data therefore extends 8 months later than o3's, so it may be better informed about events between June 2024 and January 2025.

Gemini 2.5 Pro Preview 06-05

Jan 2025

8 mo newer

o3

May 2024

Key Takeaways

Gemini 2.5 Pro Preview 06-05

Google

Larger context window (1,048,576 tokens)

Higher Aider-Polyglot score (82.2% vs 81.3%)

Higher AIME 2025 score (88.0% vs 86.4%)

Higher GPQA score (86.4% vs 83.3%)

Higher Humanity's Last Exam score (21.6% vs 14.7%)

Higher VideoMMMU score (83.6% vs 83.3%)

o3

OpenAI

Higher MMMU score (82.9% vs 82.0%)

Higher SWE-Bench Verified score (69.1% vs 67.2%)
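The win counts and margins in these takeaways can be reproduced from the listed scores. The sketch below hard-codes the seven score pairs from this page; the variable and function names are illustrative and not part of any llm-stats.com API.

```python
# Scores in percent, copied from the takeaways above as (Gemini, o3).
SCORES = {
    "Aider-Polyglot": (82.2, 81.3),
    "AIME 2025": (88.0, 86.4),
    "GPQA": (86.4, 83.3),
    "Humanity's Last Exam": (21.6, 14.7),
    "VideoMMMU": (83.6, 83.3),
    "MMMU": (82.0, 82.9),
    "SWE-Bench Verified": (67.2, 69.1),
}


def tally(scores):
    """Return (gemini_wins, o3_wins, margins).

    Margins are Gemini minus o3, in percentage points, rounded to one decimal.
    """
    margins = {name: round(g - o, 1) for name, (g, o) in scores.items()}
    gemini_wins = [name for name, m in margins.items() if m > 0]
    o3_wins = [name for name, m in margins.items() if m < 0]
    return gemini_wins, o3_wins, margins


gemini_wins, o3_wins, margins = tally(SCORES)

# List benchmarks from the largest gap to the smallest.
for name, m in sorted(margins.items(), key=lambda kv: -abs(kv[1])):
    print(f"{name:22s} {m:+.1f} pts")
print(f"Gemini leads on {len(gemini_wins)}, o3 leads on {len(o3_wins)}")
```

Sorting by absolute margin shows that the largest gap is on Humanity's Last Exam (6.9 points), while the VideoMMMU, Aider-Polyglot, and MMMU gaps are each under one point.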

Detailed Comparison

AI Model Comparison Table
Feature	Gemini 2.5 Pro Preview 06-05	o3

FAQ

Common questions about Gemini 2.5 Pro Preview 06-05 vs o3.

Which is better, Gemini 2.5 Pro Preview 06-05 or o3?

Gemini 2.5 Pro Preview 06-05 scores higher on 5 of the 7 benchmarks compared here, though some of its leads are under one percentage point. Gemini 2.5 Pro Preview 06-05 is made by Google and o3 is made by OpenAI. The best choice depends on your use case, so compare the benchmark scores and capabilities above.

How does Gemini 2.5 Pro Preview 06-05 compare to o3 in benchmarks?

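Listed scores for Gemini 2.5 Pro Preview 06-05 include Global-MMLU-Lite (89.2%), AIME 2025 (88.0%), FACTS Grounding (87.8%), GPQA (86.4%), and VideoMMMU (83.6%). Listed scores for o3 include COLLIE (98.4%), AIME 2024 (91.6%), ARC-AGI (88.0%), MathVista (86.8%), and AIME 2025 (86.4%). Only benchmarks reported for both models, such as AIME 2025, can be compared directly.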

What are the context window sizes for Gemini 2.5 Pro Preview 06-05 and o3?

Gemini 2.5 Pro Preview 06-05 supports 1,048,576 input tokens (about 1.0M). No context window size is specified here for o3. A larger context window lets you process longer documents, conversations, or codebases in a single request.

Who makes Gemini 2.5 Pro Preview 06-05 and o3?

Gemini 2.5 Pro Preview 06-05 is developed by Google and o3 is developed by OpenAI.