Model Comparison

Grok-3 vs GPT-5 Medium

Both models are evenly matched across the benchmarks. GPT-5 Medium is 1.7x cheaper per token.

Performance Benchmarks

Comparative analysis across standard metrics

2 benchmarks

Grok-3 outperforms in 1 benchmarks (AIME 2025), while GPT-5 Medium is better at 1 benchmark (GPQA).

Both models are evenly matched across the benchmarks.

Sat May 09 2026 • llm-stats.com

Arena Performance

Human preference votes

Pricing Analysis

Price comparison per million tokens

GPT-5 Medium costs less

For input processing, Grok-3 ($3.00/1M tokens) is 2.4x more expensive than GPT-5 Medium ($1.25/1M tokens).

For output processing, Grok-3 ($15.00/1M tokens) is 1.5x more expensive than GPT-5 Medium ($10.00/1M tokens).

In conclusion, Grok-3 is more expensive than GPT-5 Medium.*

* Using a 3:1 ratio of input to output tokens

Lowest available price from all providers
Sat May 09 2026 • llm-stats.com
xAI
Grok-3
Input tokens$3.00
Output tokens$15.00
Best providerxAI
OpenAI
GPT-5 Medium
Input tokens$1.25
Output tokens$10.00
Best providerOpenAI
Notice missing or incorrect data?Start an Issue

Context Window

Maximum input and output token capacity

GPT-5 Medium accepts 400,000 input tokens compared to Grok-3's 128,000 tokens. GPT-5 Medium can generate longer responses up to 128,000 tokens, while Grok-3 is limited to 8,000 tokens.

xAI
Grok-3
Input128,000 tokens
Output8,000 tokens
OpenAI
GPT-5 Medium
Input400,000 tokens
Output128,000 tokens
Sat May 09 2026 • llm-stats.com

Input Capabilities

Supported data types and modalities

Both Grok-3 and GPT-5 Medium support multimodal inputs.

They are both capable of processing various types of data, offering versatility in application.

Grok-3

Text
Images
Audio
Video

GPT-5 Medium

Text
Images
Audio
Video

License

Usage and distribution terms

Both models are licensed under proprietary licenses.

Both models have usage restrictions defined by their respective organizations.

Grok-3

Proprietary

Closed source

GPT-5 Medium

Proprietary

Closed source

Release Timeline

When each model was launched

Grok-3 was released on 2025-02-17, while GPT-5 Medium was released on 2025-08-07.

GPT-5 Medium is 6 months newer than Grok-3.

Grok-3

Feb 17, 2025

1.2 years ago

GPT-5 Medium

Aug 7, 2025

9 months ago

5mo newer

Knowledge Cutoff

When training data ends

Grok-3 has a knowledge cutoff of 2024-11-17, while GPT-5 Medium has a cutoff of 2024-09-30.

Grok-3 has more recent training data (up to 2024-11-17), making it potentially better informed about events through that date compared to GPT-5 Medium (2024-09-30).

Grok-3

Nov 2024

2 mo newer
GPT-5 Medium

Sep 2024

Provider Availability

Grok-3 is available from xAI. GPT-5 Medium is available from OpenAI.

Grok-3

xai logo
xAI
Input Price:Input: $3.00/1MOutput Price:Output: $15.00/1M

GPT-5 Medium

openai logo
OpenAI
Input Price:Input: $1.25/1MOutput Price:Output: $10.00/1M
* Prices shown are per million tokens

Outputs Comparison

Notice missing or incorrect data?Start an Issue discussion

Key Takeaways

Higher AIME 2025 score (93.3% vs 88.9%)
Larger context window (400,000 tokens)
Less expensive input tokens
Less expensive output tokens
Higher GPQA score (88.1% vs 84.6%)
xAIGrok-3
OpenAIGPT-5 Medium

Detailed Comparison

AI Model Comparison Table
Feature
xAI
Grok-3
OpenAI
GPT-5 Medium

FAQ

Common questions about Grok-3 vs GPT-5 Medium.

Which is better, Grok-3 or GPT-5 Medium?

Both models are evenly matched across the benchmarks. Grok-3 is made by xAI and GPT-5 Medium is made by OpenAI. The best choice depends on your use case — compare their benchmark scores, pricing, and capabilities above.

How does Grok-3 compare to GPT-5 Medium in benchmarks?

Grok-3 scores AIME 2024: 93.3%, AIME 2025: 93.3%, GPQA: 84.6%, LiveCodeBench: 79.4%, MMMU: 78.0%. GPT-5 Medium scores AIME 2025: 88.9%, GPQA: 88.1%.

Is Grok-3 cheaper than GPT-5 Medium?

GPT-5 Medium is 2.4x cheaper for input tokens. Grok-3 costs $3.00/M input and $15.00/M output via xai. GPT-5 Medium costs $1.25/M input and $10.00/M output via openai.

What are the context window sizes for Grok-3 and GPT-5 Medium?

Grok-3 supports 128K tokens and GPT-5 Medium supports 400K tokens. A larger context window lets you process longer documents, conversations, or codebases in a single request.

What are the main differences between Grok-3 and GPT-5 Medium?

Key differences include context window (128K vs 400K), input pricing ($3.00 vs $1.25/M). See the full comparison above for benchmark-by-benchmark results.

Who makes Grok-3 and GPT-5 Medium?

Grok-3 is developed by xAI and GPT-5 Medium is developed by OpenAI.