Model Comparison

GPT-5.2 vs Gemma 4 31B

GPT-5.2 significantly outperforms across most benchmarks.

Performance Benchmarks

Comparative analysis across standard metrics

4 benchmarks

GPT-5.2 outperforms in 4 benchmarks (GPQA, Humanity's Last Exam, MMMLU, MMMU-Pro), while Gemma 4 31B is better at 0 benchmarks.

GPT-5.2 significantly outperforms across most benchmarks.

Thu Apr 02 2026 • llm-stats.com

Arena Performance

Human preference votes

Pricing Analysis

Price comparison per million tokens

Cost data unavailable.

Lowest available price from all providers
Thu Apr 02 2026 • llm-stats.com
OpenAI
GPT-5.2
Input tokens$1.75
Output tokens$14.00
Best providerOpenAI
Google
Gemma 4 31B
Input tokens$0.00
Output tokens$0.00
Best providerUnknown Organization
Notice missing or incorrect data?Start an Issue

Context Window

Maximum input and output token capacity

Only GPT-5.2 specifies input context (400,000 tokens). Only GPT-5.2 specifies output context (128,000 tokens).

OpenAI
GPT-5.2
Input400,000 tokens
Output128,000 tokens
Google
Gemma 4 31B
Input- tokens
Output- tokens
Thu Apr 02 2026 • llm-stats.com

Input Capabilities

Supported data types and modalities

Both GPT-5.2 and Gemma 4 31B support multimodal inputs.

They are both capable of processing various types of data, offering versatility in application.

GPT-5.2

Text
Images
Audio
Video

Gemma 4 31B

Text
Images
Audio
Video

License

Usage and distribution terms

GPT-5.2 is licensed under a proprietary license, while Gemma 4 31B uses Apache 2.0.

License differences may affect how you can use these models in commercial or open-source projects.

GPT-5.2

Proprietary

Closed source

Gemma 4 31B

Apache 2.0

Open weights

Release Timeline

When each model was launched

GPT-5.2 was released on 2025-12-11, while Gemma 4 31B was released on 2026-04-02.

Gemma 4 31B is 4 months newer than GPT-5.2.

GPT-5.2

Dec 11, 2025

3 months ago

Gemma 4 31B

Apr 2, 2026

0 days ago

3mo newer

Knowledge Cutoff

When training data ends

GPT-5.2 has a documented knowledge cutoff of 2025-08-25, while Gemma 4 31B's cutoff date is not specified.

We can confirm GPT-5.2's training data extends to 2025-08-25, but cannot make a direct comparison without Gemma 4 31B's cutoff date.

GPT-5.2

Aug 2025

Gemma 4 31B

Outputs Comparison

Notice missing or incorrect data?Start an Issue discussion

Key Takeaways

Larger context window (400,000 tokens)
Higher GPQA score (92.4% vs 84.3%)
Higher Humanity's Last Exam score (34.5% vs 26.5%)
Higher MMMLU score (89.6% vs 88.4%)
Higher MMMU-Pro score (79.5% vs 76.9%)
Has open weights

Detailed Comparison

AI Model Comparison Table
Feature
OpenAI
GPT-5.2
Google
Gemma 4 31B

FAQ

Common questions about GPT-5.2 vs Gemma 4 31B

GPT-5.2 significantly outperforms across most benchmarks. GPT-5.2 is made by OpenAI and Gemma 4 31B is made by Google. The best choice depends on your use case — compare their benchmark scores, pricing, and capabilities above.
GPT-5.2 scores AIME 2025: 100.0%, HMMT 2025: 99.4%, Tau2 Telecom: 98.7%, Graphwalks BFS <128k: 94.0%, GPQA: 92.4%. Gemma 4 31B scores AIME 2026: 89.2%, MMMLU: 88.4%, t2-bench: 86.4%, MathVision: 85.6%, MMLU-Pro: 85.2%.
GPT-5.2 supports 400K tokens and Gemma 4 31B supports an unknown number of tokens. A larger context window lets you process longer documents, conversations, or codebases in a single request.
Key differences include licensing (Proprietary vs Apache 2.0). See the full comparison above for benchmark-by-benchmark results.
GPT-5.2 is developed by OpenAI and Gemma 4 31B is developed by Google.