Model Comparison

GPT-4 Turbo vs Claude 3 Sonnet

GPT-4 Turbo significantly outperforms across most benchmarks. Claude 3 Sonnet is 2.5x cheaper per token.

Performance Benchmarks

Comparative analysis across standard metrics

6 benchmarks

GPT-4 Turbo outperforms in 6 benchmarks (DROP, GPQA, HumanEval, MATH, MGSM, MMLU), while Claude 3 Sonnet is better at 0 benchmarks.

GPT-4 Turbo significantly outperforms across most benchmarks.

Mon Apr 06 2026 • llm-stats.com

Arena Performance

Human preference votes

Pricing Analysis

Price comparison per million tokens

Claude 3 Sonnet costs less

For input processing, GPT-4 Turbo ($10.00/1M tokens) is 3.3x more expensive than Claude 3 Sonnet ($3.00/1M tokens).

For output processing, GPT-4 Turbo ($30.00/1M tokens) is 2.0x more expensive than Claude 3 Sonnet ($15.00/1M tokens).

In conclusion, GPT-4 Turbo is more expensive than Claude 3 Sonnet.*

* Using a 3:1 ratio of input to output tokens

Lowest available price from all providers
Mon Apr 06 2026 • llm-stats.com
OpenAI
GPT-4 Turbo
Input tokens$10.00
Output tokens$30.00
Best providerAzure
Anthropic
Claude 3 Sonnet
Input tokens$3.00
Output tokens$15.00
Best providerAnthropic
Notice missing or incorrect data?Start an Issue

Context Window

Maximum input and output token capacity

Claude 3 Sonnet accepts 200,000 input tokens compared to GPT-4 Turbo's 128,000 tokens. Claude 3 Sonnet can generate longer responses up to 200,000 tokens, while GPT-4 Turbo is limited to 4,096 tokens.

OpenAI
GPT-4 Turbo
Input128,000 tokens
Output4,096 tokens
Anthropic
Claude 3 Sonnet
Input200,000 tokens
Output200,000 tokens
Mon Apr 06 2026 • llm-stats.com

Input Capabilities

Supported data types and modalities

Claude 3 Sonnet supports multimodal inputs, whereas GPT-4 Turbo does not.

Claude 3 Sonnet can handle both text and other forms of data like images, making it suitable for multimodal applications.

GPT-4 Turbo

Text
Images
Audio
Video

Claude 3 Sonnet

Text
Images
Audio
Video

License

Usage and distribution terms

Both models are licensed under proprietary licenses.

Both models have usage restrictions defined by their respective organizations.

GPT-4 Turbo

Proprietary

Closed source

Claude 3 Sonnet

Proprietary

Closed source

Release Timeline

When each model was launched

GPT-4 Turbo was released on 2024-04-09, while Claude 3 Sonnet was released on 2024-02-29.

GPT-4 Turbo is 1 month newer than Claude 3 Sonnet.

GPT-4 Turbo

Apr 9, 2024

2.0 years ago

1mo newer
Claude 3 Sonnet

Feb 29, 2024

2.1 years ago

Knowledge Cutoff

When training data ends

GPT-4 Turbo has a documented knowledge cutoff of 2023-12-31, while Claude 3 Sonnet's cutoff date is not specified.

We can confirm GPT-4 Turbo's training data extends to 2023-12-31, but cannot make a direct comparison without Claude 3 Sonnet's cutoff date.

GPT-4 Turbo

Dec 2023

Claude 3 Sonnet

Provider Availability

GPT-4 Turbo is available from Azure, OpenAI. Claude 3 Sonnet is available from Anthropic, Bedrock, Google.

GPT-4 Turbo

azure logo
Azure
Input Price:Input: $10.00/1MOutput Price:Output: $30.00/1M
openai logo
OpenAI
Input Price:Input: $10.00/1MOutput Price:Output: $30.00/1M

Claude 3 Sonnet

anthropic logo
Anthropic
Input Price:Input: $3.00/1MOutput Price:Output: $15.00/1M
bedrock logo
AWS Bedrock
Input Price:Input: $3.00/1MOutput Price:Output: $15.00/1M
google logo
Google
Input Price:Input: $3.00/1MOutput Price:Output: $15.00/1M
* Prices shown are per million tokens

Outputs Comparison

Notice missing or incorrect data?Start an Issue discussion

Key Takeaways

Higher DROP score (86.0% vs 78.9%)
Higher GPQA score (48.0% vs 40.4%)
Higher HumanEval score (87.1% vs 73.0%)
Higher MATH score (72.6% vs 43.1%)
Higher MGSM score (88.5% vs 83.5%)
Higher MMLU score (86.5% vs 79.0%)
Larger context window (200,000 tokens)
Supports multimodal inputs
Less expensive input tokens
Less expensive output tokens

Detailed Comparison

AI Model Comparison Table
Feature
OpenAI
GPT-4 Turbo
Anthropic
Claude 3 Sonnet

FAQ

Common questions about GPT-4 Turbo vs Claude 3 Sonnet

GPT-4 Turbo significantly outperforms across most benchmarks. GPT-4 Turbo is made by OpenAI and Claude 3 Sonnet is made by Anthropic. The best choice depends on your use case — compare their benchmark scores, pricing, and capabilities above.
GPT-4 Turbo scores MGSM: 88.5%, HumanEval: 87.1%, MMLU: 86.5%, DROP: 86.0%, MATH: 72.6%. Claude 3 Sonnet scores ARC-C: 93.2%, GSM8k: 92.3%, HellaSwag: 89.0%, MGSM: 83.5%, BIG-Bench Hard: 82.9%.
Claude 3 Sonnet is 3.3x cheaper for input tokens. GPT-4 Turbo costs $10.00/M input and $30.00/M output via azure. Claude 3 Sonnet costs $3.00/M input and $15.00/M output via anthropic.
GPT-4 Turbo supports 128K tokens and Claude 3 Sonnet supports 200K tokens. A larger context window lets you process longer documents, conversations, or codebases in a single request.
Key differences include context window (128K vs 200K), input pricing ($10.00 vs $3.00/M), multimodal support (no vs yes). See the full comparison above for benchmark-by-benchmark results.
GPT-4 Turbo is developed by OpenAI and Claude 3 Sonnet is developed by Anthropic.