Model Comparison

Kimi K2-Thinking-0905 vs Llama 4 Maverick

Kimi K2-Thinking-0905 leads on both benchmarks the two models share (GPQA and MMLU-Pro). Llama 4 Maverick is 3.1x cheaper per token at a 3:1 input-to-output mix.

Performance Benchmarks

Comparative analysis across standard metrics

2 benchmarks

Kimi K2-Thinking-0905 scores higher on both shared benchmarks (GPQA and MMLU-Pro); Llama 4 Maverick leads on neither.
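The head-to-head count can be reproduced from the two shared scores. The snippet below is a minimal sketch that uses the GPQA and MMLU-Pro figures listed under Key Takeaways; the variable names are illustrative and not part of any llm-stats.com API.

```python
# Shared benchmark scores in percent, as (Kimi K2-Thinking-0905, Llama 4 Maverick).
# Values are copied from the Key Takeaways section of this page.
SHARED_SCORES = {
    "GPQA": (84.5, 69.8),
    "MMLU-Pro": (84.6, 80.5),
}

# Margin in percentage points by which Kimi leads on each benchmark.
margins = {
    name: round(kimi - llama, 1)
    for name, (kimi, llama) in SHARED_SCORES.items()
}

kimi_wins = sum(1 for kimi, llama in SHARED_SCORES.values() if kimi > llama)
llama_wins = sum(1 for kimi, llama in SHARED_SCORES.values() if llama > kimi)

for name, margin in margins.items():
    print(f"{name}: Kimi leads by {margin} points")
print(f"Kimi wins: {kimi_wins}, Llama wins: {llama_wins}")
```

With only two shared benchmarks, the count says little about tasks outside GPQA and MMLU-Pro.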


Mon Jun 01 2026 • llm-stats.com


Pricing Analysis

Price comparison per million tokens

Llama 4 Maverick costs less

For input processing, Kimi K2-Thinking-0905 ($0.47/1M tokens) is 2.8x more expensive than Llama 4 Maverick ($0.17/1M tokens).

For output processing, Kimi K2-Thinking-0905 ($2.00/1M tokens) is 3.3x more expensive than Llama 4 Maverick ($0.60/1M tokens).

Overall, Kimi K2-Thinking-0905 is about 3.1x more expensive than Llama 4 Maverick on a blended basis.*

* Using a 3:1 ratio of input to output tokens
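The blended figure follows from weighting the input and output prices by the 3:1 mix. The sketch below assumes the blend is a simple weighted average, which is consistent with the 3.1x figure quoted on this page; `blended_price` is a hypothetical helper name.

```python
# Lowest listed prices in USD per 1M tokens, copied from the figures above.
KIMI_INPUT, KIMI_OUTPUT = 0.47, 2.00
LLAMA_INPUT, LLAMA_OUTPUT = 0.17, 0.60


def blended_price(input_price, output_price, input_parts=3, output_parts=1):
    """Weighted average price per 1M tokens for a given input:output mix."""
    total_parts = input_parts + output_parts
    return (input_parts * input_price + output_parts * output_price) / total_parts


kimi_blended = blended_price(KIMI_INPUT, KIMI_OUTPUT)
llama_blended = blended_price(LLAMA_INPUT, LLAMA_OUTPUT)
ratio = kimi_blended / llama_blended

print(f"Kimi K2-Thinking-0905: ${kimi_blended:.4f} per 1M blended tokens")
print(f"Llama 4 Maverick:      ${llama_blended:.4f} per 1M blended tokens")
print(f"Kimi is {ratio:.1f}x more expensive at a 3:1 mix")
```

Changing `input_parts` and `output_parts` shows how the gap moves for output-heavy workloads, where the 3.3x output-price difference dominates.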

Lowest available price from all providers
Kimi K2-Thinking-0905 (Moonshot AI)
Input tokens: $0.47
Output tokens: $2.00
Best provider: DeepInfra

Llama 4 Maverick (Meta)
Input tokens: $0.17
Output tokens: $0.60
Best provider: DeepInfra

Model Size

Parameter count comparison

600.0B diff

Kimi K2-Thinking-0905 has 600.0B more parameters than Llama 4 Maverick, making it 150.0% larger.
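The size difference is plain arithmetic on the two listed parameter counts. A minimal sketch, with the counts expressed in billions as on this page:

```python
# Parameter counts in billions, as listed on this page.
KIMI_PARAMS_B = 1000.0   # shown as 1.0T
LLAMA_PARAMS_B = 400.0

diff_b = KIMI_PARAMS_B - LLAMA_PARAMS_B          # absolute difference
pct_larger = diff_b / LLAMA_PARAMS_B * 100       # relative to the smaller model
size_ratio = KIMI_PARAMS_B / LLAMA_PARAMS_B      # how many times larger

print(f"Difference: {diff_b:.1f}B parameters")
print(f"Kimi is {pct_larger:.1f}% larger ({size_ratio:.1f}x the size)")
```

Note that "150% larger" and "2.5x the size" describe the same gap: the first is measured as an increase over Llama 4 Maverick's count, the second as a direct ratio.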

Kimi K2-Thinking-0905 (Moonshot AI): 1.0T parameters (1,000.0B)
Llama 4 Maverick (Meta): 400.0B parameters

Context Window

Maximum input and output token capacity

Llama 4 Maverick accepts up to 1,000,000 input tokens, compared with 262,144 for Kimi K2-Thinking-0905. The output limits are the same: Llama 4 Maverick can generate responses of up to 1,000,000 tokens, while Kimi K2-Thinking-0905 is limited to 262,144.
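In practice these limits decide which model can take a given request. The sketch below checks a request against the listed limits; it assumes the input and output limits apply independently, which this page does not confirm, and individual providers may enforce lower caps. `models_that_fit` is a hypothetical helper.

```python
# Listed token limits from this page. Treating input and output limits as
# independent is an assumption; some deployments share one window for both.
LIMITS = {
    "Kimi K2-Thinking-0905": {"input": 262_144, "output": 262_144},
    "Llama 4 Maverick": {"input": 1_000_000, "output": 1_000_000},
}


def models_that_fit(prompt_tokens, max_output_tokens):
    """Return the models whose listed limits cover the request."""
    return [
        name
        for name, limit in LIMITS.items()
        if prompt_tokens <= limit["input"] and max_output_tokens <= limit["output"]
    ]


# A 300,000-token prompt exceeds Kimi's listed input limit.
print(models_that_fit(prompt_tokens=300_000, max_output_tokens=4_000))
# A 100,000-token prompt fits both models.
print(models_that_fit(prompt_tokens=100_000, max_output_tokens=4_000))
```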

Kimi K2-Thinking-0905 (Moonshot AI)
Input: 262,144 tokens
Output: 262,144 tokens

Llama 4 Maverick (Meta)
Input: 1,000,000 tokens
Output: 1,000,000 tokens

Input Capabilities

Supported data types and modalities

Llama 4 Maverick supports multimodal inputs, whereas Kimi K2-Thinking-0905 does not.

Llama 4 Maverick can handle both text and other forms of data like images, making it suitable for multimodal applications.

Kimi K2-Thinking-0905: Text

Llama 4 Maverick: Text, Images

License

Usage and distribution terms

Kimi K2-Thinking-0905 is licensed under MIT, while Llama 4 Maverick uses the Llama 4 Community License Agreement.

License differences may affect how you can use these models in commercial or open-source projects.

Kimi K2-Thinking-0905

MIT

Open weights

Llama 4 Maverick

Llama 4 Community License Agreement

Open weights

Release Timeline

When each model was launched

Kimi K2-Thinking-0905 was released on 2025-09-05, while Llama 4 Maverick was released on 2025-04-05.

Kimi K2-Thinking-0905 is 5 months newer than Llama 4 Maverick.
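The five-month gap can be checked with standard date arithmetic. This is a small sketch using the two release dates stated above; `months_between` is a hypothetical helper that counts whole calendar months.

```python
from datetime import date

# Release dates as stated on this page.
KIMI_RELEASE = date(2025, 9, 5)
LLAMA_RELEASE = date(2025, 4, 5)


def months_between(earlier, later):
    """Whole calendar months elapsed from `earlier` to `later`."""
    months = (later.year - earlier.year) * 12 + (later.month - earlier.month)
    if later.day < earlier.day:
        months -= 1  # the final month is not yet complete
    return months


gap_months = months_between(LLAMA_RELEASE, KIMI_RELEASE)
gap_days = (KIMI_RELEASE - LLAMA_RELEASE).days

print(f"Kimi K2-Thinking-0905 is {gap_months} months ({gap_days} days) newer")
```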

Kimi K2-Thinking-0905: Sep 5, 2025 (8 months ago)

Llama 4 Maverick: Apr 5, 2025 (1.2 years ago)

Knowledge Cutoff

When training data ends

Neither model specifies a knowledge cutoff date, so the recency of their training data cannot be compared.

Provider Availability

Kimi K2-Thinking-0905 is available from DeepInfra, Novita, and Fireworks. Llama 4 Maverick is available from DeepInfra, Novita, Lambda, Groq, Fireworks, Together, and Sambanova.

Kimi K2-Thinking-0905

DeepInfra: $0.47 input, $2.00 output
Novita: $0.48 input, $2.00 output
Fireworks: $0.60 input, $2.50 output

Llama 4 Maverick

DeepInfra: $0.17 input, $0.60 output
Novita: $0.17 input, $0.85 output
Lambda: $0.18 input, $0.60 output
Groq: $0.20 input, $0.60 output
Fireworks: $0.22 input, $0.88 output
Together: $0.27 input, $0.85 output
Sambanova: $0.63 input, $1.79 output

* Prices shown are per million tokens
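Because providers differ on input and output price separately, the cheapest option depends on the token mix. The sketch below ranks the listed providers by a weighted price; the weighting scheme and the `rank_providers` helper are assumptions for illustration, not something the page defines.

```python
# (input, output) prices in USD per 1M tokens, copied from the listing above.
PROVIDERS = {
    "Kimi K2-Thinking-0905": {
        "DeepInfra": (0.47, 2.00),
        "Novita": (0.48, 2.00),
        "Fireworks": (0.60, 2.50),
    },
    "Llama 4 Maverick": {
        "DeepInfra": (0.17, 0.60),
        "Novita": (0.17, 0.85),
        "Lambda": (0.18, 0.60),
        "Groq": (0.20, 0.60),
        "Fireworks": (0.22, 0.88),
        "Together": (0.27, 0.85),
        "Sambanova": (0.63, 1.79),
    },
}


def rank_providers(model, input_parts=3, output_parts=1):
    """Return (provider, weighted price) pairs, cheapest first."""
    total_parts = input_parts + output_parts
    weighted = [
        (name, (input_parts * inp + output_parts * out) / total_parts)
        for name, (inp, out) in PROVIDERS[model].items()
    ]
    return sorted(weighted, key=lambda pair: pair[1])


for provider, price in rank_providers("Llama 4 Maverick"):
    print(f"{provider:<10} ${price:.4f} per 1M tokens at a 3:1 mix")
```

For both models DeepInfra comes out cheapest at the 3:1 mix, which matches the "Best provider" entries in the pricing section.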


Key Takeaways

Kimi K2-Thinking-0905:
Higher GPQA score (84.5% vs 69.8%)
Higher MMLU-Pro score (84.6% vs 80.5%)

Llama 4 Maverick:
Larger context window (1,000,000 tokens)
Supports multimodal inputs
Less expensive input tokens
Less expensive output tokens


FAQ

Common questions about Kimi K2-Thinking-0905 vs Llama 4 Maverick.

Which is better, Kimi K2-Thinking-0905 or Llama 4 Maverick?

Kimi K2-Thinking-0905 leads on both shared benchmarks (GPQA and MMLU-Pro), while Llama 4 Maverick is cheaper, has a larger context window, and accepts multimodal input. Kimi K2-Thinking-0905 is made by Moonshot AI and Llama 4 Maverick is made by Meta. The best choice depends on your use case, so compare their benchmark scores, pricing, and capabilities above.

How does Kimi K2-Thinking-0905 compare to Llama 4 Maverick in benchmarks?

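On the two shared benchmarks, Kimi K2-Thinking-0905 leads on GPQA (84.5% vs 69.8%) and MMLU-Pro (84.6% vs 80.5%). Reported scores for Kimi K2-Thinking-0905 include AIME 2025 (100.0%), HMMT 2025 (97.5%), MMLU-Redux (94.4%), FRAMES (87.0%), and MMLU-Pro (84.6%). Reported scores for Llama 4 Maverick include DocVQA (94.4%), MGSM (92.3%), ChartQA (90.0%), MMLU (85.5%), and MMLU-Pro (80.5%).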

Is Kimi K2-Thinking-0905 cheaper than Llama 4 Maverick?

No. Llama 4 Maverick is 2.8x cheaper for input tokens and 3.3x cheaper for output tokens. Kimi K2-Thinking-0905 costs $0.47/M input and $2.00/M output via DeepInfra. Llama 4 Maverick costs $0.17/M input and $0.60/M output via DeepInfra.

What are the context window sizes for Kimi K2-Thinking-0905 and Llama 4 Maverick?

Kimi K2-Thinking-0905 supports 262K tokens and Llama 4 Maverick supports 1.0M tokens. A larger context window lets you process longer documents, conversations, or codebases in a single request.

What are the main differences between Kimi K2-Thinking-0905 and Llama 4 Maverick?

Key differences include context window (262K vs 1.0M), input pricing ($0.47 vs $0.17/M), multimodal support (no vs yes), and licensing (MIT vs Llama 4 Community License Agreement). See the full comparison above for benchmark-by-benchmark results.

Who makes Kimi K2-Thinking-0905 and Llama 4 Maverick?

Kimi K2-Thinking-0905 is developed by Moonshot AI and Llama 4 Maverick is developed by Meta.