Model Comparison

Kimi K2-Thinking-0905 vs Llama 4 Maverick

Kimi K2-Thinking-0905 scores higher on both benchmarks the two models share (GPQA and MMLU-Pro). Llama 4 Maverick is about 3.1x cheaper per token at a 3:1 input-to-output mix.

Performance Benchmarks

Comparative analysis across standard metrics

Kimi K2-Thinking-0905 leads on both of the 2 benchmarks reported for both models: GPQA (84.5% vs 69.8%) and MMLU-Pro (84.6% vs 80.5%). Llama 4 Maverick leads on none.

Mon Jun 01 2026 • llm-stats.com

Pricing Analysis

Price comparison per million tokens

Llama 4 Maverick costs less

For input processing, Kimi K2-Thinking-0905 ($0.47/1M tokens) is 2.8x more expensive than Llama 4 Maverick ($0.17/1M tokens).

For output processing, Kimi K2-Thinking-0905 ($2.00/1M tokens) is 3.3x more expensive than Llama 4 Maverick ($0.60/1M tokens).

Overall, Kimi K2-Thinking-0905 is about 3.1x more expensive than Llama 4 Maverick at a blended rate.*

* Using a 3:1 ratio of input to output tokens

Lowest available price from all providers
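The blended comparison can be reproduced with a few lines of arithmetic. The sketch below assumes the 3:1 input-to-output ratio means a simple weighted average of the two per-million prices; the function name `blended_price` is illustrative and not part of any API.

```python
def blended_price(input_price: float, output_price: float,
                  input_parts: int = 3, output_parts: int = 1) -> float:
    """Weighted average price per 1M tokens for a given input:output mix."""
    total_parts = input_parts + output_parts
    return (input_price * input_parts + output_price * output_parts) / total_parts


# Lowest listed prices per 1M tokens, taken from the figures above.
kimi = blended_price(0.47, 2.00)
llama = blended_price(0.17, 0.60)
ratio = kimi / llama

print(f"Kimi K2-Thinking-0905 blended: ${kimi:.4f}/1M")
print(f"Llama 4 Maverick blended:      ${llama:.4f}/1M")
print(f"Kimi is about {ratio:.1f}x more expensive")
```

Changing `input_parts` and `output_parts` shows how the gap moves for workloads that are heavier on output, where the per-token difference is larger (3.3x versus 2.8x for input).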

Kimi K2-Thinking-0905

Input tokens: $0.47

Output tokens: $2.00

Best provider: DeepInfra

Llama 4 Maverick

Input tokens: $0.17

Output tokens: $0.60

Best provider: DeepInfra

Model Size

Parameter count comparison

Kimi K2-Thinking-0905 has 600.0B more parameters than Llama 4 Maverick, making it 150.0% larger.
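As a quick check of the size figures, the sketch below computes the absolute and relative difference from the two parameter counts. The helper name `relative_size` is illustrative only. Note that "150% larger" is the same as saying the model has 2.5x the parameters.

```python
def relative_size(larger_b: float, smaller_b: float) -> tuple[float, float, float]:
    """Return (difference in billions, percent larger, size multiple)."""
    diff = larger_b - smaller_b
    percent_larger = diff / smaller_b * 100
    multiple = larger_b / smaller_b
    return diff, percent_larger, multiple


# Parameter counts in billions, as listed on this page.
diff_b, pct, multiple = relative_size(1000.0, 400.0)
print(f"{diff_b:.1f}B more parameters, {pct:.1f}% larger, {multiple:.1f}x the size")
```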

Kimi K2-Thinking-0905

1.0T parameters

Llama 4 Maverick

400.0B parameters

Context Window

Maximum input and output token capacity

Llama 4 Maverick accepts 1,000,000 input tokens compared to Kimi K2-Thinking-0905's 262,144 tokens. Llama 4 Maverick can generate longer responses up to 1,000,000 tokens, while Kimi K2-Thinking-0905 is limited to 262,144 tokens.

Kimi K2-Thinking-0905

Input: 262,144 tokens

Output: 262,144 tokens

Llama 4 Maverick

Input: 1,000,000 tokens

Output: 1,000,000 tokens
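A rough way to use these limits is to check whether a planned request fits before choosing a model. The sketch below is a minimal illustration that assumes prompt and generated tokens share one window per model; the page lists input and output limits separately, and individual providers may enforce them differently. The names `CONTEXT_LIMITS` and `models_that_fit` are hypothetical.

```python
# Token limits as listed on this page.
CONTEXT_LIMITS = {
    "Kimi K2-Thinking-0905": 262_144,
    "Llama 4 Maverick": 1_000_000,
}


def models_that_fit(prompt_tokens: int, max_output_tokens: int) -> list[str]:
    """Return models whose window can hold the prompt plus the requested output.

    Assumption: prompt and output tokens count against a single shared window.
    """
    needed = prompt_tokens + max_output_tokens
    return [name for name, limit in CONTEXT_LIMITS.items() if needed <= limit]


print(models_that_fit(200_000, 50_000))   # both models fit
print(models_that_fit(300_000, 8_000))    # only the larger window fits
```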

Input Capabilities

Supported data types and modalities

Llama 4 Maverick supports multimodal inputs, whereas Kimi K2-Thinking-0905 does not.

Llama 4 Maverick can handle both text and other forms of data like images, making it suitable for multimodal applications.

Kimi K2-Thinking-0905

Supported inputs: Text

Llama 4 Maverick

Supported inputs: Text, Images

License

Usage and distribution terms

Kimi K2-Thinking-0905 is licensed under MIT, while Llama 4 Maverick uses the Llama 4 Community License Agreement.

License differences may affect how you can use these models in commercial or open-source projects.

Kimi K2-Thinking-0905

MIT

Open weights

Llama 4 Maverick

Llama 4 Community License Agreement

Open weights

Release Timeline

When each model was launched

Kimi K2-Thinking-0905 was released on 2025-09-05, while Llama 4 Maverick was released on 2025-04-05.

Kimi K2-Thinking-0905 is 5 months newer than Llama 4 Maverick.

Kimi K2-Thinking-0905

Sep 5, 2025 (8 months before the Jun 1, 2026 page date)

Llama 4 Maverick

Apr 5, 2025 (about 1.2 years before the Jun 1, 2026 page date)
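The relative ages above depend on the page date (Jun 1, 2026). The sketch below recomputes them with Python's standard `datetime` module; `whole_months_between` is an illustrative helper, and counting only completed months is an assumption about how the page rounds.

```python
from datetime import date

kimi_release = date(2025, 9, 5)
llama_release = date(2025, 4, 5)
page_date = date(2026, 6, 1)


def whole_months_between(earlier: date, later: date) -> int:
    """Count completed calendar months from `earlier` to `later`."""
    months = (later.year - earlier.year) * 12 + (later.month - earlier.month)
    if later.day < earlier.day:
        months -= 1  # the current month is not complete yet
    return months


gap_months = whole_months_between(llama_release, kimi_release)
kimi_age_months = whole_months_between(kimi_release, page_date)
llama_age_years = (page_date - llama_release).days / 365.25

print(f"Kimi K2-Thinking-0905 is {gap_months} months newer")
print(f"Kimi K2-Thinking-0905 age: {kimi_age_months} months")
print(f"Llama 4 Maverick age: {llama_age_years:.1f} years")
```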

Knowledge Cutoff

When training data ends

Neither model specifies a knowledge cutoff date, so the recency of their training data cannot be compared.

Provider Availability

Kimi K2-Thinking-0905 is available from DeepInfra, Novita, and Fireworks. Llama 4 Maverick is available from DeepInfra, Novita, Lambda, Groq, Fireworks, Together, and SambaNova.

Kimi K2-Thinking-0905

Provider	Input price	Output price
DeepInfra	$0.47/1M	$2.00/1M
Novita	$0.48/1M	$2.00/1M
Fireworks	$0.60/1M	$2.50/1M

Llama 4 Maverick

Provider	Input price	Output price
DeepInfra	$0.17/1M	$0.60/1M
Novita	$0.17/1M	$0.85/1M
Lambda	$0.18/1M	$0.60/1M
Groq	$0.20/1M	$0.60/1M
Fireworks	$0.22/1M	$0.88/1M
Together	$0.27/1M	$0.85/1M
SambaNova	$0.63/1M	$1.79/1M

* Prices shown are per million tokens
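Because providers differ on input and output prices independently, the cheapest option can depend on the token mix of a workload. The sketch below picks the lowest-cost provider for a given model and token volume, using only the prices listed on this page. `PROVIDERS` and `cheapest_provider` are hypothetical names, and the calculation ignores factors the page does not cover, such as rate limits, latency, or volume discounts.

```python
# (input $/1M tokens, output $/1M tokens), as listed on this page.
PROVIDERS = {
    "Kimi K2-Thinking-0905": {
        "DeepInfra": (0.47, 2.00),
        "Novita": (0.48, 2.00),
        "Fireworks": (0.60, 2.50),
    },
    "Llama 4 Maverick": {
        "DeepInfra": (0.17, 0.60),
        "Novita": (0.17, 0.85),
        "Lambda": (0.18, 0.60),
        "Groq": (0.20, 0.60),
        "Fireworks": (0.22, 0.88),
        "Together": (0.27, 0.85),
        "SambaNova": (0.63, 1.79),
    },
}


def cheapest_provider(model: str, input_tokens: int, output_tokens: int) -> tuple[str, float]:
    """Return (provider name, total cost in USD) with the lowest cost for the workload."""
    def cost(prices: tuple[float, float]) -> float:
        input_price, output_price = prices
        return (input_tokens * input_price + output_tokens * output_price) / 1_000_000

    name, prices = min(PROVIDERS[model].items(), key=lambda item: cost(item[1]))
    return name, cost(prices)


for model in PROVIDERS:
    provider, usd = cheapest_provider(model, 3_000_000, 1_000_000)
    print(f"{model}: {provider} at ${usd:.2f}")
```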

Key Takeaways

Kimi K2-Thinking-0905

Moonshot AI

Higher GPQA score (84.5% vs 69.8%)

Higher MMLU-Pro score (84.6% vs 80.5%)

Llama 4 Maverick

Detailed Comparison

AI Model Comparison Table
Feature	Kimi K2-Thinking-0905	Llama 4 Maverick

FAQ

Common questions about Kimi K2-Thinking-0905 vs Llama 4 Maverick.

Which is better, Kimi K2-Thinking-0905 or Llama 4 Maverick?

Kimi K2-Thinking-0905 scores higher on both shared benchmarks (GPQA and MMLU-Pro), while Llama 4 Maverick is cheaper, accepts image input, and has a larger context window. Kimi K2-Thinking-0905 is made by Moonshot AI and Llama 4 Maverick is made by Meta. The best choice depends on your use case, so compare their benchmark scores, pricing, and capabilities above.

How does Kimi K2-Thinking-0905 compare to Llama 4 Maverick in benchmarks?

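Reported scores for Kimi K2-Thinking-0905 include AIME 2025 (100.0%), HMMT 2025 (97.5%), MMLU-Redux (94.4%), FRAMES (87.0%), and MMLU-Pro (84.6%). Reported scores for Llama 4 Maverick include DocVQA (94.4%), MGSM (92.3%), ChartQA (90.0%), MMLU (85.5%), and MMLU-Pro (80.5%). Of these, only MMLU-Pro appears in both lists, where Kimi K2-Thinking-0905 leads 84.6% to 80.5%.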

Is Kimi K2-Thinking-0905 cheaper than Llama 4 Maverick?

No. Llama 4 Maverick is 2.8x cheaper for input tokens and 3.3x cheaper for output tokens. Kimi K2-Thinking-0905 costs $0.47/M input and $2.00/M output via DeepInfra. Llama 4 Maverick costs $0.17/M input and $0.60/M output via DeepInfra.

What are the context window sizes for Kimi K2-Thinking-0905 and Llama 4 Maverick?

Kimi K2-Thinking-0905 supports 262K tokens and Llama 4 Maverick supports 1.0M tokens. A larger context window lets you process longer documents, conversations, or codebases in a single request.

What are the main differences between Kimi K2-Thinking-0905 and Llama 4 Maverick?

Key differences include context window (262K vs 1.0M), input pricing ($0.47 vs $0.17/M), multimodal support (no vs yes), and licensing (MIT vs Llama 4 Community License Agreement). See the full comparison above for benchmark-by-benchmark results.

Who makes Kimi K2-Thinking-0905 and Llama 4 Maverick?

Kimi K2-Thinking-0905 is developed by Moonshot AI and Llama 4 Maverick is developed by Meta.