Model Comparison

Kimi K2 Instruct vs Qwen3-235B-A22B-Thinking-2507

Qwen3-235B-A22B-Thinking-2507 significantly outperforms across most benchmarks. Kimi K2 Instruct is 1.9x cheaper per token.

Performance Benchmarks

Comparative analysis across standard metrics

12 benchmarks

Kimi K2 Instruct outperforms in 2 benchmarks (IFEval, Tau2 Telecom), while Qwen3-235B-A22B-Thinking-2507 is better at 10 benchmarks (AIME 2025, GPQA, Humanity's Last Exam, LiveCodeBench v6, MMLU-Pro, MMLU-Redux, OJBench, SuperGPQA, Tau2 Airline, Tau2 Retail).

Qwen3-235B-A22B-Thinking-2507 significantly outperforms across most benchmarks.

Fri May 08 2026 • llm-stats.com

Arena Performance

Human preference votes

Pricing Analysis

Price comparison per million tokens

Kimi K2 Instruct costs less

For input processing, Kimi K2 Instruct ($0.50/1M tokens) is 1.7x more expensive than Qwen3-235B-A22B-Thinking-2507 ($0.30/1M tokens).

For output processing, Kimi K2 Instruct ($0.50/1M tokens) is 6.0x cheaper than Qwen3-235B-A22B-Thinking-2507 ($3.00/1M tokens).

In conclusion, Qwen3-235B-A22B-Thinking-2507 is more expensive than Kimi K2 Instruct.*

* Using a 3:1 ratio of input to output tokens

Lowest available price from all providers
Fri May 08 2026 • llm-stats.com
Moonshot AI
Kimi K2 Instruct
Input tokens$0.50
Output tokens$0.50
Best providerFireworks
Alibaba Cloud / Qwen Team
Qwen3-235B-A22B-Thinking-2507
Input tokens$0.30
Output tokens$3.00
Best providerFireworks
Notice missing or incorrect data?Start an Issue

Model Size

Parameter count comparison

765.0B diff

Kimi K2 Instruct has 765.0B more parameters than Qwen3-235B-A22B-Thinking-2507, making it 325.5% larger.

Moonshot AI
Kimi K2 Instruct
1.0Tparameters
Alibaba Cloud / Qwen Team
Qwen3-235B-A22B-Thinking-2507
235.0Bparameters
1000.0B
Kimi K2 Instruct
235.0B
Qwen3-235B-A22B-Thinking-2507

Context Window

Maximum input and output token capacity

Qwen3-235B-A22B-Thinking-2507 accepts 262,144 input tokens compared to Kimi K2 Instruct's 200,000 tokens. Kimi K2 Instruct can generate longer responses up to 200,000 tokens, while Qwen3-235B-A22B-Thinking-2507 is limited to 131,072 tokens.

Moonshot AI
Kimi K2 Instruct
Input200,000 tokens
Output200,000 tokens
Alibaba Cloud / Qwen Team
Qwen3-235B-A22B-Thinking-2507
Input262,144 tokens
Output131,072 tokens
Fri May 08 2026 • llm-stats.com

License

Usage and distribution terms

Kimi K2 Instruct is licensed under MIT, while Qwen3-235B-A22B-Thinking-2507 uses Apache 2.0.

License differences may affect how you can use these models in commercial or open-source projects.

Kimi K2 Instruct

MIT

Open weights

Qwen3-235B-A22B-Thinking-2507

Apache 2.0

Open weights

Release Timeline

When each model was launched

Kimi K2 Instruct was released on 2025-07-11, while Qwen3-235B-A22B-Thinking-2507 was released on 2025-07-25.

Qwen3-235B-A22B-Thinking-2507 is 0 month newer than Kimi K2 Instruct.

Kimi K2 Instruct

Jul 11, 2025

10 months ago

Qwen3-235B-A22B-Thinking-2507

Jul 25, 2025

9 months ago

2w newer

Knowledge Cutoff

When training data ends

Neither model specifies a knowledge cutoff date.

Unable to compare the recency of their training data.

No cutoff dates available

Provider Availability

Kimi K2 Instruct is available from Fireworks, Novita. Qwen3-235B-A22B-Thinking-2507 is available from Fireworks, Novita.

Kimi K2 Instruct

fireworks logo
Fireworks
Input Price:Input: $0.50/1MOutput Price:Output: $0.50/1M
novita logo
Novita
Input Price:Input: $0.57/1MOutput Price:Output: $2.30/1M

Qwen3-235B-A22B-Thinking-2507

fireworks logo
Fireworks
Input Price:Input: $0.30/1MOutput Price:Output: $3.00/1M
novita logo
Novita
Input Price:Input: $0.30/1MOutput Price:Output: $3.00/1M
* Prices shown are per million tokens

Outputs Comparison

Notice missing or incorrect data?Start an Issue discussion

Key Takeaways

Less expensive output tokens
Higher IFEval score (89.8% vs 87.8%)
Higher Tau2 Telecom score (65.8% vs 45.6%)
Larger context window (262,144 tokens)
Less expensive input tokens
Higher AIME 2025 score (92.3% vs 49.5%)
Higher GPQA score (81.1% vs 75.1%)
Higher Humanity's Last Exam score (18.2% vs 4.7%)
Higher LiveCodeBench v6 score (74.1% vs 53.7%)
Higher MMLU-Pro score (84.4% vs 81.1%)
Higher MMLU-Redux score (93.8% vs 92.7%)
Higher OJBench score (32.5% vs 27.1%)
Higher SuperGPQA score (64.9% vs 57.2%)
Higher Tau2 Airline score (58.0% vs 56.5%)
Higher Tau2 Retail score (71.9% vs 70.6%)

Detailed Comparison

AI Model Comparison Table
Feature
Moonshot AI
Kimi K2 Instruct
Alibaba Cloud / Qwen Team
Qwen3-235B-A22B-Thinking-2507

FAQ

Common questions about Kimi K2 Instruct vs Qwen3-235B-A22B-Thinking-2507.

Which is better, Kimi K2 Instruct or Qwen3-235B-A22B-Thinking-2507?

Qwen3-235B-A22B-Thinking-2507 significantly outperforms across most benchmarks. Kimi K2 Instruct is made by Moonshot AI and Qwen3-235B-A22B-Thinking-2507 is made by Alibaba Cloud / Qwen Team. The best choice depends on your use case — compare their benchmark scores, pricing, and capabilities above.

How does Kimi K2 Instruct compare to Qwen3-235B-A22B-Thinking-2507 in benchmarks?

Kimi K2 Instruct scores MATH-500: 97.4%, GSM8k: 97.3%, CBNSL: 95.6%, HumanEval: 93.3%, MMLU-Redux: 92.7%. Qwen3-235B-A22B-Thinking-2507 scores MMLU-Redux: 93.8%, AIME 2025: 92.3%, WritingBench: 88.3%, IFEval: 87.8%, Creative Writing v3: 86.1%.

Is Kimi K2 Instruct cheaper than Qwen3-235B-A22B-Thinking-2507?

Qwen3-235B-A22B-Thinking-2507 is 1.7x cheaper for input tokens. Kimi K2 Instruct costs $0.50/M input and $0.50/M output via fireworks. Qwen3-235B-A22B-Thinking-2507 costs $0.30/M input and $3.00/M output via fireworks.

What are the context window sizes for Kimi K2 Instruct and Qwen3-235B-A22B-Thinking-2507?

Kimi K2 Instruct supports 200K tokens and Qwen3-235B-A22B-Thinking-2507 supports 262K tokens. A larger context window lets you process longer documents, conversations, or codebases in a single request.

What are the main differences between Kimi K2 Instruct and Qwen3-235B-A22B-Thinking-2507?

Key differences include context window (200K vs 262K), input pricing ($0.50 vs $0.30/M), licensing (MIT vs Apache 2.0). See the full comparison above for benchmark-by-benchmark results.

Who makes Kimi K2 Instruct and Qwen3-235B-A22B-Thinking-2507?

Kimi K2 Instruct is developed by Moonshot AI and Qwen3-235B-A22B-Thinking-2507 is developed by Alibaba Cloud / Qwen Team.