Model Comparison

GLM-4.5-Air vs Kimi K2 Instruct: Which is better in 2026?

GLM-4.5-Air shows notably better performance in the majority of benchmarks.

Verdict: GLM-4.5-Air vs Kimi K2 Instruct — which is better?

GLM-4.5-Air (by Zhipu AI) and Kimi K2 Instruct (by Moonshot AI) are two of the AI models people compare most. Here is how they stack up on benchmarks, price and capabilities, and which one to pick in 2026.

GLM-4.5-Air leads on 4 benchmarks (AIME 2024, Humanity's Last Exam, MATH-500, MMLU-Pro), while Kimi K2 Instruct leads on 1 (GPQA). The largest gaps are on AIME 2024 (89.4% vs 69.6%) and Humanity's Last Exam (10.6% vs 4.7%); the other three results are within 1 percentage point of each other.

Choose GLM-4.5-Air if…

  • you want the strongest raw capability — it leads on 4 of 6 shared benchmarks
  • you want the newer release: it shipped Jul 28, 2025, 17 days after Kimi K2 Instruct (neither model specifies a knowledge cutoff)

Choose Kimi K2 Instruct if…

  • you want predictable pricing at $0.50/M input and $0.50/M output

Performance Benchmarks

Comparative analysis across standard metrics

6 benchmarks

GLM-4.5-Air outperforms in 4 benchmarks (AIME 2024, Humanity's Last Exam, MATH-500, MMLU-Pro), while Kimi K2 Instruct is better at 1 benchmark (GPQA).

Sun Jun 14 2026 • llm-stats.com

Model Size

Parameter count comparison

894.0B diff

Kimi K2 Instruct has 894.0B more parameters than GLM-4.5-Air, making it 843.4% larger.

GLM-4.5-Air (Zhipu AI): 106.0B parameters
Kimi K2 Instruct (Moonshot AI): 1.0T (1,000.0B) parameters
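
The size figures above follow from simple arithmetic on the two listed parameter counts. A minimal sketch, assuming the counts are exactly 106.0B and 1,000.0B as shown on the page (the variable names are illustrative):

```python
# Parameter counts in billions, as listed on the page.
glm_air_params_b = 106.0
kimi_k2_params_b = 1000.0

diff_b = kimi_k2_params_b - glm_air_params_b   # absolute gap in billions
pct_larger = diff_b / glm_air_params_b * 100   # gap relative to the smaller model
ratio = kimi_k2_params_b / glm_air_params_b    # size as a multiple of the smaller model

print(f"{diff_b:.1f}B diff, {pct_larger:.1f}% larger, {ratio:.2f}x the size")
# prints: 894.0B diff, 843.4% larger, 9.43x the size
```

Note that "843.4% larger" and "9.43x the size" describe the same gap: the percentage measures the difference, while the ratio measures the total.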

Context Window

Maximum input and output token capacity

Only Kimi K2 Instruct specifies context limits: 200,000 input tokens and 200,000 output tokens. GLM-4.5-Air's limits are not listed.

GLM-4.5-Air (Zhipu AI)
Input: not specified
Output: not specified
Kimi K2 Instruct (Moonshot AI)
Input: 200,000 tokens
Output: 200,000 tokens

License

Usage and distribution terms

Both models are licensed under MIT.

Both models share the same licensing terms, providing consistent usage rights.

GLM-4.5-Air

MIT

Open weights

Kimi K2 Instruct

MIT

Open weights

Release Timeline

When each model was launched

GLM-4.5-Air was released on 2025-07-28, while Kimi K2 Instruct was released on 2025-07-11.

GLM-4.5-Air is 17 days (about two weeks) newer than Kimi K2 Instruct.

GLM-4.5-Air
Jul 28, 2025 (10 months ago, 2 weeks newer)

Kimi K2 Instruct
Jul 11, 2025 (11 months ago)
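
The release gap and the "months ago" figures can be checked with date arithmetic. A minimal sketch, assuming the page date of Jun 14, 2026 as the reference point; the helper `whole_months` is a hypothetical name introduced here for illustration:

```python
from datetime import date

glm_air_release = date(2025, 7, 28)
kimi_k2_release = date(2025, 7, 11)
page_date = date(2026, 6, 14)  # date shown on the page


def whole_months(start: date, end: date) -> int:
    """Count completed calendar months between two dates."""
    months = (end.year - start.year) * 12 + (end.month - start.month)
    if end.day < start.day:
        months -= 1  # the current month is not yet complete
    return months


gap_days = (glm_air_release - kimi_k2_release).days
print(gap_days)                                  # 17
print(whole_months(glm_air_release, page_date))  # 10
print(whole_months(kimi_k2_release, page_date))  # 11
```

The 17-day gap rounds to about two weeks, so the two models are close contemporaries despite the one-month difference in their "months ago" labels.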

Knowledge Cutoff

When training data ends

Neither model specifies a knowledge cutoff date, so the recency of their training data cannot be compared.

Key Takeaways

GLM-4.5-Air
  • Higher AIME 2024 score (89.4% vs 69.6%)
  • Higher Humanity's Last Exam score (10.6% vs 4.7%)
  • Higher MATH-500 score (98.1% vs 97.4%)
  • Higher MMLU-Pro score (81.4% vs 81.1%)

Kimi K2 Instruct
  • Larger context window (200,000 tokens; GLM-4.5-Air's is not specified)
  • Higher GPQA score (75.1% vs 75.0%)
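
To see how much each win matters, the scores above can be tallied with their margins. A rough sketch using only the five benchmarks whose paired scores appear on this page (the sixth shared benchmark is not itemized here); the 1-point threshold for a "close call" is an assumption chosen for illustration:

```python
# (GLM-4.5-Air score, Kimi K2 Instruct score) in percent, from the takeaways above.
scores = {
    "AIME 2024": (89.4, 69.6),
    "Humanity's Last Exam": (10.6, 4.7),
    "MATH-500": (98.1, 97.4),
    "MMLU-Pro": (81.4, 81.1),
    "GPQA": (75.0, 75.1),
}

CLOSE_CALL_POINTS = 1.0  # assumed threshold: gaps this small are near-ties

wins = {"GLM-4.5-Air": 0, "Kimi K2 Instruct": 0}
close_calls = []

for benchmark, (glm, kimi) in scores.items():
    margin = round(glm - kimi, 1)  # positive means GLM-4.5-Air leads
    winner = "GLM-4.5-Air" if margin > 0 else "Kimi K2 Instruct"
    wins[winner] += 1
    if abs(margin) <= CLOSE_CALL_POINTS:
        close_calls.append(benchmark)
    print(f"{benchmark}: {winner} by {abs(margin):.1f} points")

print(wins)
print("Close calls:", close_calls)
```

Under these assumptions the tally is 4 to 1, but three of the five results (MATH-500, MMLU-Pro, GPQA) are near-ties, so the overall lead rests mainly on AIME 2024 and Humanity's Last Exam.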

FAQ

Common questions about GLM-4.5-Air vs Kimi K2 Instruct.

Which is better, GLM-4.5-Air or Kimi K2 Instruct?

GLM-4.5-Air shows notably better performance in the majority of benchmarks. GLM-4.5-Air is made by Zhipu AI and Kimi K2 Instruct is made by Moonshot AI. The best choice depends on your use case — compare their benchmark scores, pricing, and capabilities above.

How does GLM-4.5-Air compare to Kimi K2 Instruct in benchmarks?

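The scores listed for GLM-4.5-Air are MATH-500 (98.1%), AIME 2024 (89.4%), MMLU-Pro (81.4%), TAU-bench Retail (77.9%), and BFCL-v3 (76.4%). The scores listed for Kimi K2 Instruct are MATH-500 (97.4%), GSM8k (97.3%), CBNSL (95.6%), HumanEval (93.3%), and MMLU-Redux (92.7%). Only MATH-500 appears in both lists, so the remaining figures are not head-to-head comparisons.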

What are the context window sizes for GLM-4.5-Air and Kimi K2 Instruct?

GLM-4.5-Air's context window is not specified, while Kimi K2 Instruct supports 200K tokens. A larger context window lets you process longer documents, conversations, or codebases in a single request.

Who makes GLM-4.5-Air and Kimi K2 Instruct?

GLM-4.5-Air is developed by Zhipu AI and Kimi K2 Instruct is developed by Moonshot AI.