Model Comparison

GLM-4.5-Air vs Kimi K2 InstructWhich is better in 2026?

GLM-4.5-Air shows notably better performance in the majority of benchmarks.

Verdict: GLM-4.5-Air vs Kimi K2 Instruct — which is better?

GLM-4.5-Air (by Zhipu AI) and Kimi K2 Instruct (by Moonshot AI) are two of the AI models people compare most. Here is how they stack up on benchmarks, price and capabilities, and which one to pick in 2026.

GLM-4.5-Air outperforms in 4 benchmarks (AIME 2024, Humanity's Last Exam, MATH-500, MMLU-Pro), while Kimi K2 Instruct is better at 1 benchmark (GPQA). GLM-4.5-Air shows notably better performance in the majority of benchmarks.

Choose GLM-4.5-Air if…

you want the strongest raw capability — it leads on 4 of 6 shared benchmarks
you want the most recent training data — it shipped Jul 2025

Choose Kimi K2 Instruct if…

you want predictable pricing at $0.50/M input and $0.50/M output

Performance Benchmarks

Comparative analysis across standard metrics

6 benchmarks

GLM-4.5-Air outperforms in 4 benchmarks (AIME 2024, Humanity's Last Exam, MATH-500, MMLU-Pro), while Kimi K2 Instruct is better at 1 benchmark (GPQA).

GLM-4.5-Air shows notably better performance in the majority of benchmarks.

Sat Jun 27 2026 • llm-stats.com

Arena Performance

Human preference votes

Model Size

Parameter count comparison

894.0B diff

Kimi K2 Instruct has 894.0B more parameters than GLM-4.5-Air, making it 843.4% larger.

GLM-4.5-Air

106.0Bparameters

Kimi K2 Instruct

1.0Tparameters

106.0B

GLM-4.5-Air

1000.0B

Kimi K2 Instruct

Context Window

Maximum input and output token capacity

Only Kimi K2 Instruct specifies input context (200,000 tokens). Only Kimi K2 Instruct specifies output context (200,000 tokens).

GLM-4.5-Air

Input- tokens

Output- tokens

Kimi K2 Instruct

Input200,000 tokens

Output200,000 tokens

Sat Jun 27 2026 • llm-stats.com

License

Usage and distribution terms

Both models are licensed under MIT.

Both models share the same licensing terms, providing consistent usage rights.

GLM-4.5-Air

MIT

Open weights

Kimi K2 Instruct

MIT

Open weights

Release Timeline

When each model was launched

GLM-4.5-Air was released on 2025-07-28, while Kimi K2 Instruct was released on 2025-07-11.

GLM-4.5-Air is 1 month newer than Kimi K2 Instruct.

GLM-4.5-Air

Jul 28, 2025

11 months ago

2w newer

Kimi K2 Instruct

Jul 11, 2025

11 months ago

Knowledge Cutoff

When training data ends

Neither model specifies a knowledge cutoff date.

Unable to compare the recency of their training data.

No cutoff dates available

Outputs Comparison

Notice missing or incorrect data?Start an Issue discussion→

Key Takeaways

GLM-4.5-Air

View details

Zhipu AI

Higher AIME 2024 score (89.4% vs 69.6%)

Higher Humanity's Last Exam score (10.6% vs 4.7%)

Higher MATH-500 score (98.1% vs 97.4%)

Higher MMLU-Pro score (81.4% vs 81.1%)

Kimi K2 Instruct

View details

Moonshot AI

Larger context window (200,000 tokens)

Higher GPQA score (75.1% vs 75.0%)

Detailed Comparison

Interactive Arena

Judge for yourself.

Run your own prompts against GLM-4.5-Air and Kimi K2 Instruct side-by-side, then vote on the output you prefer.

GLM-4.5-Air

✓ Preferred

Kimi K2 Instruct

Open in Playground

AI Model Comparison Table
Feature	GLM-4.5-Air	Kimi K2 Instruct

FAQ

Common questions about GLM-4.5-Air vs Kimi K2 Instruct.

Which is better, GLM-4.5-Air or Kimi K2 Instruct?

GLM-4.5-Air shows notably better performance in the majority of benchmarks. GLM-4.5-Air is made by Zhipu AI and Kimi K2 Instruct is made by Moonshot AI. The best choice depends on your use case — compare their benchmark scores, pricing, and capabilities above.

How does GLM-4.5-Air compare to Kimi K2 Instruct in benchmarks?

GLM-4.5-Air scores MATH-500: 98.1%, AIME 2024: 89.4%, MMLU-Pro: 81.4%, TAU-bench Retail: 77.9%, BFCL-v3: 76.4%. Kimi K2 Instruct scores MATH-500: 97.4%, GSM8k: 97.3%, CBNSL: 95.6%, HumanEval: 93.3%, MMLU-Redux: 92.7%.

What are the context window sizes for GLM-4.5-Air and Kimi K2 Instruct?

GLM-4.5-Air supports an unknown number of tokens and Kimi K2 Instruct supports 200K tokens. A larger context window lets you process longer documents, conversations, or codebases in a single request.

Who makes GLM-4.5-Air and Kimi K2 Instruct?

GLM-4.5-Air is developed by Zhipu AI and Kimi K2 Instruct is developed by Moonshot AI.

GLM-4.5-Air vs Kimi K2 InstructWhich is better in 2026?

Verdict: GLM-4.5-Air vs Kimi K2 Instruct — which is better?

Choose GLM-4.5-Air if…

Choose Kimi K2 Instruct if…

Performance Benchmarks

Arena Performance

Model Size

Context Window

License

Release Timeline

Knowledge Cutoff

Outputs Comparison

Key Takeaways

GLM-4.5-Air

Kimi K2 Instruct

Detailed Comparison

Judge for yourself.

FAQ

Which is better, GLM-4.5-Air or Kimi K2 Instruct?

How does GLM-4.5-Air compare to Kimi K2 Instruct in benchmarks?

What are the context window sizes for GLM-4.5-Air and Kimi K2 Instruct?

Who makes GLM-4.5-Air and Kimi K2 Instruct?

More GLM-4.5-Air comparisons

More Kimi K2 Instruct comparisons

GLM-4.5-Air vs Kimi K2 InstructWhich is better in 2026?

Verdict: GLM-4.5-Air vs Kimi K2 Instruct — which is better?

Choose GLM-4.5-Air if…

Choose Kimi K2 Instruct if…

Performance Benchmarks

Arena Performance

Model Size

Context Window

License

Release Timeline

Knowledge Cutoff

Outputs Comparison

Key Takeaways

GLM-4.5-Air

Kimi K2 Instruct

Detailed Comparison

Judge for yourself.

Which is better, GLM-4.5-Air or Kimi K2 Instruct?

How does GLM-4.5-Air compare to Kimi K2 Instruct in benchmarks?

What are the context window sizes for GLM-4.5-Air and Kimi K2 Instruct?

Who makes GLM-4.5-Air and Kimi K2 Instruct?

Related comparisons

More GLM-4.5-Air comparisons

More Kimi K2 Instruct comparisons