Model Comparison

GLM-4.5-Air vs Granite 3.3 8B Base

GLM-4.5-Air significantly outperforms across most benchmarks.

Performance Benchmarks

Comparative analysis across standard metrics

2 benchmarks

GLM-4.5-Air outperforms in 2 benchmarks (AIME 2024, MATH-500), while Granite 3.3 8B Base is better at 0 benchmarks.

GLM-4.5-Air significantly outperforms across most benchmarks.

Mon May 25 2026 • llm-stats.com

Arena Performance

Human preference votes

Model Size

Parameter count comparison

97.8B diff

GLM-4.5-Air has 97.8B more parameters than Granite 3.3 8B Base, making it 1197.4% larger.

Zhipu AI
GLM-4.5-Air
106.0Bparameters
IBM
Granite 3.3 8B Base
8.2Bparameters
106.0B
GLM-4.5-Air
8.2B
Granite 3.3 8B Base

Input Capabilities

Supported data types and modalities

Granite 3.3 8B Base supports multimodal inputs, whereas GLM-4.5-Air does not.

Granite 3.3 8B Base can handle both text and other forms of data like images, making it suitable for multimodal applications.

GLM-4.5-Air

Text
Images
Audio
Video

Granite 3.3 8B Base

Text
Images
Audio
Video

License

Usage and distribution terms

GLM-4.5-Air is licensed under MIT, while Granite 3.3 8B Base uses Apache 2.0.

License differences may affect how you can use these models in commercial or open-source projects.

GLM-4.5-Air

MIT

Open weights

Granite 3.3 8B Base

Apache 2.0

Open weights

Release Timeline

When each model was launched

GLM-4.5-Air was released on 2025-07-28, while Granite 3.3 8B Base was released on 2025-04-16.

GLM-4.5-Air is 3 months newer than Granite 3.3 8B Base.

GLM-4.5-Air

Jul 28, 2025

10 months ago

3mo newer
Granite 3.3 8B Base

Apr 16, 2025

1.1 years ago

Knowledge Cutoff

When training data ends

Granite 3.3 8B Base has a documented knowledge cutoff of 2024-04-01, while GLM-4.5-Air's cutoff date is not specified.

We can confirm Granite 3.3 8B Base's training data extends to 2024-04-01, but cannot make a direct comparison without GLM-4.5-Air's cutoff date.

GLM-4.5-Air

Granite 3.3 8B Base

Apr 2024

Outputs Comparison

Notice missing or incorrect data?Start an Issue discussion

Key Takeaways

Higher AIME 2024 score (89.4% vs 81.2%)
Higher MATH-500 score (98.1% vs 69.0%)
Supports multimodal inputs

Detailed Comparison

AI Model Comparison Table
Feature
Zhipu AI
GLM-4.5-Air
IBM
Granite 3.3 8B Base

FAQ

Common questions about GLM-4.5-Air vs Granite 3.3 8B Base.

Which is better, GLM-4.5-Air or Granite 3.3 8B Base?

GLM-4.5-Air significantly outperforms across most benchmarks. GLM-4.5-Air is made by Zhipu AI and Granite 3.3 8B Base is made by IBM. The best choice depends on your use case — compare their benchmark scores, pricing, and capabilities above.

How does GLM-4.5-Air compare to Granite 3.3 8B Base in benchmarks?

GLM-4.5-Air scores MATH-500: 98.1%, AIME 2024: 89.4%, MMLU-Pro: 81.4%, TAU-bench Retail: 77.9%, BFCL-v3: 76.4%. Granite 3.3 8B Base scores HumanEval: 89.7%, AttaQ: 88.5%, HumanEval+: 86.1%, AIME 2024: 81.2%, HellaSwag: 80.1%.

What are the main differences between GLM-4.5-Air and Granite 3.3 8B Base?

Key differences include multimodal support (no vs yes), licensing (MIT vs Apache 2.0). See the full comparison above for benchmark-by-benchmark results.

Who makes GLM-4.5-Air and Granite 3.3 8B Base?

GLM-4.5-Air is developed by Zhipu AI and Granite 3.3 8B Base is developed by IBM.