GLM-5: Benchmarks, Pricing & Size
GLM-5 is a language model from ZAI, released in February 2026.
GLM-5 is Zhipu AI's flagship foundation model designed for complex system engineering and long-range Agent tasks, shifting focus from coding to engineering. It features 744B total parameters (40B activated) in a Mixture of Experts
GLM-5 pricing
Providers
GLM-5 starts at $1.00 per million input tokens and $3.20 per million output tokens via FriendliAI. See all 2 providers below with their per-token pricing, latency, throughput, and modality support.
| Provider | Input $/M | Output $/M | Max Input | Max Output | Latency p95 s | Throughput P95 | Quant | Input | Output |
|---|---|---|---|---|---|---|---|---|---|
| $1.00 | $3.20 | 200.0K | 128.0K | 6.06 | — | — | |||
| $1.00 | $3.20 | 200.0K | 128.0K | 12.09 | 30 c/s | — |
GLM-5 model size
GLM-5 has 744 billion parameters and was trained on 28.5 trillion tokens. See how it compares to other models in the same parameter range.
GLM-5 API
API access coming soon
GLM-5 will be available through our gateway shortly.
GLM-5 examples
Recent arena outputs from GLM-5, picked from the highest-ranked matchups.
GLM-5 license
GLM-5 is released under the MIT license, which permits commercial use, has 744.0B parameters.
- License
- MIT
- Commercial use allowed
- Parameters
- 744.0B
MIT License - allows commercial use
FAQ
Common questions about GLM-5.