Overview

GLM 4.7 is a coding‑centric model that thinks before acting, preserves its reasoning across turns, and lets you control thinking per request for speed or accuracy. It upgrades agentic workflows with stronger multi‑step tool use, better terminal and multilingual coding, and a noticeable jump in UI output quality for modern, clean webpages and slides. You can use it in popular coding agents, call it via the Z.ai API, and even run it locally with public weights on HuggingFace and ModelScope using vLLM or SGLang.

GLM-4.7 was released on December 22, 2025. API access is available through Novita.

Performance

Timeline

ReleasedUnknown

Knowledge CutoffUnknown

Specifications

Parameters

358.0B

License

MIT

Training Data

Unknown

Benchmarks

GLM-4.7 Performance Across Datasets

Scores sourced from the model's scorecard, paper, or official blog posts

llm-stats.com - Wed Dec 24 2025

Notice missing or incorrect data?Start an Issue discussion→

Pricing

Pricing, performance, and capabilities for GLM-4.7 across different providers:

Provider	Input ($/M)	Output ($/M)	Max Input	Max Output	Latency (s)	Throughput	Quantization	Input	Output
Novitabf16	$0.60	$2.20	204.8K	131.1K	—	—	bf16	Text Image Audio Video	Text Image Audio Video

API Access

API Access Coming Soon

API access for GLM-4.7 will be available soon through our gateway.

Recent Reviews

Blog Posts

Model ReleaseTechnical Deep Dive

GLM-4.7: Pricing, Benchmarks, and Full Model Analysis

A comprehensive look at Zhipu AI's GLM-4.7 — the flagship foundation model with 200K context window, 128K output capacity, MoE architecture, 'Vibe Coding' capabilities, and what it means for developers and enterprises.

Sebastian Crossa

Dec 22, 2025

FAQ

Common questions about GLM-4.7

GLM-4.7 was released on December 22, 2025.

GLM-4.7 has 358.0 billion parameters.

GLM-4.7