Name: GLM-4.7-Flash
Author: ZAI

Question 1

When was GLM-4.7-Flash released?

Accepted Answer

GLM-4.7-Flash was released on January 19, 2026 by ZAI. This is the official GLM-4.7-Flash release date tracked on LLM Stats.

Question 2

How much does GLM-4.7-Flash cost?

Accepted Answer

GLM-4.7-Flash pricing starts at $0.07 per million input tokens and $0.40 per million output tokens via ZAI, the lowest price among tracked providers.

Question 3

Is GLM-4.7-Flash available via API?

Accepted Answer

Yes, GLM-4.7-Flash is available via API. See the official documentation for authentication and endpoint details. It is served by 1 provider tracked on LLM Stats.

Question 4

How big is GLM-4.7-Flash?

Accepted Answer

GLM-4.7-Flash has 30 billion parameters. It ships as an open-weight model, so you can download and run it on your own hardware.

Question 5

Who created GLM-4.7-Flash?

Accepted Answer

GLM-4.7-Flash was created by ZAI.

Question 6

What is the license for GLM-4.7-Flash?

Accepted Answer

GLM-4.7-Flash is released under the MIT license. This is an open-source / open-weight license that permits self-hosting.

Question 7

What is GLM-4.7-Flash latency?

Accepted Answer

GLM-4.7-Flash p95 time to first token is 2.00 seconds via ZAI over the trailing 7 days. Lower time to first token means the model begins responding sooner for chat, agents and API workloads.

Question 8

Where can I use GLM-4.7-Flash?

Accepted Answer

GLM-4.7-Flash is available through 1 provider including ZAI.

Question 9

What models should I compare GLM-4.7-Flash against?

Accepted Answer

Common GLM-4.7-Flash comparisons include GLM-4.7-Flash vs Kimi K2 0905, GLM-4.7-Flash vs LongCat-Flash-Thinking, GLM-4.7-Flash vs LongCat-Flash-Chat. Compare them side by side for benchmark scores, pricing, context window, latency and API availability.

GLM-4.7-Flash: API Pricing, Context Window & Benchmarks

GLM-4.7-Flash benchmarks

Rankings

Quality Tracker

GLM-4.7-Flash Performance Across Datasets

GLM-4.7-Flash pricing

Providers

GLM-4.7-Flash model size

GLM-4.7-Flash context window

GLM-4.7-Flash API

GLM-4.7-Flash latency

GLM-4.7-Flash examples

GLM-4.7-Flash license

GLM-4.7-Flash resources

GLM-4.7-Flash vs other models

Models like GLM-4.7-Flash

FAQ