Name: Qwen2.5 72B Instruct
Author: Qwen

Question 1

When was Qwen2.5 72B Instruct released?

Accepted Answer

Qwen2.5 72B Instruct was released on September 19, 2024 by Qwen. This is the official Qwen2.5 72B Instruct release date tracked on LLM Stats.

Question 2

How much does Qwen2.5 72B Instruct cost?

Accepted Answer

Qwen2.5 72B Instruct pricing starts at $0.35 per million input tokens and $0.40 per million output tokens via DeepInfra, the lowest price among tracked providers.

Question 3

Is Qwen2.5 72B Instruct available via API?

Accepted Answer

Yes, Qwen2.5 72B Instruct is available via API. See the official documentation for authentication and endpoint details. It is served by 4 providers tracked on LLM Stats.

Question 4

How big is Qwen2.5 72B Instruct?

Accepted Answer

Qwen2.5 72B Instruct has 72.7 billion parameters. It was trained on 18.0 trillion tokens.

Question 5

Who created Qwen2.5 72B Instruct?

Accepted Answer

Qwen2.5 72B Instruct was created by Qwen.

Question 6

What is the license for Qwen2.5 72B Instruct?

Accepted Answer

Qwen2.5 72B Instruct is released under the Qwen license.

Question 7

What is Qwen2.5 72B Instruct latency?

Accepted Answer

Qwen2.5 72B Instruct p95 time to first token is 0.37 seconds via Fireworks over the trailing 7 days. Lower time to first token means the model begins responding sooner for chat, agents and API workloads.

Question 8

Where can I use Qwen2.5 72B Instruct?

Accepted Answer

Qwen2.5 72B Instruct is available through 4 providers including DeepInfra, Hyperbolic, Fireworks, and 1 more.

Question 9

Where is the Qwen2.5 72B Instruct paper or technical report?

Accepted Answer

Qwen2.5 72B Instruct has a paper or technical report available at https://qwenlm.github.io/blog/qwen2.5/. Use that source for architecture, training, release and evaluation details.

Question 10

What models should I compare Qwen2.5 72B Instruct against?

Accepted Answer

Common Qwen2.5 72B Instruct comparisons include Qwen2.5 72B Instruct vs Phi 4 Reasoning, Qwen2.5 72B Instruct vs DeepSeek R1 Distill Qwen 32B, Qwen2.5 72B Instruct vs Qwen2.5 32B Instruct. Compare them side by side for benchmark scores, pricing, context window, latency and API availability.

Provider	Input $/M	Output $/M	Context in / out	TTFT p50 / p95 s	Output avg / p5 c/s	Success 7d	Modalities in / out
DeepInfra	$0.350	$0.400	131.1K/8.2K	—/0.50	10/—	—	/
Hyperbolic	$0.400	$0.400	131.1K/8.2K	—/0.50	100/—	—	/
Fireworks	$0.890	$0.890	131.1K/8.2K	—/0.37	59/—	—	/
Together	$1.20	$1.20	131.1K/8.2K	—/0.50	47/—	—	/

Qwen2.5 72B Instruct: API Pricing, Context Window & Benchmarks

Qwen2.5 72B Instruct benchmarks

Rankings

Quality Tracker

Qwen2.5 72B Instruct Performance Across Datasets

Qwen2.5 72B Instruct pricing

Providers

Qwen2.5 72B Instruct context window