Qwen logo

Qwen2.5 72B Instruct

Qwen
qwen-2.5-72b-instructVariant

Overview

Qwen2.5-72B-Instruct is an instruction-tuned 72 billion parameter language model, part of the Qwen2.5 series. It is designed to follow instructions, generate long texts (over 8K tokens), understand structured data (e.g., tables), and generate structured outputs, especially JSON. The model supports multilingual capabilities across over 29 languages.

Qwen2.5 72B Instruct was released on September 19, 2024. API access is available through 4 providers, including DeepInfra, Hyperbolic and others.

Performance

Timeline

Release DateUnknown
Knowledge CutoffUnknown

Other Details

Parameters
72.7B
License
Qwen
Training Data
Unknown
Tags
tuning:instruct

Related Models

Compare Qwen2.5 72B Instruct to other models by quality (GPQA score) vs cost. Higher scores and lower costs represent better value.

Performance visualization loading...

Gathering benchmark data from similar models

Benchmarks

Qwen2.5 72B Instruct Performance Across Datasets

Scores sourced from the model's scorecard, paper, or official blog posts

LLM Stats Logollm-stats.com - Sat Dec 06 2025
Notice missing or incorrect data?Start an Issue discussion

Pricing

Pricing, performance, and capabilities for Qwen2.5 72B Instruct across different providers:

ProviderInput ($/M)Output ($/M)Max InputMax OutputLatency (s)ThroughputQuantizationInputOutput
DeepInfra logo
DeepInfra
$0.35$0.40131.1K8.2K0.510.0 tok/s
Text
Image
Audio
Video
Text
Image
Audio
Video
Hyperbolic logo
Hyperbolic
$0.40$0.40131.1K8.2K0.5100.0 tok/s
Text
Image
Audio
Video
Text
Image
Audio
Video
Fireworks logo
Fireworks
$0.89$0.89131.1K8.2K0.3759.0 tok/s
Text
Image
Audio
Video
Text
Image
Audio
Video
Together logo
Together
$1.20$1.20131.1K8.2K0.547.0 tok/s
Text
Image
Audio
Video
Text
Image
Audio
Video

Price Comparison for Qwen2.5 72B Instruct

Price per 1M input tokens (USD), lower is better

LLM Stats Logollm-stats.com - Sat Dec 06 2025

Throughput Comparison for Qwen2.5 72B Instruct

Tokens per second, higher is better

LLM Stats Logollm-stats.com - Sat Dec 06 2025

Latency Comparison for Qwen2.5 72B Instruct

Time to first token (s), lower is better

LLM Stats Logollm-stats.com - Sat Dec 06 2025

Qwen2.5 72B Instruct API Providers: Price vs Throughput

Example Outputs

Recent Posts

Recent Reviews

API Access

API Access Coming Soon

API access for Qwen2.5 72B Instruct will be available soon through our gateway.

FAQ

Common questions about Qwen2.5 72B Instruct

Qwen2.5 72B Instruct was released on September 19, 2024.
Qwen2.5 72B Instruct has 72.7 billion parameters.