Qwen2.5 72B Instruct: Benchmarks, Pricing & Context Window
Qwen2.5 72B Instruct is a language model from Qwen, released in September 2024.
Qwen2.5-72B-Instruct is an instruction-tuned 72 billion parameter language model, part of the Qwen2.5 series. It is designed to follow instructions, generate long texts (over 8K tokens), understand structured data (e.g., tables), and
Qwen2.5 72B Instruct pricing
Providers
Qwen2.5 72B Instruct starts at $0.350 per million input tokens and $0.400 per million output tokens via DeepInfra. See all 4 providers below with their per-token pricing, latency, throughput, and modality support.
| Provider | Input $/M | Output $/M | Max Input | Max Output | Latency (s) | Throughput | Quant | Input modality | Output modality |
|---|---|---|---|---|---|---|---|---|---|
| DeepInfra | $0.350 | $0.400 | 131.1K | 8.2K | 0.50 | 10 c/s | — | | |
| | $0.400 | $0.400 | 131.1K | 8.2K | 0.50 | 100 c/s | — | | |
| | $0.890 | $0.890 | 131.1K | 8.2K | 0.37 | 59 c/s | — | | |
| | $1.20 | $1.20 | 131.1K | 8.2K | 0.50 | 47 c/s | — | | |
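As a rough illustration of how per-million-token pricing translates into a per-request cost, the sketch below applies the lowest listed rates ($0.350 input, $0.400 output). The function name `estimate_cost` and the token counts are hypothetical, chosen only for this example. Actual billing may differ by provider.

```python
# Per-million-token prices in USD, taken from the lowest-priced row above.
INPUT_PRICE_PER_M = 0.350
OUTPUT_PRICE_PER_M = 0.400


def estimate_cost(
    input_tokens: int,
    output_tokens: int,
    input_price_per_m: float = INPUT_PRICE_PER_M,
    output_price_per_m: float = OUTPUT_PRICE_PER_M,
) -> float:
    """Return the estimated USD cost of a single request.

    Hypothetical helper: prices are quoted per million tokens, so the
    weighted token counts are divided by 1,000,000.
    """
    total = input_tokens * input_price_per_m + output_tokens * output_price_per_m
    return total / 1_000_000


if __name__ == "__main__":
    # Assumed workload: 10,000 prompt tokens and 1,000 generated tokens.
    cost = estimate_cost(10_000, 1_000)
    print(f"Estimated cost: ${cost:.4f}")
```

Passing a different provider's rates as the last two arguments gives a like-for-like comparison, for example `estimate_cost(10_000, 1_000, 1.20, 1.20)` for the most expensive row.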
Qwen2.5 72B Instruct API
API access coming soon
Qwen2.5 72B Instruct will be available through our gateway shortly.
Qwen2.5 72B Instruct examples
Recent arena outputs from Qwen2.5 72B Instruct, picked from the highest-ranked matchups.
Qwen2.5 72B Instruct license
Qwen2.5 72B Instruct is released under the Qwen license, which permits commercial use. The model has 72.7B parameters.
- License: Qwen (commercial use allowed)
- Parameters: 72.7B
Alibaba Qwen License
FAQ
Common questions about Qwen2.5 72B Instruct.