- Organizations
- Qwen
- Qwen3-Next-80B-A3B-Thinking
Qwen3-Next-80B-A3B-Thinking: Benchmarks, Pricing & Context Window
Qwen3-Next-80B-A3B-Thinking is a language model from Qwen, released in September 2025.
Qwen3-Next-80B-A3B-Thinking is the thinking variant of the Qwen3-Next series, featuring the same groundbreaking architecture as the instruct model. Leveraging GSPO, it addresses stability and efficiency challenges of hybrid attention +
Qwen3-Next-80B-A3B-Thinking pricing
Providers
Qwen3-Next-80B-A3B-Thinking starts at $0.150 per million input tokens and $1.50 per million output tokens via Novita.
| Provider | Input $/M | Output $/M | Max Input | Max Output | Latency s | Throughput | Quant | Input | Output |
|---|---|---|---|---|---|---|---|---|---|
| $0.150 | $1.50 | 65.5K | 65.5K | — | — | bf16 |
Qwen3-Next-80B-A3B-Thinking API
API access coming soon
Qwen3-Next-80B-A3B-Thinking will be available through our gateway shortly.
Qwen3-Next-80B-A3B-Thinking examples
Recent arena outputs from Qwen3-Next-80B-A3B-Thinking, picked from the highest-ranked matchups.
Qwen3-Next-80B-A3B-Thinking license
Qwen3-Next-80B-A3B-Thinking is released under the Apache 2.0 license, which permits commercial use, has 80.0B parameters.
- License
- Apache 2.0
- Commercial use allowed
- Parameters
- 80.0B
Apache License 2.0 - allows commercial use
FAQ
Common questions about Qwen3-Next-80B-A3B-Thinking.