Qwen logo

Qwen3-Next-80B-A3B-Thinking

Overview

Overview

Qwen3-Next-80B-A3B-Thinking is the thinking variant of the Qwen3-Next series, featuring the same groundbreaking architecture as the instruct model. Leveraging GSPO, it addresses stability and efficiency challenges of hybrid attention + high-sparsity MoE in RL training. It uses Hybrid Attention combining Gated DeltaNet and Gated Attention for efficient ultra-long context modeling, High-Sparsity MoE with 512 experts (10 activated + 1 shared), and Multi-Token Prediction. With 80B total parameters and only 3B activated, it demonstrates outstanding performance on complex reasoning tasks — outperforming Qwen3-30B-A3B-Thinking-2507, Qwen3-32B-Thinking, and even the proprietary Gemini-2.5-Flash-Thinking across multiple benchmarks. Architecture: 48 layers, 15T training tokens, hybrid layout of 12*(3*(Gated DeltaNet->MoE)->(Gated Attention->MoE)). Supports only thinking mode with automatic <think> tag inclusion, may generate longer thinking content.

Qwen3-Next-80B-A3B-Thinking was released on September 10, 2025. API access is available through Novita.

Performance

Timeline

ReleasedUnknown
Knowledge CutoffUnknown

Specifications

Parameters
80.0B
License
Apache 2.0
Training Data
Unknown
Tags
language:enthinking:true

Benchmarks

Benchmarks

Qwen3-Next-80B-A3B-Thinking Performance Across Datasets

Scores sourced from the model's scorecard, paper, or official blog posts

LLM Stats Logollm-stats.com - Sat Feb 07 2026
Notice missing or incorrect data?Start an Issue discussion

Pricing

Pricing

Pricing, performance, and capabilities for Qwen3-Next-80B-A3B-Thinking across different providers:

ProviderInput ($/M)Output ($/M)Max InputMax OutputLatency (s)ThroughputQuantizationInputOutput
Novita logo
Novitabf16
$0.15$1.5065.5K65.5K
bf16
Text
Image
Audio
Video
Text
Image
Audio
Video

API Access

API Access Coming Soon

API access for Qwen3-Next-80B-A3B-Thinking will be available soon through our gateway.

Recent Posts

Recent Reviews

FAQ

Common questions about Qwen3-Next-80B-A3B-Thinking

Qwen3-Next-80B-A3B-Thinking was released on September 10, 2025 by Qwen.
Qwen3-Next-80B-A3B-Thinking was created by Qwen.
Qwen3-Next-80B-A3B-Thinking has 80.0 billion parameters.
Qwen3-Next-80B-A3B-Thinking is released under the Apache 2.0 license. This is an open-source/open-weight license.