Qwen logo

Qwen3-Next-80B-A3B-Thinking

Qwen
qwen3-next-80b-a3b-thinkingVariant

Overview

Qwen3-Next-80B-A3B-Thinking is the thinking variant of the Qwen3-Next series, featuring the same groundbreaking architecture as the instruct model. Leveraging GSPO, it addresses stability and efficiency challenges of hybrid attention + high-sparsity MoE in RL training. It uses Hybrid Attention combining Gated DeltaNet and Gated Attention for efficient ultra-long context modeling, High-Sparsity MoE with 512 experts (10 activated + 1 shared), and Multi-Token Prediction. With 80B total parameters and only 3B activated, it demonstrates outstanding performance on complex reasoning tasks — outperforming Qwen3-30B-A3B-Thinking-2507, Qwen3-32B-Thinking, and even the proprietary Gemini-2.5-Flash-Thinking across multiple benchmarks. Architecture: 48 layers, 15T training tokens, hybrid layout of 12*(3*(Gated DeltaNet->MoE)->(Gated Attention->MoE)). Supports only thinking mode with automatic <think> tag inclusion, may generate longer thinking content.

Qwen3-Next-80B-A3B-Thinking was released on September 10, 2025. API access is available through Novita.

Performance

Timeline

Release DateUnknown
Knowledge CutoffUnknown

Other Details

Parameters
80.0B
License
Apache 2.0
Training Data
Unknown
Tags
language:enthinking:true

Related Models

Compare Qwen3-Next-80B-A3B-Thinking to other models by quality (GPQA score) vs cost. Higher scores and lower costs represent better value.

Performance visualization loading...

Gathering benchmark data from similar models

Benchmarks

Qwen3-Next-80B-A3B-Thinking Performance Across Datasets

Scores sourced from the model's scorecard, paper, or official blog posts

LLM Stats Logollm-stats.com - Sun Dec 14 2025
Notice missing or incorrect data?Start an Issue discussion

Pricing

Pricing, performance, and capabilities for Qwen3-Next-80B-A3B-Thinking across different providers:

ProviderInput ($/M)Output ($/M)Max InputMax OutputLatency (s)ThroughputQuantizationInputOutput
Novita logo
Novitabf16
$0.15$1.5065.5K65.5Kbf16
Text
Image
Audio
Video
Text
Image
Audio
Video

Example Outputs

Recent Posts

Recent Reviews

API Access

API Access Coming Soon

API access for Qwen3-Next-80B-A3B-Thinking will be available soon through our gateway.

FAQ

Common questions about Qwen3-Next-80B-A3B-Thinking

Qwen3-Next-80B-A3B-Thinking was released on September 10, 2025.
Qwen3-Next-80B-A3B-Thinking has 80.0 billion parameters.