Qwen3.5-397B-A17B vs Qwen3-235B-A22B-Thinking-2507 Comparison

Performance Benchmarks

Comparative analysis across standard metrics

11 benchmarks

Qwen3.5-397B-A17B outperforms in 11 benchmarks (GPQA, HMMT25, Humanity's Last Exam, IFEval, Include, LiveCodeBench v6, MMLU-Pro, MMLU-ProX, MMLU-Redux, PolyMATH, SuperGPQA), while Qwen3-235B-A22B-Thinking-2507 is better at 0 benchmarks.

Qwen3.5-397B-A17B significantly outperforms across most benchmarks.

Tue Mar 17 2026 • llm-stats.com

Arena Performance

Human preference votes

Pricing Analysis

Price comparison per million tokens

Qwen3-235B-A22B-Thinking-2507 costs less

For input processing, Qwen3.5-397B-A17B ($0.60/1M tokens) is 2.0x more expensive than Qwen3-235B-A22B-Thinking-2507 ($0.30/1M tokens).

For output processing, Qwen3.5-397B-A17B ($3.60/1M tokens) is 1.2x more expensive than Qwen3-235B-A22B-Thinking-2507 ($3.00/1M tokens).

In conclusion, Qwen3.5-397B-A17B is more expensive than Qwen3-235B-A22B-Thinking-2507.*

* Using a 3:1 ratio of input to output tokens

Lowest available price from all providers
Tue Mar 17 2026 • llm-stats.com
Alibaba Cloud / Qwen Team
Qwen3.5-397B-A17B
Input tokens$0.60
Output tokens$3.60
Best providerNovita
Alibaba Cloud / Qwen Team
Qwen3-235B-A22B-Thinking-2507
Input tokens$0.30
Output tokens$3.00
Best providerFireworks
Notice missing or incorrect data?Start an Issue

Model Size

Parameter count comparison

162.0B diff

Qwen3.5-397B-A17B has 162.0B more parameters than Qwen3-235B-A22B-Thinking-2507, making it 68.9% larger.

Alibaba Cloud / Qwen Team
Qwen3.5-397B-A17B
397.0Bparameters
Alibaba Cloud / Qwen Team
Qwen3-235B-A22B-Thinking-2507
235.0Bparameters
397.0B
Qwen3.5-397B-A17B
235.0B
Qwen3-235B-A22B-Thinking-2507

Context Window

Maximum input and output token capacity

Both models have the same input context window of 262,144 tokens. Qwen3-235B-A22B-Thinking-2507 can generate longer responses up to 131,072 tokens, while Qwen3.5-397B-A17B is limited to 64,000 tokens.

Alibaba Cloud / Qwen Team
Qwen3.5-397B-A17B
Input262,144 tokens
Output64,000 tokens
Alibaba Cloud / Qwen Team
Qwen3-235B-A22B-Thinking-2507
Input262,144 tokens
Output131,072 tokens
Tue Mar 17 2026 • llm-stats.com

Input Capabilities

Supported data types and modalities

Qwen3.5-397B-A17B supports multimodal inputs, whereas Qwen3-235B-A22B-Thinking-2507 does not.

Qwen3.5-397B-A17B can handle both text and other forms of data like images, making it suitable for multimodal applications.

Qwen3.5-397B-A17B

Text
Images
Audio
Video

Qwen3-235B-A22B-Thinking-2507

Text
Images
Audio
Video

License

Usage and distribution terms

Both models are licensed under Apache 2.0.

Both models share the same licensing terms, providing consistent usage rights.

Qwen3.5-397B-A17B

Apache 2.0

Open weights

Qwen3-235B-A22B-Thinking-2507

Apache 2.0

Open weights

Release Timeline

When each model was launched

Qwen3.5-397B-A17B was released on 2026-02-16, while Qwen3-235B-A22B-Thinking-2507 was released on 2025-07-25.

Qwen3.5-397B-A17B is 7 months newer than Qwen3-235B-A22B-Thinking-2507.

Qwen3.5-397B-A17B

Feb 16, 2026

4 weeks ago

6mo newer
Qwen3-235B-A22B-Thinking-2507

Jul 25, 2025

7 months ago

Knowledge Cutoff

When training data ends

Neither model specifies a knowledge cutoff date.

Unable to compare the recency of their training data.

No cutoff dates available

Provider Availability

Qwen3.5-397B-A17B is available from Novita. Qwen3-235B-A22B-Thinking-2507 is available from Fireworks, Novita. The availability of providers can affect quality of the model and reliability.

Qwen3.5-397B-A17B

novita logo
Novita
Input Price:Input: $0.60/1MOutput Price:Output: $3.60/1M

Qwen3-235B-A22B-Thinking-2507

fireworks logo
Fireworks
Input Price:Input: $0.30/1MOutput Price:Output: $3.00/1M
novita logo
Novita
Input Price:Input: $0.30/1MOutput Price:Output: $3.00/1M
* Prices shown are per million tokens

Outputs Comparison

Notice missing or incorrect data?Start an Issue discussion

Key Takeaways

Alibaba Cloud / Qwen Team

Qwen3.5-397B-A17B

View details

Alibaba Cloud / Qwen Team

Supports multimodal inputs
Higher GPQA score (88.4% vs 81.1%)
Higher HMMT25 score (92.7% vs 83.9%)
Higher Humanity's Last Exam score (28.7% vs 18.2%)
Higher IFEval score (92.6% vs 87.8%)
Higher Include score (85.6% vs 81.0%)
Higher LiveCodeBench v6 score (83.6% vs 74.1%)
Higher MMLU-Pro score (87.8% vs 84.4%)
Higher MMLU-ProX score (84.7% vs 81.0%)
Higher MMLU-Redux score (94.9% vs 93.8%)
Higher PolyMATH score (73.3% vs 60.1%)
Higher SuperGPQA score (70.4% vs 64.9%)
Less expensive input tokens
Less expensive output tokens

Detailed Comparison

AI Model Comparison Table
Feature
Alibaba Cloud / Qwen Team
Qwen3.5-397B-A17B
Alibaba Cloud / Qwen Team
Qwen3-235B-A22B-Thinking-2507