Model Comparison

Qwen3.5-397B-A17B vs MAI-Thinking-1

Qwen3.5-397B-A17B significantly outperforms across most benchmarks.

Performance Benchmarks

Comparative analysis across standard metrics

9 benchmarks

Qwen3.5-397B-A17B outperforms in 7 benchmarks (GPQA, IFBench, LongBench v2, MMLU-Pro, Multi-Challenge, SWE-Bench Verified, Terminal-Bench 2.0), while MAI-Thinking-1 is better at 2 benchmarks (AIME 2026, LiveCodeBench v6).

Qwen3.5-397B-A17B significantly outperforms across most benchmarks.

Sat Jun 06 2026 • llm-stats.com

Arena Performance

Human preference votes

Model Size

Parameter count comparison

603.0B diff

MAI-Thinking-1 has 603.0B more parameters than Qwen3.5-397B-A17B, making it 151.9% larger.

Alibaba Cloud / Qwen Team
Qwen3.5-397B-A17B
397.0Bparameters
Microsoft
MAI-Thinking-1
1.0Tparameters
397.0B
Qwen3.5-397B-A17B
1000.0B
MAI-Thinking-1

Context Window

Maximum input and output token capacity

Only Qwen3.5-397B-A17B specifies input context (262,144 tokens). Only Qwen3.5-397B-A17B specifies output context (64,000 tokens).

Alibaba Cloud / Qwen Team
Qwen3.5-397B-A17B
Input262,144 tokens
Output64,000 tokens
Microsoft
MAI-Thinking-1
Input- tokens
Output- tokens
Sat Jun 06 2026 • llm-stats.com

Input Capabilities

Supported data types and modalities

Qwen3.5-397B-A17B supports multimodal inputs, whereas MAI-Thinking-1 does not.

Qwen3.5-397B-A17B can handle both text and other forms of data like images, making it suitable for multimodal applications.

Qwen3.5-397B-A17B

Text
Images
Audio
Video

MAI-Thinking-1

Text
Images
Audio
Video

License

Usage and distribution terms

Qwen3.5-397B-A17B is licensed under Apache 2.0, while MAI-Thinking-1 uses a proprietary license.

License differences may affect how you can use these models in commercial or open-source projects.

Qwen3.5-397B-A17B

Apache 2.0

Open weights

MAI-Thinking-1

Proprietary

Closed source

Release Timeline

When each model was launched

Qwen3.5-397B-A17B was released on 2026-02-16, while MAI-Thinking-1 was released on 2026-06-02.

MAI-Thinking-1 is 4 months newer than Qwen3.5-397B-A17B.

Qwen3.5-397B-A17B

Feb 16, 2026

3 months ago

MAI-Thinking-1

Jun 2, 2026

4 days ago

3mo newer

Knowledge Cutoff

When training data ends

Neither model specifies a knowledge cutoff date.

Unable to compare the recency of their training data.

No cutoff dates available

Outputs Comparison

Notice missing or incorrect data?Start an Issue discussion

Key Takeaways

Alibaba Cloud / Qwen Team

Qwen3.5-397B-A17B

View details

Alibaba Cloud / Qwen Team

Larger context window (262,144 tokens)
Supports multimodal inputs
Has open weights
Higher GPQA score (88.4% vs 84.2%)
Higher IFBench score (76.5% vs 69.0%)
Higher LongBench v2 score (63.2% vs 61.0%)
Higher MMLU-Pro score (87.8% vs 85.0%)
Higher Multi-Challenge score (67.6% vs 53.0%)
Higher SWE-Bench Verified score (76.4% vs 73.5%)
Higher Terminal-Bench 2.0 score (52.5% vs 46.0%)
Higher AIME 2026 score (94.5% vs 91.3%)
Higher LiveCodeBench v6 score (87.7% vs 83.6%)

Detailed Comparison

AI Model Comparison Table
Feature
Alibaba Cloud / Qwen Team
Qwen3.5-397B-A17B
Microsoft
MAI-Thinking-1

FAQ

Common questions about Qwen3.5-397B-A17B vs MAI-Thinking-1.

Which is better, Qwen3.5-397B-A17B or MAI-Thinking-1?

Qwen3.5-397B-A17B significantly outperforms across most benchmarks. Qwen3.5-397B-A17B is made by Alibaba Cloud / Qwen Team and MAI-Thinking-1 is made by Microsoft. The best choice depends on your use case — compare their benchmark scores, pricing, and capabilities above.

How does Qwen3.5-397B-A17B compare to MAI-Thinking-1 in benchmarks?

Qwen3.5-397B-A17B scores MMLU-Redux: 94.9%, HMMT 2025: 94.8%, C-Eval: 93.0%, HMMT25: 92.7%, IFEval: 92.6%. MAI-Thinking-1 scores LongFact: 98.0%, AIME 2025: 97.0%, AIME 2026: 94.5%, GraphWalks: 90.0%, AIR-Bench: 88.0%.

What are the context window sizes for Qwen3.5-397B-A17B and MAI-Thinking-1?

Qwen3.5-397B-A17B supports 262K tokens and MAI-Thinking-1 supports an unknown number of tokens. A larger context window lets you process longer documents, conversations, or codebases in a single request.

What are the main differences between Qwen3.5-397B-A17B and MAI-Thinking-1?

Key differences include multimodal support (yes vs no), licensing (Apache 2.0 vs Proprietary). See the full comparison above for benchmark-by-benchmark results.

Who makes Qwen3.5-397B-A17B and MAI-Thinking-1?

Qwen3.5-397B-A17B is developed by Alibaba Cloud / Qwen Team and MAI-Thinking-1 is developed by Microsoft.