Model Comparison

Sarvam-105B vs Qwen3-235B-A22B-Thinking-2507Which is better in 2026?

Qwen3-235B-A22B-Thinking-2507 shows notably better performance in the majority of benchmarks.

Verdict: Sarvam-105B vs Qwen3-235B-A22B-Thinking-2507 — which is better?

Sarvam-105B (by Sarvam AI) and Qwen3-235B-A22B-Thinking-2507 (by Alibaba Cloud / Qwen Team) are two of the AI models people compare most. Here is how they stack up on benchmarks, price and capabilities, and which one to pick in 2026.

Sarvam-105B outperforms in 2 benchmarks (AIME 2025, HMMT25), while Qwen3-235B-A22B-Thinking-2507 is better at 6 benchmarks (Arena-Hard v2, GPQA, Humanity's Last Exam, IFEval, LiveCodeBench v6, MMLU-Pro). Qwen3-235B-A22B-Thinking-2507 shows notably better performance in the majority of benchmarks.

Choose Sarvam-105B if…

you want the most recent training data — it shipped Mar 2026

Choose Qwen3-235B-A22B-Thinking-2507 if…

you want the strongest raw capability — it leads on 6 of 8 shared benchmarks

Want to compare interactively?Try the playground

Performance Benchmarks

Comparative analysis across standard metrics

8 benchmarks

Qwen3-235B-A22B-Thinking-2507 shows notably better performance in the majority of benchmarks.

Tue Jun 23 2026 • llm-stats.com

Arena Performance

Human preference votes

Model Size

Parameter count comparison

130.0B diff

Qwen3-235B-A22B-Thinking-2507 has 130.0B more parameters than Sarvam-105B, making it 123.8% larger.

Sarvam-105B

105.0Bparameters

Qwen3-235B-A22B-Thinking-2507

235.0Bparameters

105.0B

Sarvam-105B

235.0B

Qwen3-235B-A22B-Thinking-2507

Context Window

Maximum input and output token capacity

Only Qwen3-235B-A22B-Thinking-2507 specifies input context (262,144 tokens). Only Qwen3-235B-A22B-Thinking-2507 specifies output context (131,072 tokens).

Sarvam-105B

Input- tokens

Output- tokens

Qwen3-235B-A22B-Thinking-2507

Input262,144 tokens

Output131,072 tokens

Tue Jun 23 2026 • llm-stats.com

License

Usage and distribution terms

Both models are licensed under Apache 2.0.

Both models share the same licensing terms, providing consistent usage rights.

Sarvam-105B

Apache 2.0

Open weights

Qwen3-235B-A22B-Thinking-2507

Apache 2.0

Open weights

Release Timeline

When each model was launched

Sarvam-105B was released on 2026-03-06, while Qwen3-235B-A22B-Thinking-2507 was released on 2025-07-25.

Sarvam-105B is 7 months newer than Qwen3-235B-A22B-Thinking-2507.

Sarvam-105B

Mar 6, 2026

3 months ago

7mo newer

Qwen3-235B-A22B-Thinking-2507

Jul 25, 2025

11 months ago

Knowledge Cutoff

When training data ends

Neither model specifies a knowledge cutoff date.

Unable to compare the recency of their training data.

No cutoff dates available

Outputs Comparison

Notice missing or incorrect data?Start an Issue discussion→

Key Takeaways

Sarvam-105B

View details

Sarvam AI

Higher AIME 2025 score (96.7% vs 92.3%)

Higher HMMT25 score (85.8% vs 83.9%)

Qwen3-235B-A22B-Thinking-2507

View details

Alibaba Cloud / Qwen Team

Larger context window (262,144 tokens)

Higher Arena-Hard v2 score (79.7% vs 71.0%)

Higher GPQA score (81.1% vs 78.7%)

Higher Humanity's Last Exam score (18.2% vs 11.2%)

Higher IFEval score (87.8% vs 84.8%)

Higher LiveCodeBench v6 score (74.1% vs 71.7%)

Higher MMLU-Pro score (84.4% vs 81.7%)

Detailed Comparison

AI Model Comparison Table
Feature	Sarvam-105B	Qwen3-235B-A22B-Thinking-2507

FAQ

Common questions about Sarvam-105B vs Qwen3-235B-A22B-Thinking-2507.

Which is better, Sarvam-105B or Qwen3-235B-A22B-Thinking-2507?

Qwen3-235B-A22B-Thinking-2507 shows notably better performance in the majority of benchmarks. Sarvam-105B is made by Sarvam AI and Qwen3-235B-A22B-Thinking-2507 is made by Alibaba Cloud / Qwen Team. The best choice depends on your use case — compare their benchmark scores, pricing, and capabilities above.

How does Sarvam-105B compare to Qwen3-235B-A22B-Thinking-2507 in benchmarks?

Sarvam-105B scores MATH-500: 98.6%, AIME 2025: 96.7%, MMLU: 90.6%, HMMT 2025: 85.8%, HMMT25: 85.8%. Qwen3-235B-A22B-Thinking-2507 scores MMLU-Redux: 93.8%, AIME 2025: 92.3%, WritingBench: 88.3%, IFEval: 87.8%, Creative Writing v3: 86.1%.

What are the context window sizes for Sarvam-105B and Qwen3-235B-A22B-Thinking-2507?

Sarvam-105B supports an unknown number of tokens and Qwen3-235B-A22B-Thinking-2507 supports 262K tokens. A larger context window lets you process longer documents, conversations, or codebases in a single request.

Who makes Sarvam-105B and Qwen3-235B-A22B-Thinking-2507?

Sarvam-105B is developed by Sarvam AI and Qwen3-235B-A22B-Thinking-2507 is developed by Alibaba Cloud / Qwen Team.

Sarvam-105B vs Qwen3-235B-A22B-Thinking-2507Which is better in 2026?

Verdict: Sarvam-105B vs Qwen3-235B-A22B-Thinking-2507 — which is better?

Choose Sarvam-105B if…

Choose Qwen3-235B-A22B-Thinking-2507 if…

Performance Benchmarks

Arena Performance

Model Size

Context Window

License

Release Timeline

Knowledge Cutoff

Outputs Comparison

Key Takeaways

Sarvam-105B

Qwen3-235B-A22B-Thinking-2507

Detailed Comparison

FAQ

Which is better, Sarvam-105B or Qwen3-235B-A22B-Thinking-2507?

How does Sarvam-105B compare to Qwen3-235B-A22B-Thinking-2507 in benchmarks?

What are the context window sizes for Sarvam-105B and Qwen3-235B-A22B-Thinking-2507?

Who makes Sarvam-105B and Qwen3-235B-A22B-Thinking-2507?

More Sarvam-105B comparisons

More Qwen3-235B-A22B-Thinking-2507 comparisons

Sarvam-105B vs Qwen3-235B-A22B-Thinking-2507Which is better in 2026?

Verdict: Sarvam-105B vs Qwen3-235B-A22B-Thinking-2507 — which is better?

Choose Sarvam-105B if…

Choose Qwen3-235B-A22B-Thinking-2507 if…

Performance Benchmarks

Arena Performance

Model Size

Context Window

License

Release Timeline

Knowledge Cutoff

Outputs Comparison

Key Takeaways

Sarvam-105B

Qwen3-235B-A22B-Thinking-2507

Detailed Comparison

Which is better, Sarvam-105B or Qwen3-235B-A22B-Thinking-2507?

How does Sarvam-105B compare to Qwen3-235B-A22B-Thinking-2507 in benchmarks?

What are the context window sizes for Sarvam-105B and Qwen3-235B-A22B-Thinking-2507?

Who makes Sarvam-105B and Qwen3-235B-A22B-Thinking-2507?

Related comparisons

More Sarvam-105B comparisons

More Qwen3-235B-A22B-Thinking-2507 comparisons