Model Comparison

Qwen3.5-397B-A17B vs Step-3.5-Flash

Both models are evenly matched across the benchmarks. Step-3.5-Flash is 7.7x cheaper per token.

Want to compare interactively?Try the playground

Performance Benchmarks

Comparative analysis across standard metrics

5 benchmarks

Qwen3.5-397B-A17B outperforms in 2 benchmarks (SWE-Bench Verified, Terminal-Bench 2.0), while Step-3.5-Flash is better at 2 benchmarks (IMO-AnswerBench, LiveCodeBench v6).

Both models are evenly matched across the benchmarks.

Fri May 22 2026 • llm-stats.com

Arena Performance

Human preference votes

Pricing Analysis

Price comparison per million tokens

Step-3.5-Flash costs less

For input processing, Qwen3.5-397B-A17B ($0.60/1M tokens) is 6.0x more expensive than Step-3.5-Flash ($0.10/1M tokens).

For output processing, Qwen3.5-397B-A17B ($3.60/1M tokens) is 9.0x more expensive than Step-3.5-Flash ($0.40/1M tokens).

In conclusion, Qwen3.5-397B-A17B is more expensive than Step-3.5-Flash.*

* Using a 3:1 ratio of input to output tokens

Lowest available price from all providers

Fri May 22 2026 • llm-stats.com

Qwen3.5-397B-A17B

Input tokens$0.60

Output tokens$3.60

Best providerNovita

Step-3.5-Flash

Input tokens$0.10

Output tokens$0.40

Best providerStepFun

Notice missing or incorrect data?Start an Issue→

Model Size

Parameter count comparison

201.0B diff

Qwen3.5-397B-A17B has 201.0B more parameters than Step-3.5-Flash, making it 102.6% larger.

Qwen3.5-397B-A17B

397.0Bparameters

Step-3.5-Flash

196.0Bparameters

397.0B

Qwen3.5-397B-A17B

196.0B

Step-3.5-Flash

Context Window

Maximum input and output token capacity

Qwen3.5-397B-A17B accepts 262,144 input tokens compared to Step-3.5-Flash's 65,536 tokens. Qwen3.5-397B-A17B can generate longer responses up to 64,000 tokens, while Step-3.5-Flash is limited to 8,192 tokens.

Qwen3.5-397B-A17B

Input262,144 tokens

Output64,000 tokens

Step-3.5-Flash

Input65,536 tokens

Output8,192 tokens

Fri May 22 2026 • llm-stats.com

Input Capabilities

Supported data types and modalities

Qwen3.5-397B-A17B supports multimodal inputs, whereas Step-3.5-Flash does not.

Qwen3.5-397B-A17B can handle both text and other forms of data like images, making it suitable for multimodal applications.

Qwen3.5-397B-A17B

Text

Images

Audio

Video

Step-3.5-Flash

Text

Images

Audio

Video

License

Usage and distribution terms

Both models are licensed under Apache 2.0.

Both models share the same licensing terms, providing consistent usage rights.

Qwen3.5-397B-A17B

Apache 2.0

Open weights

Step-3.5-Flash

Apache 2.0

Open weights

Release Timeline

When each model was launched

Qwen3.5-397B-A17B was released on 2026-02-16, while Step-3.5-Flash was released on 2026-02-02.

Qwen3.5-397B-A17B is 0 month newer than Step-3.5-Flash.

Qwen3.5-397B-A17B

Feb 16, 2026

3 months ago

2w newer

Step-3.5-Flash

Feb 2, 2026

3 months ago

Knowledge Cutoff

When training data ends

Neither model specifies a knowledge cutoff date.

Unable to compare the recency of their training data.

No cutoff dates available

Provider Availability

Qwen3.5-397B-A17B is available from Novita. Step-3.5-Flash is available from StepFun.

Qwen3.5-397B-A17B

Novita

Input Price:Input: $0.60/1MOutput Price:Output: $3.60/1M

Step-3.5-Flash

StepFun

Input Price:Input: $0.10/1MOutput Price:Output: $0.40/1M

* Prices shown are per million tokens

Outputs Comparison

Notice missing or incorrect data?Start an Issue discussion→

Key Takeaways

Qwen3.5-397B-A17B

View details

Alibaba Cloud / Qwen Team

Larger context window (262,144 tokens)

Supports multimodal inputs

Higher SWE-Bench Verified score (76.4% vs 74.4%)

Higher Terminal-Bench 2.0 score (52.5% vs 51.0%)

Step-3.5-Flash

View details

StepFun

Less expensive input tokens

Less expensive output tokens

Higher IMO-AnswerBench score (85.4% vs 80.9%)

Higher LiveCodeBench v6 score (86.4% vs 83.6%)

Detailed Comparison

AI Model Comparison Table
Feature	Qwen3.5-397B-A17B	Step-3.5-Flash

FAQ

Common questions about Qwen3.5-397B-A17B vs Step-3.5-Flash.

Which is better, Qwen3.5-397B-A17B or Step-3.5-Flash?

Both models are evenly matched across the benchmarks. Qwen3.5-397B-A17B is made by Alibaba Cloud / Qwen Team and Step-3.5-Flash is made by StepFun. The best choice depends on your use case — compare their benchmark scores, pricing, and capabilities above.

How does Qwen3.5-397B-A17B compare to Step-3.5-Flash in benchmarks?

Qwen3.5-397B-A17B scores MMLU-Redux: 94.9%, HMMT 2025: 94.8%, C-Eval: 93.0%, HMMT25: 92.7%, IFEval: 92.6%. Step-3.5-Flash scores AIME 2025: 97.3%, Tau-bench: 88.2%, LiveCodeBench v6: 86.4%, IMO-AnswerBench: 85.4%, SWE-Bench Verified: 74.4%.

Is Qwen3.5-397B-A17B cheaper than Step-3.5-Flash?

Step-3.5-Flash is 6.0x cheaper for input tokens. Qwen3.5-397B-A17B costs $0.60/M input and $3.60/M output via novita. Step-3.5-Flash costs $0.10/M input and $0.40/M output via stepfun.

What are the context window sizes for Qwen3.5-397B-A17B and Step-3.5-Flash?

Qwen3.5-397B-A17B supports 262K tokens and Step-3.5-Flash supports 66K tokens. A larger context window lets you process longer documents, conversations, or codebases in a single request.

What are the main differences between Qwen3.5-397B-A17B and Step-3.5-Flash?

Key differences include context window (262K vs 66K), input pricing ($0.60 vs $0.10/M), multimodal support (yes vs no). See the full comparison above for benchmark-by-benchmark results.

Who makes Qwen3.5-397B-A17B and Step-3.5-Flash?

Qwen3.5-397B-A17B is developed by Alibaba Cloud / Qwen Team and Step-3.5-Flash is developed by StepFun.