Qwen3-235B-A22B-Thinking-2507: Benchmarks, Pricing & Context Window
Qwen3-235B-A22B-Thinking-2507 is a language model from Qwen, released in July 2025.
Qwen3-235B-A22B-Thinking-2507 is a state-of-the-art thinking-enabled Mixture-of-Experts (MoE) model with 235B total parameters, of which 22B are activated per token. It has 94 layers and 128 experts (8 activated per token), and supports a native context length of 262K tokens.
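To make the "235B total, 22B activated" figures concrete, the short sketch below computes what share of the model is active for a single token. It uses only the numbers quoted above; the variable names are illustrative, not part of any Qwen tooling.

```python
# Figures taken from the model description above.
TOTAL_PARAMS_B = 235   # total parameters, in billions
ACTIVE_PARAMS_B = 22   # parameters activated per token, in billions
NUM_EXPERTS = 128      # experts in the MoE
ACTIVE_EXPERTS = 8     # experts activated per token

# Share of all parameters that participate in one forward pass for a token.
active_param_fraction = ACTIVE_PARAMS_B / TOTAL_PARAMS_B

# Share of experts the router selects for a token.
active_expert_fraction = ACTIVE_EXPERTS / NUM_EXPERTS

print(f"Active parameters: {active_param_fraction:.1%}")  # 9.4%
print(f"Active experts:    {active_expert_fraction:.2%}")  # 6.25%
```

The active-parameter share is higher than the active-expert share, presumably because components outside the experts (such as attention layers and embeddings) run for every token regardless of routing.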
Qwen3-235B-A22B-Thinking-2507 pricing
Providers
Qwen3-235B-A22B-Thinking-2507 starts at $0.300 per million input tokens and $3.00 per million output tokens via Fireworks. See all 2 providers below with their per-token pricing, latency, throughput, and modality support.
| Provider | Input $/M | Output $/M | Max Input | Max Output | Latency (s) | Throughput | Quant | Input modalities | Output modalities |
|---|---|---|---|---|---|---|---|---|---|
| | $0.300 | $3.00 | 262.1K | 131.1K | — | — | — | | |
| | $0.300 | $3.00 | 256.0K | 32.8K | — | — | fp8 | | |
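As a rough illustration of how the per-token prices and limits in the table translate into a request budget, here is a minimal sketch. The prices come from the table. The token limits are the table's rounded figures multiplied by 1,000, so the exact limits may differ slightly. `ROW_1` and `ROW_2` are hypothetical labels for the two table rows, since the provider names are not shown. The sketch also assumes that a thinking model's reasoning tokens are billed as output tokens, which may vary by provider.

```python
from dataclasses import dataclass

# Prices from the pricing table (USD per million tokens).
INPUT_USD_PER_M = 0.300
OUTPUT_USD_PER_M = 3.00


@dataclass(frozen=True)
class ProviderLimits:
    """Token limits for one provider row (rounded, as displayed in the table)."""
    max_input_tokens: int
    max_output_tokens: int


# Hypothetical labels for the two table rows; values are rounded approximations.
ROW_1 = ProviderLimits(max_input_tokens=262_100, max_output_tokens=131_100)
ROW_2 = ProviderLimits(max_input_tokens=256_000, max_output_tokens=32_800)


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost of one request at the listed per-million-token prices."""
    return (
        input_tokens * INPUT_USD_PER_M + output_tokens * OUTPUT_USD_PER_M
    ) / 1_000_000


def fits(limits: ProviderLimits, input_tokens: int, output_tokens: int) -> bool:
    """Check whether a request stays within a provider's input and output limits."""
    return (
        input_tokens <= limits.max_input_tokens
        and output_tokens <= limits.max_output_tokens
    )


if __name__ == "__main__":
    # Example: a 10,000-token prompt with a 2,000-token response.
    print(f"${estimate_cost_usd(10_000, 2_000):.4f}")
    # Example: a long request that only the first row can accommodate.
    print(fits(ROW_1, 200_000, 50_000), fits(ROW_2, 200_000, 50_000))
```

Because output tokens cost ten times as much as input tokens at these prices, long reasoning traces are likely to dominate the bill: the 2,000 output tokens in the first example cost twice as much as the 10,000 input tokens.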
Qwen3-235B-A22B-Thinking-2507 API
API access coming soon
Qwen3-235B-A22B-Thinking-2507 will be available through our gateway shortly.
Qwen3-235B-A22B-Thinking-2507 examples
Recent arena outputs from Qwen3-235B-A22B-Thinking-2507, picked from the highest-ranked matchups.
Qwen3-235B-A22B-Thinking-2507 license
Qwen3-235B-A22B-Thinking-2507 is released under the Apache 2.0 license, which permits commercial use. The model has 235.0B parameters.
- License: Apache 2.0 (commercial use allowed)
- Parameters: 235.0B
FAQ
Common questions about Qwen3-235B-A22B-Thinking-2507.