Qwen3 30B A3B: Benchmarks, Pricing & Context Window
Qwen3 30B A3B is a language model from Qwen, released in April 2025.
Qwen3-30B-A3B is a smaller Mixture-of-Experts (MoE) model from the Qwen3 series by Alibaba, with 30.5 billion total parameters and 3.3 billion activated parameters. It features hybrid thinking/non-thinking modes and support for 119 languages.
Qwen3 30B A3B pricing
Providers
Qwen3 30B A3B starts at $0.100 per million input tokens and $0.300 per million output tokens via DeepInfra. See all 3 providers below with their per-token pricing, latency, throughput, and modality support.
| Provider | Input $/M | Output $/M | Max Input | Max Output | Latency (s) | Throughput | Quant | Input modality | Output modality |
|---|---|---|---|---|---|---|---|---|---|
| DeepInfra | $0.100 | $0.300 | 128.0K | 128.0K | 0.51 | 336 c/s | — | | |
| | $0.100 | $0.440 | 128.0K | 128.0K | 0.73 | 89 c/s | — | | |
| | $0.890 | $0.890 | 128.0K | 128.0K | 0.66 | 122 c/s | — | | |
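To make the per-million-token prices concrete, the following minimal sketch estimates the cost of a single request at each of the three listed price points. The `Pricing` class, the `request_cost` helper, and the example token counts are hypothetical names and values chosen for illustration; only the dollar figures come from the table, and only the cheapest price point is identified in the text (DeepInfra).

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Pricing:
    """Per-million-token prices in USD (hypothetical helper type)."""
    input_per_million: float
    output_per_million: float


def request_cost(pricing: Pricing, input_tokens: int, output_tokens: int) -> float:
    """Return the estimated USD cost of one request.

    Cost = (input tokens * input price + output tokens * output price) / 1,000,000.
    """
    total = (input_tokens * pricing.input_per_million
             + output_tokens * pricing.output_per_million)
    return total / 1_000_000


# Price points copied from the provider table, in row order.
PRICE_POINTS = {
    "row 1 (DeepInfra)": Pricing(0.100, 0.300),
    "row 2": Pricing(0.100, 0.440),
    "row 3": Pricing(0.890, 0.890),
}

# Assumed example workload: 10,000 input tokens and 2,000 output tokens.
for name, pricing in PRICE_POINTS.items():
    print(f"{name}: ${request_cost(pricing, 10_000, 2_000):.5f}")
```

Under these assumptions the cheapest and most expensive price points differ by roughly a factor of seven for this workload, so the input/output mix of a given application is worth checking before choosing a provider.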
Qwen3 30B A3B API
API access coming soon
Qwen3 30B A3B will be available through our gateway shortly.
Qwen3 30B A3B examples
Recent arena outputs from Qwen3 30B A3B, picked from the highest-ranked matchups.
Qwen3 30B A3B license
Qwen3 30B A3B is released under the Apache 2.0 license, which permits commercial use. The model has 30.5B total parameters.
- License: Apache 2.0 (commercial use allowed)
- Parameters: 30.5B
FAQ
Common questions about Qwen3 30B A3B.