- Organizations
- Qwen
- QwQ-32B-Preview
QwQ-32B-Preview: Benchmarks, Pricing & Context Window
QwQ-32B-Preview is a language model from Qwen, released in November 2024.
An experimental research model focused on advancing AI reasoning capabilities, particularly excelling in mathematics and programming. Features deep introspection and self-questioning abilities while having some limitations in language
QwQ-32B-Preview pricing
Providers
QwQ-32B-Preview starts at $0.150 per million input tokens and $0.600 per million output tokens via DeepInfra. See all 4 providers below with their per-token pricing, latency, throughput, and modality support.
| Provider | Input $/M | Output $/M | Max Input | Max Output | Latency s | Throughput | Quant | Input | Output |
|---|---|---|---|---|---|---|---|---|---|
| $0.150 | $0.600 | 32.8K | 32.8K | 0.44 | 76 c/s | — | |||
| $0.200 | $0.200 | 32.8K | 32.8K | 1.05 | 32 c/s | — | |||
| $0.890 | $0.890 | 32.8K | 32.8K | 0.53 | 99 c/s | — | |||
| $1.20 | $1.20 | 32.8K | 32.8K | 0.74 | 62 c/s | — |
QwQ-32B-Preview API
API access coming soon
QwQ-32B-Preview will be available through our gateway shortly.
QwQ-32B-Preview examples
Recent arena outputs from QwQ-32B-Preview, picked from the highest-ranked matchups.
QwQ-32B-Preview license
QwQ-32B-Preview is released under the Apache 2.0 license, which permits commercial use, has 32.5B parameters, has a knowledge cutoff of November 2024.
- License
- Apache 2.0
- Commercial use allowed
- Parameters
- 32.5B
- Knowledge cutoff
- November 2024
Apache License 2.0 - allows commercial use
FAQ
Common questions about QwQ-32B-Preview.