Name: QwQ-32B-Preview
Author: Qwen

Question 1

When was QwQ-32B-Preview released?

Accepted Answer

QwQ-32B-Preview was released on November 28, 2024 by Qwen. This is the official QwQ-32B-Preview release date tracked on LLM Stats.

Question 2

How much does QwQ-32B-Preview cost?

Accepted Answer

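QwQ-32B-Preview pricing starts at $0.15 per million input tokens and $0.60 per million output tokens via DeepInfra, the lowest input price among tracked providers. Hyperbolic has the lowest output price at $0.20 per million tokens, so the cheapest provider depends on the ratio of input to output tokens.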
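
As a rough illustration of how per-million-token pricing turns into a per-request cost, here is a minimal sketch using the DeepInfra prices quoted above. The function name and the example token counts are hypothetical, chosen only for illustration.

```python
# Prices in USD per million tokens, taken from the DeepInfra figures quoted above.
INPUT_PRICE_PER_M = 0.15
OUTPUT_PRICE_PER_M = 0.60


def request_cost(input_tokens: int, output_tokens: int) -> float:
    """Return the estimated cost in USD of one request."""
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


# Hypothetical request: 2,000 prompt tokens and 8,000 generated tokens.
cost = request_cost(2_000, 8_000)
print(f"${cost:.4f}")  # prints "$0.0051"
```

Because output tokens cost four times as much as input tokens at these prices, long generated responses dominate the bill.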

Question 3

Is QwQ-32B-Preview available via API?

Accepted Answer

Yes, QwQ-32B-Preview is available via API from 4 providers tracked on LLM Stats. See the official documentation for authentication and endpoint details.

Question 4

How big is QwQ-32B-Preview?

Accepted Answer

QwQ-32B-Preview has 32.5 billion parameters. It ships as an open-weight model, so you can download and run it on your own hardware.
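
For anyone sizing hardware, a back-of-the-envelope estimate of the memory needed for the weights alone can be sketched as below. The bytes-per-parameter values are assumptions about common weight formats, not figures from this page, and the estimate ignores the KV cache, activations, and runtime overhead.

```python
PARAMS = 32.5e9  # parameter count quoted above

# Assumed bytes per parameter for common weight formats (weights only).
BYTES_PER_PARAM = {"fp16/bf16": 2.0, "int8": 1.0, "int4": 0.5}


def weight_memory_gb(params: float, bytes_per_param: float) -> float:
    """Memory for the weights alone, in gigabytes (10**9 bytes)."""
    return params * bytes_per_param / 1e9


for fmt, size in BYTES_PER_PARAM.items():
    print(f"{fmt}: ~{weight_memory_gb(PARAMS, size):.1f} GB")
```

At 16-bit precision this comes to about 65 GB for the weights, so real deployments need headroom beyond that figure.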

Question 5

Who created QwQ-32B-Preview?

Accepted Answer

QwQ-32B-Preview was created by Qwen.

Question 6

What is the license for QwQ-32B-Preview?

Accepted Answer

QwQ-32B-Preview is released under the Apache 2.0 license. This is a permissive open-source license, so the open weights can be downloaded and self-hosted.

Question 7

What is the knowledge cutoff date for QwQ-32B-Preview?

Accepted Answer

QwQ-32B-Preview has a knowledge cutoff of November 2024, meaning it was trained on data up to that point and may not know about events after it.

Question 8

What is the latency of QwQ-32B-Preview?

Accepted Answer

The p95 time to first token (TTFT) for QwQ-32B-Preview is 0.44 seconds via DeepInfra over the trailing 7 days. A lower TTFT means the model begins responding sooner in chat, agent, and API workloads.

Question 9

Where can I use QwQ-32B-Preview?

Accepted Answer

QwQ-32B-Preview is available through 4 providers: DeepInfra, Hyperbolic, Fireworks, and Together.

Question 10

Where is the QwQ-32B-Preview paper or technical report?

Accepted Answer

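The technical report linked for QwQ-32B-Preview is https://arxiv.org/abs/2407.10671, which is the Qwen2 technical report. Use that source for architecture, training, and evaluation details of the Qwen2 model family; it predates QwQ-32B-Preview, so it does not cover this model's release specifically.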

Question 11

What models should I compare QwQ-32B-Preview against?

Accepted Answer

QwQ-32B-Preview is commonly compared against DeepSeek R1 Zero, DeepSeek R1 Distill Llama 70B, and QwQ-32B. Compare them side by side on benchmark scores, pricing, context window, latency, and API availability.

Provider	Input $/M tokens	Output $/M tokens	Context in / out (tokens)	TTFT p50 / p95 (s)	Output avg / p5 (c/s)	Success (7d)	Modalities in / out
DeepInfra	$0.150	$0.600	32.8K/32.8K	—/0.44	76/—	—	/
Hyperbolic	$0.200	$0.200	32.8K/32.8K	—/1.05	32/—	—	/
Fireworks	$0.890	$0.890	32.8K/32.8K	—/0.53	99/—	—	/
Together	$1.200	$1.200	32.8K/32.8K	—/0.74	62/—	—	/
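
Since input and output prices differ by provider, the cheapest option depends on the workload. The sketch below compares the four providers using the prices from the table above; the helper names and the example token volumes are hypothetical.

```python
# (input $/M, output $/M) per provider, copied from the table above.
PRICES = {
    "DeepInfra": (0.15, 0.60),
    "Hyperbolic": (0.20, 0.20),
    "Fireworks": (0.89, 0.89),
    "Together": (1.20, 1.20),
}


def workload_cost(provider: str, input_tokens: int, output_tokens: int) -> float:
    """Estimated cost in USD of a workload on one provider."""
    input_price, output_price = PRICES[provider]
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


def cheapest(input_tokens: int, output_tokens: int) -> str:
    """Name of the provider with the lowest estimated cost for the workload."""
    return min(PRICES, key=lambda p: workload_cost(p, input_tokens, output_tokens))


# Hypothetical workloads, in tokens.
print(cheapest(10_000_000, 100_000))    # prompt-heavy: DeepInfra
print(cheapest(1_000_000, 10_000_000))  # generation-heavy: Hyperbolic
```

At these prices, DeepInfra and Hyperbolic break even when output tokens equal one eighth of input tokens; above that ratio Hyperbolic is cheaper.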
