DeepSeek logo

DeepSeek R1 Distill Qwen 32B

DeepSeek
deepseek-r1-distill-qwen-32bVariant

Overview

DeepSeek-R1 is the first-generation reasoning model built atop DeepSeek-V3 (671B total parameters, 37B activated per token). It incorporates large-scale reinforcement learning (RL) to enhance its chain-of-thought and reasoning capabilities, delivering strong performance in math, code, and multi-step reasoning tasks.

DeepSeek R1 Distill Qwen 32B was released on January 20, 2025. API access is available through DeepInfra.

Performance

Timeline

Release DateUnknown
Knowledge CutoffUnknown

Other Details

Parameters
32.8B
License
MIT
Training Data
Unknown

Related Models

Compare DeepSeek R1 Distill Qwen 32B to other models by quality (GPQA score) vs cost. Higher scores and lower costs represent better value.

Performance visualization loading...

Gathering benchmark data from similar models

Benchmarks

DeepSeek R1 Distill Qwen 32B Performance Across Datasets

Scores sourced from the model's scorecard, paper, or official blog posts

LLM Stats Logollm-stats.com - Tue Dec 16 2025
Notice missing or incorrect data?Start an Issue discussion

Pricing

Pricing, performance, and capabilities for DeepSeek R1 Distill Qwen 32B across different providers:

ProviderInput ($/M)Output ($/M)Max InputMax OutputLatency (s)ThroughputQuantizationInputOutput
DeepInfra logo
DeepInfra
$0.12$0.18128.0K128.0K0.6537.0 tok/s
Text
Image
Audio
Video
Text
Image
Audio
Video

Example Outputs

Recent Posts

Recent Reviews

API Access

API Access Coming Soon

API access for DeepSeek R1 Distill Qwen 32B will be available soon through our gateway.

FAQ

Common questions about DeepSeek R1 Distill Qwen 32B

DeepSeek R1 Distill Qwen 32B was released on January 20, 2025.
DeepSeek R1 Distill Qwen 32B has 32.8 billion parameters.