
DeepSeek R1 Distill Qwen 7B
DeepSeekOverview
DeepSeek-R1 is the first-generation reasoning model built atop DeepSeek-V3 (671B total parameters, 37B activated per token). It incorporates large-scale reinforcement learning (RL) to enhance its chain-of-thought and reasoning capabilities, delivering strong performance in math, code, and multi-step reasoning tasks.
DeepSeek R1 Distill Qwen 7B was released on January 20, 2025.
Performance
Timeline
Other Details
Related Models
Compare DeepSeek R1 Distill Qwen 7B to other models by quality (GPQA score) vs cost. Higher scores and lower costs represent better value.
Performance visualization loading...
Gathering benchmark data from similar models
Benchmarks
DeepSeek R1 Distill Qwen 7B Performance Across Datasets
Scores sourced from the model's scorecard, paper, or official blog posts
Pricing
Pricing, performance, and capabilities for DeepSeek R1 Distill Qwen 7B across different providers:
Example Outputs
Recent Posts
Recent Reviews
API Access
API Access Coming Soon
API access for DeepSeek R1 Distill Qwen 7B will be available soon through our gateway.
FAQ
Common questions about DeepSeek R1 Distill Qwen 7B
