DeepSeek R1 Distill Llama 8B

Name: DeepSeek R1 Distill Llama 8B
Author: DeepSeek

DeepSeek·Jan 2025·MIT

DeepSeek-R1 is the first-generation reasoning model built atop DeepSeek-V3 (671B total parameters, 37B activated per token). It incorporates large-scale reinforcement learning (RL) to enhance its chain-of-thought and reasoning

Parameters8.0B

Benchmarks

Arena Performance

DeepSeek R1 Distill Llama 8B Performance Across Datasets

Scores sourced from the model's scorecard, paper, or official blog posts

llm-stats.com - Wed Feb 25 2026

Notice missing or incorrect data?Start an Issue discussion→

FAQ

Common questions about DeepSeek R1 Distill Llama 8B

DeepSeek R1 Distill Llama 8B was released on January 20, 2025 by DeepSeek.

DeepSeek R1 Distill Llama 8B was created by DeepSeek.

DeepSeek R1 Distill Llama 8B has 8.0 billion parameters.

DeepSeek R1 Distill Llama 8B is released under the MIT license. This is an open-source/open-weight license.

DeepSeek R1 Distill Llama 8B

Benchmarks

Arena Performance

DeepSeek R1 Distill Llama 8B Performance Across Datasets

FAQ

When was DeepSeek R1 Distill Llama 8B released?

Who created DeepSeek R1 Distill Llama 8B?

How many parameters does DeepSeek R1 Distill Llama 8B have?

What is the license for DeepSeek R1 Distill Llama 8B?