Model Comparison
DeepSeek-R1-0528 vs Ministral 3 (8B Reasoning 2512)
DeepSeek-R1-0528 leads on all four listed benchmarks. Ministral 3 (8B Reasoning 2512) is 6.1x cheaper per token at a 3:1 blend of input to output tokens.
Performance Benchmarks
Comparative analysis across standard metrics
DeepSeek-R1-0528 scores higher on all 4 listed benchmarks (AIME 2024, AIME 2025, GPQA, LiveCodeBench); Ministral 3 (8B Reasoning 2512) leads on none.
Pricing Analysis
Price comparison per million tokens
For input processing, DeepSeek-R1-0528 ($0.50/1M tokens) is 3.3x more expensive than Ministral 3 (8B Reasoning 2512) ($0.15/1M tokens).
For output processing, DeepSeek-R1-0528 ($2.15/1M tokens) is 14.3x more expensive than Ministral 3 (8B Reasoning 2512) ($0.15/1M tokens).
Overall, DeepSeek-R1-0528 is about 6.1x more expensive than Ministral 3 (8B Reasoning 2512) on a blended basis.*
* Blended price assumes a 3:1 ratio of input to output tokens.
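The blended figure can be reproduced with a short calculation. The sketch below assumes the blend is a simple weighted average of the listed input and output prices at the 3:1 ratio; the function name `blended_price` is hypothetical, and the page does not state its exact formula.

```python
def blended_price(input_price, output_price, input_parts=3, output_parts=1):
    """Weighted-average price per 1M tokens for a given input:output mix.

    Assumption: the page's blend is a plain weighted average.
    """
    total_parts = input_parts + output_parts
    return (input_price * input_parts + output_price * output_parts) / total_parts


# Prices per 1M tokens, taken from the comparison above.
deepseek_blended = blended_price(0.50, 2.15)
ministral_blended = blended_price(0.15, 0.15)

# How many times more DeepSeek-R1-0528 costs on the blended basis.
price_ratio = deepseek_blended / ministral_blended

print(f"DeepSeek-R1-0528 blended: ${deepseek_blended:.4f}/1M tokens")
print(f"Ministral 3 blended:      ${ministral_blended:.4f}/1M tokens")
print(f"Ratio: {price_ratio:.1f}x")
```

Under this assumption the ratio rounds to the 6.1x quoted above. Workloads that generate more output than the 3:1 mix implies will see a larger gap, since the output prices differ by 14.3x.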
Model Size
Parameter count comparison
DeepSeek-R1-0528 has 663.0B more parameters than Ministral 3 (8B Reasoning 2512), making it 8287.5% larger, or roughly 84 times the size.
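The size gap is easy to check. The sketch below assumes parameter counts of 671B and 8B; these are inferred from the 663.0B difference and the 8287.5% figure rather than stated on the page, and `size_gap` is a hypothetical helper.

```python
def size_gap(larger_b, smaller_b):
    """Return (absolute difference in billions, percent larger, size multiple)."""
    difference = larger_b - smaller_b
    percent_larger = difference / smaller_b * 100
    multiple = larger_b / smaller_b
    return difference, percent_larger, multiple


# Assumed parameter counts in billions (inferred, not quoted from the page).
difference, percent_larger, multiple = size_gap(671.0, 8.0)

print(f"{difference:.1f}B more parameters")
print(f"{percent_larger:.1f}% larger")
print(f"{multiple:.1f}x the size")
```

Note that "percent larger" and "times the size" differ by one multiple: 8287.5% larger corresponds to about 83.9 times the size.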
Context Window
Maximum input and output token capacity
Ministral 3 (8B Reasoning 2512) accepts 262,100 input tokens compared to DeepSeek-R1-0528's 131,072 tokens. Ministral 3 (8B Reasoning 2512) can generate longer responses up to 262,100 tokens, while DeepSeek-R1-0528 is limited to 131,072 tokens.
Input Capabilities
Supported data types and modalities
Ministral 3 (8B Reasoning 2512) supports multimodal inputs, whereas DeepSeek-R1-0528 does not.
Ministral 3 (8B Reasoning 2512) can handle both text and other forms of data like images, making it suitable for multimodal applications.
License
Usage and distribution terms
DeepSeek-R1-0528 is licensed under MIT, while Ministral 3 (8B Reasoning 2512) uses Apache 2.0.
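Both are permissive licenses that allow commercial use. Apache 2.0 additionally includes an explicit patent grant and requires preserving license notices, which may matter when redistributing the models in commercial or open-source projects.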
DeepSeek-R1-0528: MIT, open weights
Ministral 3 (8B Reasoning 2512): Apache 2.0, open weights
Release Timeline
When each model was launched
DeepSeek-R1-0528 was released on 2025-05-28, while Ministral 3 (8B Reasoning 2512) was released on 2025-12-04.
Ministral 3 (8B Reasoning 2512) is 6 months newer than DeepSeek-R1-0528.
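The gap between the two release dates can be verified with Python's standard `datetime` module. This is a minimal sketch; `whole_months_between` is a hypothetical helper that counts completed calendar months, which is one of several reasonable ways to round the gap.

```python
from datetime import date


def whole_months_between(start, end):
    """Count completed calendar months from start to end."""
    months = (end.year - start.year) * 12 + (end.month - start.month)
    if end.day < start.day:
        months -= 1  # the final month has not fully elapsed
    return months


# Release dates from the comparison above.
deepseek_release = date(2025, 5, 28)
ministral_release = date(2025, 12, 4)

gap_days = (ministral_release - deepseek_release).days
gap_months = whole_months_between(deepseek_release, ministral_release)

print(f"{gap_days} days, {gap_months} whole months")
```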
Knowledge Cutoff
When training data ends
Neither model specifies a knowledge cutoff date, so the recency of their training data cannot be compared.
Provider Availability
DeepSeek-R1-0528 is available from DeepInfra, DeepSeek, Novita. Ministral 3 (8B Reasoning 2512) is available from Mistral AI.