Model Comparison
Codestral-22B vs DeepSeek R1 Distill Qwen 32B
Comparing Codestral-22B and DeepSeek R1 Distill Qwen 32B across benchmarks, pricing, and capabilities.
Performance Benchmarks
Comparative analysis across standard metrics
Codestral-22B and DeepSeek R1 Distill Qwen 32B share no common benchmark datasets, so their scores cannot be compared head to head; each model was evaluated on a different test suite.
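The absence of overlap is straightforward to check mechanically by intersecting the two benchmark suites. A minimal sketch in Python, where both benchmark lists are hypothetical stand-ins for whatever each model card actually reports:

```python
# Hypothetical benchmark suites for illustration only; substitute the
# datasets actually reported on each model's card.
codestral_benchmarks = {"HumanEval", "MBPP", "CruxEval", "RepoBench"}
deepseek_benchmarks = {"AIME 2024", "MATH-500", "GPQA Diamond", "LiveCodeBench"}

common = codestral_benchmarks & deepseek_benchmarks
if common:
    print(f"Comparable on: {sorted(common)}")
else:
    print("No shared benchmarks; scores cannot be compared directly.")
```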
Arena Performance
Human preference votes
No human preference (arena) data is available for either model.
Pricing Analysis
Price comparison per million tokens
Cost data unavailable.
Model Size
Parameter count comparison
DeepSeek R1 Distill Qwen 32B has 10.6B more parameters than Codestral-22B, making it 47.7% larger.
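Those two figures are mutually consistent, as a quick check shows. The per-model counts of 22.2B and 32.8B used below are our assumption, inferred from the stated 10.6B gap; published counts may differ slightly:

```python
# Assumed parameter counts in billions, inferred from the 10.6B gap
# stated above; treat as approximations, not authoritative figures.
codestral_params = 22.2
deepseek_params = 32.8

diff = deepseek_params - codestral_params   # 10.6B more parameters
pct = diff / codestral_params * 100         # relative to Codestral-22B
print(f"+{diff:.1f}B parameters ({pct:.1f}% larger)")
# prints: +10.6B parameters (47.7% larger)
```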
Context Window
Maximum input and output token capacity
Only DeepSeek R1 Distill Qwen 32B publishes context limits: 128,000 tokens for both input and output. Codestral-22B does not specify comparable figures.
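If you want to confirm the published limit yourself, the model's Hugging Face config exposes it. A minimal sketch, assuming the weights are hosted under the `deepseek-ai/DeepSeek-R1-Distill-Qwen-32B` repository id and that `max_position_embeddings` reflects the advertised 128K window:

```python
from transformers import AutoConfig

# Downloads only the small config.json, not the 32B weights.
config = AutoConfig.from_pretrained("deepseek-ai/DeepSeek-R1-Distill-Qwen-32B")

# Maximum sequence length the model was configured for;
# expected to report 131072 (i.e. 128K tokens).
print(config.max_position_embeddings)
```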
License
Usage and distribution terms
Codestral-22B is licensed under MNPL-0.1 (the Mistral AI Non-Production License), while DeepSeek R1 Distill Qwen 32B uses MIT.
These terms differ materially: MIT permits unrestricted commercial and open-source use, whereas MNPL-0.1 limits the model to non-production purposes such as research and testing.
- Codestral-22B: MNPL-0.1 (open weights)
- DeepSeek R1 Distill Qwen 32B: MIT (open weights)
Release Timeline
When each model was launched
Codestral-22B was released on 2024-05-29; DeepSeek R1 Distill Qwen 32B followed on 2025-01-20, making it just under eight months newer.
- Codestral-22B: May 29, 2024
- DeepSeek R1 Distill Qwen 32B: January 20, 2025
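The exact gap is 236 days, which is why it rounds to either seven or eight months depending on convention; a quick check with Python's standard library:

```python
from datetime import date

# Release dates as stated above.
codestral = date(2024, 5, 29)
deepseek_distill = date(2025, 1, 20)

days = (deepseek_distill - codestral).days
print(f"{days} days ~= {days / 30.44:.1f} months")
# prints: 236 days ~= 7.8 months
```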
Knowledge Cutoff
When training data ends
Neither model specifies a knowledge cutoff date, so the recency of their training data cannot be compared.
Detailed Comparison
| Feature | Codestral-22B | DeepSeek R1 Distill Qwen 32B |
|---|---|---|
| Parameters | ~22.2B | ~32.8B |
| Input context | Not specified | 128,000 tokens |
| Output context | Not specified | 128,000 tokens |
| License | MNPL-0.1 (open weights) | MIT (open weights) |
| Released | May 29, 2024 | January 20, 2025 |
| Knowledge cutoff | Not specified | Not specified |
| Pricing | Unavailable | Unavailable |