Model Comparison

Mistral Large 3 vs Mistral Small 3 24B Base

The two models split the 2 benchmarks they share, with one win each.

Performance Benchmarks

Comparative analysis across standard metrics

2 benchmarks

Mistral Large 3 leads on 1 benchmark (MATH), while Mistral Small 3 24B Base leads on 1 benchmark (TriviaQA).
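The tally above can be reproduced from the two benchmarks both models report. This is a minimal sketch, assuming the scores listed elsewhere on this page (MATH 90.4% vs 46.0%, TriviaQA 74.9% vs 80.3%); the dictionary names and the `tally_wins` helper are illustrative and not part of any llm-stats API.

```python
# Scores (percent) as reported on this page. Only benchmarks that
# both models report are compared.
large_3 = {"MATH": 90.4, "TriviaQA": 74.9}
small_3_base = {"MATH": 46.0, "TriviaQA": 80.3}


def tally_wins(a: dict, b: dict) -> tuple:
    """Return (benchmarks where a scores higher, benchmarks where b scores higher)."""
    shared = sorted(a.keys() & b.keys())
    a_wins = [name for name in shared if a[name] > b[name]]
    b_wins = [name for name in shared if b[name] > a[name]]
    return a_wins, b_wins


large_wins, small_wins = tally_wins(large_3, small_3_base)
print(large_wins, small_wins)
```

A win count treats every benchmark equally, so it hides the size of each gap: the MATH margin here is 44.4 points, while the TriviaQA margin is 5.4 points.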

Thu May 28 2026 • llm-stats.com

Model Size

Parameter count comparison

651.4B diff

Mistral Large 3 has 651.4B more parameters than Mistral Small 3 24B Base, making it 2760.2% larger.
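The difference and percentage above follow directly from the two parameter counts. A short sketch of the arithmetic, using only the figures on this page (the variable names are illustrative):

```python
large_params_b = 675.0  # Mistral Large 3, billions of parameters (from this page)
small_params_b = 23.6   # Mistral Small 3 24B Base, billions of parameters

diff_b = large_params_b - small_params_b          # absolute difference
percent_larger = diff_b / small_params_b * 100    # difference relative to the smaller model
ratio = large_params_b / small_params_b           # size as a multiple of the smaller model

print(f"{diff_b:.1f}B more parameters")
print(f"{percent_larger:.1f}% larger")
print(f"{ratio:.1f}x the size")
```

Note that "2760.2% larger" and "about 28.6 times the size" describe the same gap; the percentage measures the difference, while the ratio measures the total.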

Mistral Large 3 (Mistral AI): 675.0B parameters
Mistral Small 3 24B Base (Mistral AI): 23.6B parameters

Context Window

Maximum input and output token capacity

Only Mistral Large 3 specifies a context window: 128,000 input tokens and 8,192 output tokens. No limits are listed for Mistral Small 3 24B Base.

Mistral Large 3 (Mistral AI): input 128,000 tokens, output 8,192 tokens
Mistral Small 3 24B Base (Mistral AI): input not specified, output not specified

Input Capabilities

Supported data types and modalities

Both Mistral Large 3 and Mistral Small 3 24B Base are listed as supporting multimodal inputs, with the same set of input types shown for each.

Mistral Large 3

Text
Images
Audio
Video

Mistral Small 3 24B Base

Text
Images
Audio
Video

License

Usage and distribution terms

Both models are licensed under Apache 2.0, so their usage and distribution terms are identical.

Mistral Large 3: Apache 2.0, open weights
Mistral Small 3 24B Base: Apache 2.0, open weights

Release Timeline

When each model was launched

Mistral Large 3 was released on 2025-09-01, while Mistral Small 3 24B Base was released on 2025-01-30.

Mistral Large 3 is 7 months newer than Mistral Small 3 24B Base.
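The "7 months newer" figure can be checked by counting whole calendar months between the two release dates given above. A minimal sketch using the standard library; `full_months_between` is a hypothetical helper, and counting only completed months is an assumption about how the page rounds.

```python
from datetime import date

large_release = date(2025, 9, 1)   # Mistral Large 3 (from this page)
small_release = date(2025, 1, 30)  # Mistral Small 3 24B Base (from this page)


def full_months_between(earlier: date, later: date) -> int:
    """Count whole calendar months elapsed from earlier to later."""
    months = (later.year - earlier.year) * 12 + (later.month - earlier.month)
    if later.day < earlier.day:
        months -= 1  # the final month is not yet complete
    return months


print(full_months_between(small_release, large_release))  # whole months
print((large_release - small_release).days)               # exact gap in days
```

The gap is 214 days, which is 7 completed months and just short of an eighth.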

Mistral Large 3: Sep 1, 2025 (8 months ago, 7 months newer)
Mistral Small 3 24B Base: Jan 30, 2025 (1.3 years ago)

Knowledge Cutoff

When training data ends

Mistral Small 3 24B Base has a documented knowledge cutoff of 2023-10-01, while Mistral Large 3's cutoff date is not specified, so the two cannot be compared directly.

Mistral Large 3: not specified
Mistral Small 3 24B Base: Oct 2023

Key Takeaways

Mistral Large 3: the only specified context window (128,000 input tokens)
Mistral Large 3: higher MATH score (90.4% vs 46.0%)
Mistral Small 3 24B Base: higher TriviaQA score (80.3% vs 74.9%)

FAQ

Common questions about Mistral Large 3 vs Mistral Small 3 24B Base.

Which is better, Mistral Large 3 or Mistral Small 3 24B Base?

The two models split the benchmarks they share: Mistral Large 3 leads on MATH and Mistral Small 3 24B Base leads on TriviaQA. Both are made by Mistral AI. The best choice depends on your use case, so compare the benchmark scores, model size, context window, and capabilities above.

How does Mistral Large 3 compare to Mistral Small 3 24B Base in benchmarks?

Mistral Large 3 scores MATH: 90.4%, MM-MT-Bench: 84.9%, MMLU-Redux: 82.0%, TriviaQA: 74.9%, MMMLU: 74.2%. Mistral Small 3 24B Base scores ARC-C: 91.3%, GSM8k: 80.7%, MMLU: 80.7%, TriviaQA: 80.3%, MBPP: 69.6%.

What are the context window sizes for Mistral Large 3 and Mistral Small 3 24B Base?

Mistral Large 3 supports 128K input tokens; no context window is specified for Mistral Small 3 24B Base. A larger context window lets you process longer documents, conversations, or codebases in a single request.
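To make the limits concrete, a request can be checked against the figures listed for Mistral Large 3 before it is sent. This is a rough sketch only: `fits_limits` is a hypothetical helper, token counts must come from the model's own tokenizer (not shown here), and treating the input and output limits as independent is an assumption rather than documented behavior.

```python
LARGE_3_MAX_INPUT = 128_000  # input tokens listed on this page
LARGE_3_MAX_OUTPUT = 8_192   # output tokens listed on this page


def fits_limits(
    prompt_tokens: int,
    requested_output_tokens: int,
    max_input: int = LARGE_3_MAX_INPUT,
    max_output: int = LARGE_3_MAX_OUTPUT,
) -> bool:
    """Return True if both token counts are within the listed limits."""
    return (
        0 <= prompt_tokens <= max_input
        and 0 <= requested_output_tokens <= max_output
    )


print(fits_limits(100_000, 4_000))  # within both limits
print(fits_limits(130_000, 1_000))  # prompt exceeds the input limit
print(fits_limits(1_000, 10_000))   # requested output exceeds the output limit
```

No equivalent check is possible for Mistral Small 3 24B Base from this page, since neither limit is listed.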