Model Comparison

Mistral Large 3 vs Mistral Small 3 24B Base

The two models are evenly matched on the benchmarks they share: each leads on one of the two.

Performance Benchmarks

Comparative analysis across standard metrics

2 benchmarks

Mistral Large 3 outperforms on 1 benchmark (MATH), while Mistral Small 3 24B Base is better on 1 benchmark (TriviaQA).
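
The tally above can be reproduced from the two shared scores quoted on this page (MATH 90.4% vs 46.0%, TriviaQA 80.3% vs 74.9%). The sketch below is illustrative only: `tally_wins` and `margins` are hypothetical helpers, not part of any llm-stats.com tooling. It also reports the gap on each benchmark, since an even win count says nothing about how large each lead is.

```python
# Scores (percent) on the two benchmarks both models report, as quoted on this page.
shared_scores = {
    "MATH": {"Mistral Large 3": 90.4, "Mistral Small 3 24B Base": 46.0},
    "TriviaQA": {"Mistral Large 3": 74.9, "Mistral Small 3 24B Base": 80.3},
}


def tally_wins(scores):
    """Return {model: number of benchmarks it leads}; a tie credits nobody."""
    wins = {}
    for per_model in scores.values():
        for model in per_model:
            wins.setdefault(model, 0)
        best = max(per_model.values())
        leaders = [m for m, s in per_model.items() if s == best]
        if len(leaders) == 1:
            wins[leaders[0]] += 1
    return wins


def margins(scores):
    """Return {benchmark: gap in percentage points between best and worst score}."""
    return {
        name: round(max(per_model.values()) - min(per_model.values()), 1)
        for name, per_model in scores.items()
    }


print(tally_wins(shared_scores))  # one win each
print(margins(shared_scores))     # MATH gap is 44.4 points, TriviaQA gap is 5.4
```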

Thu May 28 2026 • llm-stats.com

Arena Performance

Human preference votes

Model Size

Parameter count comparison

651.4B diff

Mistral Large 3 has 651.4B more parameters than Mistral Small 3 24B Base, making it 2760.2% larger.

Mistral Large 3

675.0B parameters

Mistral Small 3 24B Base

23.6B parameters
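
The size figures above follow from simple arithmetic on the two parameter counts. A minimal sketch, using only the numbers quoted on this page (the variable names are illustrative):

```python
large_b = 675.0  # Mistral Large 3, billions of parameters (from this page)
small_b = 23.6   # Mistral Small 3 24B Base, billions of parameters (from this page)

diff_b = large_b - small_b           # absolute difference in billions
pct_larger = diff_b / small_b * 100  # how much larger, relative to the smaller model
ratio = large_b / small_b            # size expressed as a multiple

print(f"{diff_b:.1f}B difference")   # 651.4B difference
print(f"{pct_larger:.1f}% larger")   # 2760.2% larger
print(f"{ratio:.1f}x the size")      # 28.6x the size
```

Note that "2760.2% larger" and "28.6x the size" describe the same gap: the percentage measures the difference relative to the smaller model, while the multiple measures the full size.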

Context Window

Maximum input and output token capacity

Only Mistral Large 3 specifies input context (128,000 tokens). Only Mistral Large 3 specifies output context (8,192 tokens).

Mistral Large 3

Input: 128,000 tokens

Output: 8,192 tokens

Mistral Small 3 24B Base

Input: not specified

Output: not specified
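
When limits are missing for one model, a budget check has to treat "unspecified" as a distinct outcome rather than as zero or unlimited. The sketch below is one way to express that, under two assumptions: the input and output limits are checked independently (the page does not say whether they share one budget), and token counts are supplied by the caller. `LIMITS` and `fits` are hypothetical names, not part of any Mistral AI or llm-stats.com API.

```python
from typing import Optional

# Limits as listed on this page; None means the page gives no figure.
LIMITS = {
    "Mistral Large 3": {"input": 128_000, "output": 8_192},
    "Mistral Small 3 24B Base": {"input": None, "output": None},
}


def fits(model: str, prompt_tokens: int, output_tokens: int) -> Optional[bool]:
    """Return True/False if both limits are known, or None if either is unspecified."""
    limits = LIMITS[model]
    if limits["input"] is None or limits["output"] is None:
        return None  # cannot decide without a documented limit
    return prompt_tokens <= limits["input"] and output_tokens <= limits["output"]


print(fits("Mistral Large 3", 100_000, 4_000))          # True
print(fits("Mistral Large 3", 130_000, 4_000))          # False
print(fits("Mistral Small 3 24B Base", 100_000, 4_000)) # None
```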

Input Capabilities

Supported data types and modalities

Both Mistral Large 3 and Mistral Small 3 24B Base support multimodal inputs.

They are both capable of processing various types of data, offering versatility in application.

Mistral Large 3

Text

Images

Audio

Video

Mistral Small 3 24B Base

Text

Images

Audio

Video

License

Usage and distribution terms

Both models are licensed under Apache 2.0.

Both models share the same licensing terms, providing consistent usage rights.

Mistral Large 3

Apache 2.0

Open weights

Mistral Small 3 24B Base

Apache 2.0

Open weights

Release Timeline

When each model was launched

Mistral Large 3 was released on 2025-09-01, while Mistral Small 3 24B Base was released on 2025-01-30.

Mistral Large 3 is 7 months newer than Mistral Small 3 24B Base.

Mistral Large 3

Sep 1, 2025

8 months ago

7 months newer

Mistral Small 3 24B Base

Jan 30, 2025

1.3 years ago
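
The relative ages above can be checked against the release dates and the page date (May 28, 2026). This is a sketch under the assumption that month gaps are counted as whole elapsed calendar months and year gaps are rounded to one decimal; the page's own rounding rule is not documented, and `whole_months_between` is a hypothetical helper.

```python
from datetime import date


def whole_months_between(start: date, end: date) -> int:
    """Count complete calendar months elapsed from start to end."""
    months = (end.year - start.year) * 12 + (end.month - start.month)
    if end.day < start.day:
        months -= 1  # the final month has not fully elapsed yet
    return months


small_release = date(2025, 1, 30)  # Mistral Small 3 24B Base
large_release = date(2025, 9, 1)   # Mistral Large 3
page_date = date(2026, 5, 28)      # date shown on this page

print(whole_months_between(small_release, large_release))   # 7
print(whole_months_between(large_release, page_date))       # 8
print(round((page_date - small_release).days / 365.25, 1))  # 1.3
```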

Knowledge Cutoff

When training data ends

Mistral Small 3 24B Base has a documented knowledge cutoff of 2023-10-01, while Mistral Large 3's cutoff date is not specified.

Mistral Small 3 24B Base's training data extends to 2023-10-01, but without a stated cutoff for Mistral Large 3 the two cannot be compared directly.

Mistral Large 3

—

Mistral Small 3 24B Base

Oct 2023

Key Takeaways

Mistral Large 3

Mistral AI

Larger context window (128,000 tokens)

Higher MATH score (90.4% vs 46.0%)

Mistral Small 3 24B Base

Mistral AI

Higher TriviaQA score (80.3% vs 74.9%)

Detailed Comparison

AI Model Comparison Table
Feature	Mistral Large 3	Mistral Small 3 24B Base

FAQ

Common questions about Mistral Large 3 vs Mistral Small 3 24B Base.

Which is better, Mistral Large 3 or Mistral Small 3 24B Base?

Both models are evenly matched across the benchmarks: each leads on one of the two they share. Both are made by Mistral AI. The best choice depends on your use case, so weigh their benchmark scores, pricing, and capabilities.

How does Mistral Large 3 compare to Mistral Small 3 24B Base in benchmarks?

Mistral Large 3 scores 90.4% on MATH, 84.9% on MM-MT-Bench, 82.0% on MMLU-Redux, 74.9% on TriviaQA, and 74.2% on MMMLU. Mistral Small 3 24B Base scores 91.3% on ARC-C, 80.7% on GSM8k, 80.7% on MMLU, 80.3% on TriviaQA, and 69.6% on MBPP.

What are the context window sizes for Mistral Large 3 and Mistral Small 3 24B Base?

Mistral Large 3 supports 128K input tokens; no context window is specified for Mistral Small 3 24B Base. A larger context window lets you process longer documents, conversations, or codebases in a single request.