French MMLU

French MMLU Leaderboard

1 model
About this benchmark

What is French MMLU?

French MMLU is the French version of MMLU-Pro, taken from the multilingual MMLU-ProX benchmark, which evaluates language models' cross-lingual reasoning capabilities across 14 diverse domains including mathematics, physics, chemistry, law, engineering, psychology, and health.

French MMLU is a text benchmark that evaluates models on language, legal, reasoning, finance, general, and healthcare tasks. LLM Stats tracks 1 model on this benchmark, scored on a 0–1 scale. With a single entry, the average and the leading score are both 0.575.

Current leaders

Ministral 8B Instruct from Mistral AI currently leads the French MMLU leaderboard with a score of 0.575. It is the only model evaluated so far.

1. Ministral 8B Instruct (Mistral AI): 57.5%

Source paper

Title: MMLU-ProX: A Multilingual Benchmark for Advanced Large Language Model Evaluation
Authors: Weihao Xuan, Rui Yang, Heli Qi, Qingcheng Zeng, and 28 others
Abstract

Existing large language model (LLM) evaluation benchmarks primarily focus on English, while current multilingual tasks lack parallel questions that specifically assess cross-linguistic reasoning abilities. This dual limitation makes it challenging to comprehensively assess LLMs' performance in the multilingual setting. To fill this gap, we introduce MMLU-ProX, a comprehensive benchmark covering 29 languages, built on an English benchmark. Each language version consists of 11,829 identical questions, enabling direct cross-linguistic comparisons. Additionally, to meet efficient evaluation needs, we provide a lite version containing 658 questions per language. To ensure the high quality of MMLU-ProX, we employ a rigorous development process that involves multiple powerful LLMs for translation, followed by expert review to ensure accurate expression, consistent terminology, and cultural relevance. Building on this, we systematically evaluate 36 state-of-the-art LLMs, including reasoning-enhanced and multilingual-optimized LLMs. The results reveal significant disparities in the multilingual capabilities of LLMs: While they perform well in high-resource languages, their performance declines markedly in low-resource languages, with gaps of up to 24.3%. Through MMLU-ProX, we aim to advance the development of more inclusive AI systems and promote equitable access to technology across global contexts.

FAQ

Common questions about the French MMLU benchmark and leaderboard.

What is the French MMLU benchmark?

French version of MMLU-Pro, a multilingual benchmark for evaluating language models' cross-lingual reasoning capabilities across 14 diverse domains including mathematics, physics, chemistry, law, engineering, psychology, and health.

What is the French MMLU leaderboard?

The French MMLU leaderboard ranks AI models by their performance on this benchmark. It currently contains 1 model: Ministral 8B Instruct by Mistral AI, which leads with a score of 0.575. Because there is only one entry, the average score is also 0.575.
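As a rough illustration of how the leader, the average, and the percentage figure relate, here is a minimal Python sketch. The `rows` structure and the `summarize` helper are hypothetical names chosen for this example and are not part of LLM Stats; the single row is the one entry reported on this page.

```python
from statistics import mean

# Hypothetical leaderboard rows: (model, organization, score on a 0-1 scale).
# The only row is the single entry reported on this page.
rows = [("Ministral 8B Instruct", "Mistral AI", 0.575)]


def summarize(rows):
    """Return the leader, its score, the mean score, and the score as a percentage."""
    leader = max(rows, key=lambda r: r[2])  # highest score wins
    return {
        "leader": leader[0],
        "top_score": leader[2],
        "average": mean(r[2] for r in rows),
        "top_percent": f"{leader[2] * 100:.1f}%",
    }


summary = summarize(rows)
print(summary)
```

With one row, the average necessarily equals the top score, which is why both figures coincide here.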

What is the highest French MMLU score?

The highest French MMLU score is 0.575, achieved by Ministral 8B Instruct from Mistral AI.

How many models are evaluated on French MMLU?

One model has been evaluated on the French MMLU benchmark. Its result is self-reported; there are no verified results yet.

Where can I find the French MMLU paper?

The French MMLU paper is available at https://arxiv.org/abs/2503.10497. The paper details the methodology, dataset construction, and evaluation criteria.

What categories does French MMLU cover?

French MMLU is categorized under language, legal, reasoning, finance, general, and healthcare. The benchmark evaluates text models with multilingual support.

What is the best open-source model on French MMLU?

Ministral 8B Instruct by Mistral AI is the top-ranked open-source model on French MMLU, with a score of 0.575 (rank #1).

How recent are the French MMLU leaderboard results?

The French MMLU leaderboard was last updated in July 2026 and currently includes 1 evaluated model.