USAMO25

The 2025 United States of America Mathematical Olympiad (USAMO) benchmark consists of six challenging mathematical problems requiring rigorous proof-based reasoning. USAMO is the most prestigious high school mathematics competition in the United States, serving as the final round of the American Mathematics Competitions series. This benchmark evaluates models on mathematical problem-solving capabilities beyond simple numerical computation, focusing on formal mathematical reasoning and proof generation.

Paper

Progress Over Time

Interactive timeline showing model performance evolution on USAMO25

State-of-the-art frontier
Open
Proprietary

USAMO25 Leaderboard

2 models
ContextCostLicense
1
2
Notice missing or incorrect data?

FAQ

Common questions about USAMO25

The 2025 United States of America Mathematical Olympiad (USAMO) benchmark consists of six challenging mathematical problems requiring rigorous proof-based reasoning. USAMO is the most prestigious high school mathematics competition in the United States, serving as the final round of the American Mathematics Competitions series. This benchmark evaluates models on mathematical problem-solving capabilities beyond simple numerical computation, focusing on formal mathematical reasoning and proof generation.
The USAMO25 paper is available at https://arxiv.org/abs/2503.21934. This paper provides detailed information about the benchmark methodology, dataset creation, and evaluation criteria.
The USAMO25 leaderboard ranks 2 AI models based on their performance on this benchmark. Currently, Grok-4 Heavy by xAI leads with a score of 0.619. The average score across all models is 0.497.
The highest USAMO25 score is 0.619, achieved by Grok-4 Heavy from xAI.
2 models have been evaluated on the USAMO25 benchmark, with 0 verified results and 2 self-reported results.
USAMO25 is categorized under math and reasoning. The benchmark evaluates text models.