USAMO25
The 2025 United States of America Mathematical Olympiad (USAMO) benchmark consists of six challenging mathematical problems requiring rigorous proof-based reasoning. USAMO is the most prestigious high school mathematics competition in the United States, serving as the final round of the American Mathematics Competitions series. This benchmark evaluates models on mathematical problem-solving capabilities beyond simple numerical computation, focusing on formal mathematical reasoning and proof generation.
Progress Over Time
Interactive timeline showing model performance evolution on USAMO25
State-of-the-art frontier
Open
Proprietary
USAMO25 Leaderboard
2 models
| Context | Cost | License | ||||
|---|---|---|---|---|---|---|
| 1 | xAI | — | — | — | ||
| 2 | xAI | — | — | — |
Notice missing or incorrect data?
FAQ
Common questions about USAMO25
The 2025 United States of America Mathematical Olympiad (USAMO) benchmark consists of six challenging mathematical problems requiring rigorous proof-based reasoning. USAMO is the most prestigious high school mathematics competition in the United States, serving as the final round of the American Mathematics Competitions series. This benchmark evaluates models on mathematical problem-solving capabilities beyond simple numerical computation, focusing on formal mathematical reasoning and proof generation.
The USAMO25 paper is available at https://arxiv.org/abs/2503.21934. This paper provides detailed information about the benchmark methodology, dataset creation, and evaluation criteria.
The USAMO25 leaderboard ranks 2 AI models based on their performance on this benchmark. Currently, Grok-4 Heavy by xAI leads with a score of 0.619. The average score across all models is 0.497.
The highest USAMO25 score is 0.619, achieved by Grok-4 Heavy from xAI.
2 models have been evaluated on the USAMO25 benchmark, with 0 verified results and 2 self-reported results.
USAMO25 is categorized under math and reasoning. The benchmark evaluates text models.