HorizonMath
Progress Over Time
Interactive timeline showing model performance evolution on HorizonMath
HorizonMath Leaderboard
| Context | Cost | License | ||||
|---|---|---|---|---|---|---|
| 1 | Seed 2.1 ProNew ByteDance | — | — | — | ||
| 1 | ByteDance | — | — | — |
What is HorizonMath?
HorizonMath is an extremely difficult frontier mathematics benchmark designed to test the limits of mathematical reasoning on research-level and competition-beyond problems.
HorizonMath is a text benchmark evaluating models on math and reasoning tasks. LLM Stats tracks 2 models on this benchmark, scored on a 0–1 scale. The current average is 0.0, with the leader at 0.0.
Compare leaders on the best AI for math and best AI for reasoning leaderboards.
Current leaders
Seed 2.1 Pro from ByteDance currently leads the HorizonMath leaderboard with a score of 0.020 across 2 evaluated AI models.
FAQ
Common questions about the HorizonMath benchmark and leaderboard.