LiveMathematicianBench
Progress Over Time
Interactive timeline showing model performance evolution on LiveMathematicianBench
LiveMathematicianBench Leaderboard
| Context | Cost | License | ||||
|---|---|---|---|---|---|---|
| 1 | ByteDance | — | — | — | ||
| 2 | Seed 2.1 ProNew ByteDance | — | — | — |
What is LiveMathematicianBench?
LiveMathematicianBench evaluates research-level mathematical reasoning on continuously refreshed, contamination-resistant problems.
LiveMathematicianBench is a text benchmark evaluating models on math and reasoning tasks. LLM Stats tracks 2 models on this benchmark, scored on a 0–1 scale. The current average is 0.2, with the leader at 0.3.
Compare leaders on the best AI for math and best AI for reasoning leaderboards.
Current leaders
Seed 2.1 Turbo from ByteDance currently leads the LiveMathematicianBench leaderboard with a score of 0.277 across 2 evaluated AI models.
FAQ
Common questions about the LiveMathematicianBench benchmark and leaderboard.