OneMillion Bench
Progress Over Time
Interactive timeline showing model performance evolution on OneMillion Bench
OneMillion Bench Leaderboard
| Context | Cost | License | ||||
|---|---|---|---|---|---|---|
| 1 | Seed 2.1 ProNew ByteDance | — | — | — | ||
| 2 | ByteDance | — | — | — |
What is OneMillion Bench?
OneMillion Bench evaluates AI agents on high-economic-value tasks that require sustained, reliable execution across long-horizon real-world workflows.
OneMillion Bench is a text benchmark evaluating models on reasoning, long context, and agents tasks. LLM Stats tracks 2 models on this benchmark, scored on a 0–1 scale. The current average is 0.7, with the leader at 0.7.
Compare leaders on the best AI for reasoning, best AI for long context and best AI for agents leaderboards.
Current leaders
Seed 2.1 Pro from ByteDance currently leads the OneMillion Bench leaderboard with a score of 0.688 across 2 evaluated AI models.
FAQ
Common questions about the OneMillion Bench benchmark and leaderboard.