OneMillion Bench

Name: OneMillion Bench Leaderboard — AI Model Scores
Creator: LLM Stats
License: https://llm-stats.com/legal/terms-of-service

Progress Over Time

Interactive timeline showing model performance evolution on OneMillion Bench

State-of-the-art frontier

Open

Proprietary

OneMillion Bench Leaderboard

2 models

				Context	Cost	License
1	Seed 2.1 ProNew ByteDance		—	—	—
2	Seed 2.1 TurboNew ByteDance		—	—	—

Notice missing or incorrect data?

About this benchmark

What is OneMillion Bench?

OneMillion Bench evaluates AI agents on high-economic-value tasks that require sustained, reliable execution across long-horizon real-world workflows.

OneMillion Bench is a text benchmark evaluating models on reasoning, long context, and agents tasks. LLM Stats tracks 2 models on this benchmark, scored on a 0–1 scale. The current average is 0.7, with the leader at 0.7.

Compare leaders on the best AI for reasoning, best AI for long context and best AI for agents leaderboards.

Current leaders

Seed 2.1 Pro from ByteDance currently leads the OneMillion Bench leaderboard with a score of 0.688 across 2 evaluated AI models.

Seed 2.1 ProByteDance68.8%

Seed 2.1 TurboByteDance66.6%

FAQ

Common questions about the OneMillion Bench benchmark and leaderboard.

What is the OneMillion Bench benchmark?

OneMillion Bench evaluates AI agents on high-economic-value tasks that require sustained, reliable execution across long-horizon real-world workflows.

What is the OneMillion Bench leaderboard?

The OneMillion Bench leaderboard ranks 2 AI models based on their performance on this benchmark. Currently, Seed 2.1 Pro by ByteDance leads with a score of 0.688. The average score across all models is 0.677.

What is the highest OneMillion Bench score?

The highest OneMillion Bench score is 0.688, achieved by Seed 2.1 Pro from ByteDance.

How many models are evaluated on OneMillion Bench?

2 models have been evaluated on the OneMillion Bench benchmark, with 0 verified results and 2 self-reported results.

What categories does OneMillion Bench cover?

OneMillion Bench is categorized under reasoning, long context, and agents. The benchmark evaluates text models.

How recent are the OneMillion Bench leaderboard results?

The OneMillion Bench leaderboard was last updated in June 2026 and currently includes 2 evaluated models.