xDailyBench
xDailyBench Leaderboard
| Rank | Model | Organization | Context | Cost | License |
|---|---|---|---|---|---|
| 1 | Seed 2.1 Pro | ByteDance | — | — | — |
| 2 | | ByteDance | — | — | — |
What is xDailyBench?
xDailyBench evaluates AI agents on white-collar office work, covering everyday professional tasks such as document handling, consultation, and multi-step productivity workflows.
xDailyBench is a text benchmark that evaluates models on reasoning, general, and agent tasks. LLM Stats tracks 2 models on this benchmark, scored on a 0–1 scale. The current average is 0.6, with the leader at 0.610.
Compare leaders on the best AI for reasoning, best AI for general, and best AI for agents leaderboards.
Current leaders
Seed 2.1 Pro from ByteDance currently leads the xDailyBench leaderboard with a score of 0.610, the highest of the 2 evaluated models.