Workspace Bench

Name: Workspace Bench Leaderboard — AI Model Scores
Creator: LLM Stats
License: https://llm-stats.com/legal/terms-of-service

Progress Over Time

[Interactive timeline: model performance on Workspace Bench over time, showing the state-of-the-art frontier with open and proprietary models marked separately.]

Workspace Bench Leaderboard

2 models

Rank	Model	Organization	Score	Context	Cost	License
1	Seed 2.1 Turbo	ByteDance	0.547	—	—	—
2	Seed 2.1 Pro	ByteDance	0.530	—	—	—

About this benchmark

What is Workspace Bench?

Workspace Bench evaluates AI agents on high-economic-value workplace tasks that span multi-step planning, file processing, and tool use across realistic office and productivity workflows.

Workspace Bench is a text benchmark covering the reasoning, general, and agents categories. LLM Stats tracks 2 models on this benchmark, scored on a 0–1 scale. The current average is 0.538, and the leader scores 0.547.

Current leaders

Seed 2.1 Turbo from ByteDance currently leads the Workspace Bench leaderboard with a score of 0.547, ahead of the one other evaluated model.

Seed 2.1 Turbo (ByteDance): 54.7%

Seed 2.1 Pro (ByteDance): 53.0%
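
The page reports scores both on a 0–1 scale and as percentages. As a minimal sketch of how the two relate, the Python snippet below recomputes the leader and the average from the two listed scores. The variable and function names are illustrative assumptions, not part of any LLM Stats API. The unrounded mean of 0.547 and 0.530 is 0.5385, which the page reports as 0.538; how the site rounds or truncates is not stated.

```python
from statistics import mean

# Scores as listed on the leaderboard (0-1 scale).
# The dictionary layout is an assumption made for illustration.
scores = {
    "Seed 2.1 Turbo": 0.547,
    "Seed 2.1 Pro": 0.530,
}

# The leader is the model with the highest score.
leader = max(scores, key=scores.get)

# Plain arithmetic mean across all listed models.
average = mean(scores.values())


def as_percent(score: float) -> str:
    """Format a 0-1 score as a percentage with one decimal place."""
    return f"{score:.1%}"


# Print models from highest to lowest score, in both notations.
for model, score in sorted(scores.items(), key=lambda kv: kv[1], reverse=True):
    print(f"{model}: {score:.3f} = {as_percent(score)}")
print(f"Leader: {leader}; unrounded average: {average:.4f}")
```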

FAQ

Common questions about the Workspace Bench benchmark and leaderboard.

What is the Workspace Bench leaderboard?

The Workspace Bench leaderboard ranks 2 AI models based on their performance on this benchmark. Currently, Seed 2.1 Turbo by ByteDance leads with a score of 0.547. The average score across all models is 0.538.

What is the highest Workspace Bench score?

The highest Workspace Bench score is 0.547, achieved by Seed 2.1 Turbo from ByteDance.

How many models are evaluated on Workspace Bench?

Two models have been evaluated on the Workspace Bench benchmark. Both results are self-reported; none have been verified.

What categories does Workspace Bench cover?

Workspace Bench is categorized under reasoning, general, and agents. The benchmark evaluates text models.

How recent are the Workspace Bench leaderboard results?

The Workspace Bench leaderboard was last updated in June 2026 and currently includes 2 evaluated models.