SpreadSheetBench-v1

Name: SpreadSheetBench-v1 Leaderboard — AI Model Scores
Creator: LLM Stats
License: https://llm-stats.com/legal/terms-of-service

Progress Over Time

Interactive timeline showing model performance evolution on SpreadSheetBench-v1

State-of-the-art frontier

Open

Proprietary

SpreadSheetBench-v1 Leaderboard

3 models

			Context	Cost
1	MiniMax M3 MiniMax	—	1.0M	$0.30 / $1.20
2	Qwen3.7 Max Alibaba Cloud / Qwen Team	—	1.0M	$1.25 / $3.75
3	Qwen3.7-Plus Alibaba Cloud / Qwen Team	—	1.0M	$0.32 / $1.28

Notice missing or incorrect data?

About this benchmark

What is SpreadSheetBench-v1?

SpreadSheetBench-v1 evaluates office automation agents on spreadsheet reasoning and manipulation tasks, measuring the ability to analyze, transform, and operate on spreadsheet data through tools.

SpreadSheetBench-v1 is a text benchmark evaluating models on productivity, agents, and tool calling tasks. LLM Stats tracks 3 models on this benchmark, scored on a 0–1 scale. The current average is 0.9, with the leader at 0.9.

Compare leaders on the best AI for productivity, best AI for agents and best AI for tool calling leaderboards.

Current leaders

MiniMax M3 from MiniMax currently leads the SpreadSheetBench-v1 leaderboard with a score of 0.893 across 3 evaluated AI models.

MiniMax M3MiniMax89.3%

Qwen3.7 MaxAlibaba Cloud / Qwen Team87.0%

Qwen3.7-PlusAlibaba Cloud / Qwen Team86.3%

FAQ

Common questions about the SpreadSheetBench-v1 benchmark and leaderboard.

What is the SpreadSheetBench-v1 benchmark?

SpreadSheetBench-v1 evaluates office automation agents on spreadsheet reasoning and manipulation tasks, measuring the ability to analyze, transform, and operate on spreadsheet data through tools.

What is the SpreadSheetBench-v1 leaderboard?

The SpreadSheetBench-v1 leaderboard ranks 3 AI models based on their performance on this benchmark. Currently, MiniMax M3 by MiniMax leads with a score of 0.893. The average score across all models is 0.876.

What is the highest SpreadSheetBench-v1 score?

The highest SpreadSheetBench-v1 score is 0.893, achieved by MiniMax M3 from MiniMax.

How many models are evaluated on SpreadSheetBench-v1?

3 models have been evaluated on the SpreadSheetBench-v1 benchmark, with 0 verified results and 3 self-reported results.

What categories does SpreadSheetBench-v1 cover?

SpreadSheetBench-v1 is categorized under productivity, agents, and tool calling. The benchmark evaluates text models.

What is the best open-source model on SpreadSheetBench-v1?

MiniMax M3 by MiniMax is the top-ranked open-source model on SpreadSheetBench-v1, with a score of 0.893 (rank #1).

Which model offers the best value on SpreadSheetBench-v1?

Among models scoring within 10% of the leader, MiniMax M3 from MiniMax is the cheapest, at $0.30 per million input tokens with a score of 0.893.

How recent are the SpreadSheetBench-v1 leaderboard results?

The SpreadSheetBench-v1 leaderboard was last updated in July 2026 and currently includes 3 evaluated models.