TreeBench
Progress Over Time
Interactive timeline showing model performance evolution on TreeBench
TreeBench Leaderboard
| Context | Cost | License | ||||
|---|---|---|---|---|---|---|
| 1 | Seed 2.1 ProNew ByteDance | — | — | — | ||
| 1 | ByteDance | — | — | — |
What is TreeBench?
TreeBench evaluates visual grounded reasoning, requiring models to localize and reason about fine-grained visual details.
TreeBench is a multimodal benchmark evaluating models on multimodal, reasoning, spatial reasoning, and vision tasks. LLM Stats tracks 2 models on this benchmark, scored on a 0–1 scale. The current average is 0.7, with the leader at 0.7.
Compare leaders on the best AI for multimodal, best AI for reasoning, best AI for spatial reasoning and best AI for vision leaderboards.
Current leaders
Seed 2.1 Pro from ByteDance currently leads the TreeBench leaderboard with a score of 0.711 across 2 evaluated AI models.
FAQ
Common questions about the TreeBench benchmark and leaderboard.