ChartMuseum
Progress Over Time
Interactive timeline showing model performance evolution on ChartMuseum
ChartMuseum Leaderboard
| Context | Cost | License | ||||
|---|---|---|---|---|---|---|
| 1 | Anthropic | — | 1.0M | $3.00 / $15.00 |
What is ChartMuseum?
ChartMuseum is a chart question-answering benchmark of 1,162 expert-annotated questions over real-world chart images drawn from 184 sources, including academic figures, infographics, and unconventional chart designs. It specifically targets questions that require visual reasoning, such as comparing unlabeled visual elements, tracking trajectories, and judging spatial relationships.
ChartMuseum is a multimodal benchmark evaluating models on multimodal, reasoning, and vision tasks. LLM Stats tracks 1 models on this benchmark, scored on a 0–1 scale. The current average is 0.9, with the leader at 0.9.
Compare leaders on the best AI for multimodal, best AI for reasoning and best AI for vision leaderboards.
Current leaders
Claude Sonnet 5 from Anthropic currently leads the ChartMuseum leaderboard with a score of 0.867 across 1 evaluated AI models.
FAQ
Common questions about the ChartMuseum benchmark and leaderboard.