InfoVQA

InfoVQA dataset with 30,000 questions and 5,000 infographic images requiring joint reasoning over document layout, textual content, graphical elements, and data visualizations with elementary reasoning and arithmetic skills

Paper

Progress Over Time

Interactive timeline showing model performance evolution on InfoVQA

State-of-the-art frontier
Open
Proprietary

InfoVQA Leaderboard

9 models
ContextCostLicense
1
Alibaba Cloud / Qwen Team
Alibaba Cloud / Qwen Team
34B
2
Alibaba Cloud / Qwen Team
Alibaba Cloud / Qwen Team
8B
3
DeepSeek
DeepSeek
27B129K
416B
56B128K$0.05 / $0.10
627B131K$0.10 / $0.20
73B
812B131K$0.05 / $0.10
94B131K$0.02 / $0.04
Notice missing or incorrect data?

FAQ

Common questions about InfoVQA

InfoVQA dataset with 30,000 questions and 5,000 infographic images requiring joint reasoning over document layout, textual content, graphical elements, and data visualizations with elementary reasoning and arithmetic skills
The InfoVQA paper is available at https://arxiv.org/abs/2104.12756. This paper provides detailed information about the benchmark methodology, dataset creation, and evaluation criteria.
The InfoVQA leaderboard ranks 9 AI models based on their performance on this benchmark. Currently, Qwen2.5 VL 32B Instruct by Alibaba Cloud / Qwen Team leads with a score of 0.834. The average score across all models is 0.716.
The highest InfoVQA score is 0.834, achieved by Qwen2.5 VL 32B Instruct from Alibaba Cloud / Qwen Team.
9 models have been evaluated on the InfoVQA benchmark, with 0 verified results and 9 self-reported results.
InfoVQA is categorized under multimodal and vision. The benchmark evaluates multimodal models.