GDPval-MM
GDPval-MM is the multimodal variant of the GDPval benchmark, evaluating AI model performance on real-world economically valuable tasks that require processing and generating multimodal content including documents, slides, diagrams, spreadsheets, images, and other professional deliverables across diverse industries.
GPT-5.5 from OpenAI currently leads the GDPval-MM leaderboard with a score of 0.849 across 3 evaluated AI models.