Benchmarks/agents/OfficeQA Pro

OfficeQA Pro

OfficeQA Pro evaluates AI models on professional knowledge-work questions and tasks drawn from real office workflows, including document analysis, spreadsheet reasoning, and information synthesis across business domains.

Progress Over Time

Interactive timeline showing model performance evolution on OfficeQA Pro

State-of-the-art frontier
Open
Proprietary

OfficeQA Pro Leaderboard

1 models
ContextCostLicense
1
OpenAI
OpenAI
1.0M$5.00 / $30.00
Notice missing or incorrect data?

FAQ

Common questions about OfficeQA Pro

OfficeQA Pro evaluates AI models on professional knowledge-work questions and tasks drawn from real office workflows, including document analysis, spreadsheet reasoning, and information synthesis across business domains.
The OfficeQA Pro leaderboard ranks 1 AI models based on their performance on this benchmark. Currently, GPT-5.5 by OpenAI leads with a score of 0.541. The average score across all models is 0.541.
The highest OfficeQA Pro score is 0.541, achieved by GPT-5.5 from OpenAI.
1 models have been evaluated on the OfficeQA Pro benchmark, with 0 verified results and 1 self-reported results.
OfficeQA Pro is categorized under agents, general, and reasoning. The benchmark evaluates text models.