Benchmarks/agents/Finance Agent

Finance Agent

Finance Agent is a benchmark for evaluating AI models on agentic financial analysis tasks, testing their ability to process financial data, perform calculations, and generate accurate analyses across various financial domains.

Progress Over Time

Interactive timeline showing model performance evolution on Finance Agent

State-of-the-art frontier
Open
Proprietary

Finance Agent Leaderboard

5 models
ContextCostLicense
11.0M$5.00 / $25.00
2200K$3.00 / $15.00
31.0M$5.00 / $25.00
4
OpenAI
OpenAI
1.0M$5.00 / $30.00
5
OpenAI
OpenAI
1.0M$2.50 / $15.00
Notice missing or incorrect data?

FAQ

Common questions about Finance Agent

Finance Agent is a benchmark for evaluating AI models on agentic financial analysis tasks, testing their ability to process financial data, perform calculations, and generate accurate analyses across various financial domains.
The Finance Agent leaderboard ranks 5 AI models based on their performance on this benchmark. Currently, Claude Opus 4.7 by Anthropic leads with a score of 0.644. The average score across all models is 0.609.
The highest Finance Agent score is 0.644, achieved by Claude Opus 4.7 from Anthropic.
5 models have been evaluated on the Finance Agent benchmark, with 0 verified results and 5 self-reported results.
Finance Agent is categorized under agents, finance, and reasoning. The benchmark evaluates text models.