Finance Agent

Name: Finance Agent Leaderboard — AI Model Scores
Creator: LLM Stats
License: https://llm-stats.com/legal/terms-of-service

Progress Over Time

Interactive timeline showing model performance evolution on Finance Agent

State-of-the-art frontier

Open

Proprietary

Finance Agent Leaderboard

8 models

			Context	Cost
1	Claude Opus 4.7 Anthropic	—	1.0M	$5.00 / $25.00
2	Claude Sonnet 4.6 Anthropic	—	200K	$3.00 / $15.00
3	Claude Opus 4.6 Anthropic	—	1.0M	$5.00 / $25.00
4	GPT-5.5 OpenAI	—	1.1M	$5.00 / $30.00
5	Gemini 3.5 Flash Google	—	1.0M	$1.50 / $9.00
6	GPT-5.4 OpenAI	—	1.0M	$2.50 / $15.00
7	Claude Opus 4.8 Anthropic	—	1.0M	$5.00 / $25.00
8	Nemotron 3 Ultra (550B A55B) NVIDIA	550B	—	—

Notice missing or incorrect data?

About this benchmark

What is Finance Agent?

Finance Agent is a benchmark for evaluating AI models on agentic financial analysis tasks, testing their ability to process financial data, perform calculations, and generate accurate analyses across various financial domains.

Finance Agent is a text benchmark evaluating models on reasoning, finance, and agents tasks. LLM Stats tracks 8 models on this benchmark, scored on a 0–1 scale. The current average is 0.6, with the leader at 0.6.

Compare leaders on the best AI for reasoning, best AI for finance and best AI for agents leaderboards.

Current leaders

Claude Opus 4.7 from Anthropic currently leads the Finance Agent leaderboard with a score of 0.644 across 8 evaluated AI models.

Claude Opus 4.7Anthropic64.4%

Claude Sonnet 4.6Anthropic63.3%

Claude Opus 4.6Anthropic60.7%

OSS

Nemotron 3 Ultra (550B A55B)#8 open-weight53.7%

FAQ

Common questions about the Finance Agent benchmark and leaderboard.

What is the Finance Agent benchmark?

What is the Finance Agent leaderboard?

The Finance Agent leaderboard ranks 8 AI models based on their performance on this benchmark. Currently, Claude Opus 4.7 by Anthropic leads with a score of 0.644. The average score across all models is 0.587.

What is the highest Finance Agent score?

The highest Finance Agent score is 0.644, achieved by Claude Opus 4.7 from Anthropic.

How many models are evaluated on Finance Agent?

8 models have been evaluated on the Finance Agent benchmark, with 0 verified results and 8 self-reported results.

What categories does Finance Agent cover?

Finance Agent is categorized under reasoning, finance, and agents. The benchmark evaluates text models.

What is the best open-source model on Finance Agent?

Nemotron 3 Ultra (550B A55B) by NVIDIA is the top-ranked open-source model on Finance Agent, with a score of 0.537 (rank #8).

Which model offers the best value on Finance Agent?

Among models scoring within 10% of the leader, Claude Sonnet 4.6 from Anthropic is the cheapest, at $3.00 per million input tokens with a score of 0.633.

How recent are the Finance Agent leaderboard results?

The Finance Agent leaderboard was last updated in July 2026 and currently includes 8 evaluated models.