Benchmarks/economics/FinSearchComp-T3

FinSearchComp-T3

FinSearchComp-T3 is a benchmark for evaluating financial search and reasoning capabilities, testing models' ability to retrieve and analyze financial information using tools.

Progress Over Time

Interactive timeline showing model performance evolution on FinSearchComp-T3

State-of-the-art frontier
Open
Proprietary

FinSearchComp-T3 Leaderboard

1 models • 0 verified
ContextCostLicense
1
1.0T
Notice missing or incorrect data?

FAQ

Common questions about FinSearchComp-T3

FinSearchComp-T3 is a benchmark for evaluating financial search and reasoning capabilities, testing models' ability to retrieve and analyze financial information using tools.
The FinSearchComp-T3 leaderboard ranks 1 AI models based on their performance on this benchmark. Currently, Kimi K2-Thinking-0905 by Moonshot AI leads with a score of 0.474. The average score across all models is 0.474.
The highest FinSearchComp-T3 score is 0.474, achieved by Kimi K2-Thinking-0905 from Moonshot AI.
1 models have been evaluated on the FinSearchComp-T3 benchmark, with 0 verified results and 1 self-reported results.
FinSearchComp-T3 is categorized under economics, finance, reasoning, and search. The benchmark evaluates text models.