FinSearchComp-T3
FinSearchComp-T3 is a benchmark for evaluating financial search and reasoning capabilities, testing models' ability to retrieve and analyze financial information using tools.
Progress Over Time
[Interactive timeline: model performance on FinSearchComp-T3 over time, showing the state-of-the-art frontier for open and proprietary models]
FinSearchComp-T3 Leaderboard
1 model • 0 verified
| Rank | Model | Organization | Parameters | Score | Context | Cost | License |
|---|---|---|---|---|---|---|---|
| 1 | Kimi K2-Thinking-0905 | Moonshot AI | 1.0T | 0.474 | — | — | |
FAQ
Common questions about FinSearchComp-T3
The FinSearchComp-T3 leaderboard currently ranks one AI model: Kimi K2-Thinking-0905 by Moonshot AI, with a score of 0.474. Because it is the only entry, the average score across all models is also 0.474.
The highest FinSearchComp-T3 score is 0.474, achieved by Kimi K2-Thinking-0905 from Moonshot AI.
One model has been evaluated on the FinSearchComp-T3 benchmark, with 0 verified results and 1 self-reported result.
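The headline figures quoted in these answers (model count, verified versus self-reported results, leader, top score, and average) can all be derived from the leaderboard rows. The following is a minimal sketch of that arithmetic, not the site's actual implementation: the `Entry` class, its field names, and the `summarize` helper are hypothetical, and the only data used is the single self-reported result listed on this page.

```python
from dataclasses import dataclass
from statistics import mean


@dataclass(frozen=True)
class Entry:
    # Hypothetical record layout; field names are assumptions for illustration.
    model: str
    organization: str
    score: float
    verified: bool


def summarize(entries: list[Entry]) -> dict:
    """Compute leaderboard summary figures for a non-empty list of entries."""
    leader = max(entries, key=lambda e: e.score)
    verified = sum(e.verified for e in entries)
    return {
        "models": len(entries),
        "verified": verified,
        "self_reported": len(entries) - verified,
        "leader": leader.model,
        "top_score": leader.score,
        "average_score": mean(e.score for e in entries),
    }


# The one entry currently listed for FinSearchComp-T3 (self-reported).
entries = [Entry("Kimi K2-Thinking-0905", "Moonshot AI", 0.474, verified=False)]
summary = summarize(entries)
print(summary)
```

With a single entry, the top score and the average necessarily coincide, which is why both are reported as 0.474.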
FinSearchComp-T3 is categorized under economics, finance, reasoning, and search. The benchmark evaluates text models.