FinSearchComp-T3
FinSearchComp-T3 Leaderboard
| Rank | Organization | Parameters | Context | Cost | License |
|---|---|---|---|---|---|
| 1 | Moonshot AI | 1.0T | — | — | |
What is FinSearchComp-T3?
FinSearchComp-T3 is a benchmark for evaluating financial search and reasoning capabilities, testing models' ability to retrieve and analyze financial information using tools.
FinSearchComp-T3 is a text benchmark covering reasoning, search, finance, and economics tasks. LLM Stats tracks 1 model on this benchmark, scored on a 0–1 scale. With a single entry, the average and the top score coincide at 0.474 (about 0.5 when rounded).
Compare leaders on the best AI for reasoning, best AI for search, best AI for finance and best AI for economics leaderboards.
Current leaders
Kimi K2-Thinking-0905 from Moonshot AI currently leads the FinSearchComp-T3 leaderboard with a score of 0.474; it is the only model evaluated so far.