Tau3 Banking
Progress Over Time
Interactive timeline showing model performance evolution on Tau3 Banking
Tau3 Banking Leaderboard
| Context | Cost | License | ||||
|---|---|---|---|---|---|---|
| 1 | Mistral AI | 128B | 256K | $1.50 / $7.50 |
What is Tau3 Banking?
τ³-Bench banking domain evaluates agentic models on multi-turn, tool-using customer-support scenarios in a simulated retail banking environment.
Tau3 Banking is a text benchmark evaluating models on reasoning, agents, and tool calling tasks. LLM Stats tracks 1 models on this benchmark, scored on a 0–1 scale. The current average is 0.1, with the leader at 0.1.
Compare leaders on the best AI for reasoning, best AI for agents and best AI for tool calling leaderboards.
Current leaders
Mistral Medium 3.5 from Mistral AI currently leads the Tau3 Banking leaderboard with a score of 0.134 across 1 evaluated AI models.
FAQ
Common questions about the Tau3 Banking benchmark and leaderboard.