Tau3 Banking

Progress Over Time

[Interactive timeline: model performance on Tau3 Banking over time, showing the state-of-the-art frontier and distinguishing open from proprietary models.]

Tau3 Banking Leaderboard

1 model
Rank: 1 · Size: 128B · Context: 256K · Cost: $1.50 / $7.50
About this benchmark

What is Tau3 Banking?

The τ³-Bench banking domain evaluates agentic models on multi-turn, tool-using customer-support scenarios in a simulated retail banking environment.

Tau3 Banking is a text benchmark that evaluates models on reasoning, agent, and tool-calling tasks. LLM Stats tracks 1 model on this benchmark, scored on a 0–1 scale. With a single entry, the average and the leading score are both 0.134.

Current leaders

Mistral Medium 3.5 from Mistral AI currently leads the Tau3 Banking leaderboard with a score of 0.134. It is the only model evaluated so far.

1. Mistral Medium 3.5 (Mistral AI): 13.4%

FAQ

Common questions about the Tau3 Banking benchmark and leaderboard.

What is the Tau3 Banking benchmark?

The τ³-Bench banking domain evaluates agentic models on multi-turn, tool-using customer-support scenarios in a simulated retail banking environment.

What is the Tau3 Banking leaderboard?

The Tau3 Banking leaderboard ranks AI models by their performance on this benchmark and currently has 1 entry. Mistral Medium 3.5 by Mistral AI leads with a score of 0.134, which is also the average score across all models.

What is the highest Tau3 Banking score?

The highest Tau3 Banking score is 0.134, achieved by Mistral Medium 3.5 from Mistral AI.

How many models are evaluated on Tau3 Banking?

One model has been evaluated on the Tau3 Banking benchmark. Its result is self-reported; there are no verified results.

What categories does Tau3 Banking cover?

Tau3 Banking is categorized under reasoning, agents, and tool calling. The benchmark evaluates text models.

What is the best open-source model on Tau3 Banking?

Mistral Medium 3.5 by Mistral AI is the top-ranked open-source model on Tau3 Banking, with a score of 0.134 (rank #1).

Which model offers the best value on Tau3 Banking?

Among models scoring within 10% of the leader, Mistral Medium 3.5 from Mistral AI is the cheapest, at $1.50 per million input tokens with a score of 0.134.
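
The selection rule described above can be sketched in a few lines of Python. This is an illustration only, not the leaderboard's actual implementation: the `Entry` type and `best_value` function are hypothetical names, and the sketch assumes "within 10% of the leader" means a score of at least 90% of the top score (a relative tolerance), with ties broken by input price alone.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Entry:
    """Hypothetical leaderboard row; field names are illustrative."""
    model: str
    score: float        # benchmark score on the 0-1 scale
    input_price: float  # USD per million input tokens


def best_value(entries: list[Entry], tolerance: float = 0.10) -> Optional[Entry]:
    """Return the cheapest entry whose score is within `tolerance` of the leader.

    Assumption: the tolerance is relative to the leader's score.
    """
    if not entries:
        return None
    leader_score = max(e.score for e in entries)
    threshold = leader_score * (1 - tolerance)
    eligible = [e for e in entries if e.score >= threshold]
    return min(eligible, key=lambda e: e.input_price)


# The single entry currently on the leaderboard.
entries = [Entry("Mistral Medium 3.5", 0.134, 1.50)]
pick = best_value(entries)
print(pick.model, pick.input_price)
```

With only one model on the leaderboard, the rule trivially selects that model; the comparison becomes meaningful once more entries are added.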

How recent are the Tau3 Banking leaderboard results?

The Tau3 Banking leaderboard was last updated in July 2026 and currently includes 1 evaluated model.