Agents struggling with bad tool calls or hallucinations? TryZeroEval
AI Leaderboard
The best AI models ranked by performance, price, and speed
291 models
| License | |||||||||
|---|---|---|---|---|---|---|---|---|---|
1 | Google | 2,104 | 1,222 | 94.3% | 80.6% | 1.0M | $2.50 | $15.00 | Proprietary |
2 | Anthropic | 2,018 | 1,491 | 91.3% | 80.8% | 1.0M | $5.00 | $25.00 | Proprietary |
3 | Anthropic | 1,769 | 358 | 94.2% | 87.6% | 1.0M | $5.00 | $25.00 | Proprietary |
4 | OpenAI | 1,726 | 1,146 | 92.8% | — | 1.0M | $2.50 | $15.00 | Proprietary |
5 | Google | 1,709 | 1,143 | 90.4% | 78.0% | 1.0M | $0.50 | $3.00 | Proprietary |
6 | Anthropic | 1,616 | 1,342 | 87.0% | 80.9% | 200K | $5.00 | $25.00 | Proprietary |
7 | Google | 1,579 | 1,045 | 91.9% | 76.2% | — | — | — | Proprietary |
8 | Zhipu AI | 1,578 | 1,158 | — | 77.8% | 200K | $1.00 | $3.20 | Open Source |
9 | OpenAI | 1,518 | 1,193 | 92.4% | 80.0% | 400K | $1.75 | $14.00 | Proprietary |
10 | Moonshot AI | 1,463 | 1,003 | 87.6% | 76.8% | 262K | $0.60 | $3.00 | Open Source |
11 | Anthropic | 1,415 | 956 | 89.9% | 79.6% | 200K | $3.00 | $15.00 | Proprietary |
12 | OpenAI | 1,301 | 1,067 | 87.3% | — | — | — | — | Proprietary |
13 | OpenAI | 1,232 | 1,013 | 88.1% | 76.3% | 400K | $1.25 | $10.00 | Proprietary |
14 | GLM-5.1NEW Zhipu AI | 1,207 | -179 | 86.2% | — | 200K | $1.40 | $4.40 | Open Source |
15 | Alibaba Cloud / Qwen Team | 1,206 | 963 | 88.4% | 76.4% | 262K | $0.60 | $3.60 | Open Source |
16 | Google | 1,181 | 738 | 86.9% | — | 1.0M | $0.25 | $1.50 | Proprietary |
17 | OpenAI | 1,175 | 895 | — | — | 400K | $1.75 | $14.00 | Proprietary |
18 | Anthropic | 1,157 | 1,180 | 80.9% | 74.5% | 200K | $15.00 | $75.00 | Proprietary |
19 | OpenAI | 1,148 | 812 | — | — | 400K | $1.75 | $14.00 | Proprietary |
20 | OpenAI | 1,140 | 1,132 | 88.1% | — | — | — | — | Proprietary |
1-20 of 291
New Models
Announced in the last 15 days