Agents struggling with bad tool calls or hallucinations? TryZeroEval
AI Leaderboard
The best AI models ranked by performance, price, and speed
291 models
| License | |||||||||
|---|---|---|---|---|---|---|---|---|---|
1 | Google | 2,083 | 1,222 | 94.3% | 80.6% | 1.0M | $2.50 | $15.00 | Proprietary |
2 | Anthropic | 2,018 | 1,491 | 91.3% | 80.8% | 1.0M | $5.00 | $25.00 | Proprietary |
3 | Anthropic | 1,774 | 358 | 94.2% | 87.6% | 1.0M | $5.00 | $25.00 | Proprietary |
4 | OpenAI | 1,744 | 1,146 | 92.8% | — | 1.0M | $2.50 | $15.00 | Proprietary |
5 | Google | 1,704 | 1,143 | 90.4% | 78.0% | 1.0M | $0.50 | $3.00 | Proprietary |
6 | Anthropic | 1,616 | 1,342 | 87.0% | 80.9% | 200K | $5.00 | $25.00 | Proprietary |
7 | Google | 1,579 | 1,045 | 91.9% | 76.2% | — | — | — | Proprietary |
8 | Zhipu AI | 1,575 | 1,158 | — | 77.8% | 200K | $1.00 | $3.20 | Open Source |
9 | OpenAI | 1,517 | 1,193 | 92.4% | 80.0% | 400K | $1.75 | $14.00 | Proprietary |
10 | Moonshot AI | 1,479 | 1,003 | 87.6% | 76.8% | 262K | $0.60 | $3.00 | Open Source |
11 | Anthropic | 1,414 | 956 | 89.9% | 79.6% | 200K | $3.00 | $15.00 | Proprietary |
12 | Zhipu AI | 1,332 | -179 | 86.2% | — | 200K | $1.40 | $4.40 | Open Source |
13 | OpenAI | 1,301 | 1,067 | 87.3% | — | — | — | — | Proprietary |
14 | OpenAI | 1,242 | 895 | — | — | 400K | $1.75 | $14.00 | Proprietary |
15 | OpenAI | 1,232 | 1,013 | 88.1% | 76.3% | 400K | $1.25 | $10.00 | Proprietary |
16 | Alibaba Cloud / Qwen Team | 1,208 | 963 | 88.4% | 76.4% | 262K | $0.60 | $3.60 | Open Source |
17 | Google | 1,170 | 738 | 86.9% | — | 1.0M | $0.25 | $1.50 | Proprietary |
18 | Anthropic | 1,155 | 1,180 | 80.9% | 74.5% | 200K | $15.00 | $75.00 | Proprietary |
19 | OpenAI | 1,148 | 812 | — | — | 400K | $1.75 | $14.00 | Proprietary |
20 | OpenAI | 1,140 | 1,132 | 88.1% | — | — | — | — | Proprietary |
1-20 of 291
New Models
Announced in the last 15 days