AI Leaderboard
The best AI models ranked by performance, price, and speed
288 models
| Rank | Organization | | | | | | | | License |
|---|---|---|---|---|---|---|---|---|---|
1 | Anthropic | 1,998 | 1,491 | 91.3% | 80.8% | 1.0M | $5.00 | $25.00 | Proprietary |
2 | Google | 1,901 | 1,222 | 94.3% | 80.6% | 1.0M | $2.50 | $15.00 | Proprietary |
3 | OpenAI | 1,630 | 1,146 | 92.8% | — | 1.0M | $2.50 | $15.00 | Proprietary |
4 | Zhipu AI | 1,594 | 1,179 | — | 77.8% | 200K | $1.00 | $3.20 | Open Source |
5 | Anthropic | 1,590 | 1,345 | 87.0% | 80.9% | 200K | $5.00 | $25.00 | Proprietary |
6 | Google | 1,579 | 1,045 | 91.9% | 76.2% | — | — | — | Proprietary |
7 | Alibaba Cloud / Qwen Team | 1,577 | 1,370 | 85.5% | 72.4% | 262K | $0.30 | $2.40 | Open Source |
8 | Google | 1,576 | 1,172 | 90.4% | 78.0% | 1.0M | $0.50 | $3.00 | Proprietary |
9 | OpenAI | 1,502 | 1,172 | 92.4% | 80.0% | 400K | $1.75 | $14.00 | Proprietary |
10 | Moonshot AI | 1,465 | 988 | 87.6% | 76.8% | 262K | $0.60 | $2.50 | Open Source |
11 | Anthropic | 1,378 | 941 | 89.9% | 79.6% | 200K | $3.00 | $15.00 | Proprietary |
12 | OpenAI | 1,301 | 1,037 | 87.3% | — | 400K | $1.25 | $10.00 | Proprietary |
13 | Alibaba Cloud / Qwen Team | 1,214 | 1,067 | 88.4% | 76.4% | 262K | $0.60 | $3.60 | Open Source |
14 | OpenAI | 1,144 | 802 | — | — | 400K | $1.75 | $14.00 | Proprietary |
15 | Zhipu AI | 1,139 | 1,079 | 81.0% | 68.0% | 131K | $0.55 | $2.19 | Open Source |
16 | Alibaba Cloud / Qwen Team | 1,134 | — | 86.6% | 72.0% | 262K | $0.40 | $3.20 | Open Source |
17 | OpenAI | 1,124 | 650 | — | — | 400K | $1.75 | $14.00 | Proprietary |
18 | OpenAI | 1,110 | 1,132 | 88.1% | — | 400K | $1.25 | $10.00 | Proprietary |
19 | Anthropic | 1,103 | 1,294 | 83.4% | — | 200K | $3.00 | $15.00 | Proprietary |
20 | OpenAI | 1,082 | 1,026 | 88.1% | — | 400K | $1.25 | $10.00 | Proprietary |
Showing models 1-20 of 288
New Models
Announced in the last 15 days