AI Leaderboard — Compare 300+ Top AI Models by Intelligence, Speed & Price
Independent rankings of GPT, Claude, Gemini, Llama, DeepSeek and more than 300 AI models, scored with the composite LLM Stats Score and updated continuously from public benchmarks and live API metrics. See the full LLM Leaderboard for complete rankings with advanced filters.
Leads on reasoning: Claude Mythos Preview, 94.6% GPQA
Wins at coding: Claude Opus 4.6, 21 arena
Cheapest in the top 10: Qwen3.7 Max, $1.25/M tok
Fastest output: Mercury 2, 925 tok/s
Longest context window: Grok 4 Fast, 2.0M tokens
Best open-weights: Kimi K2.6, 90.5% GPQA
| Rank | Model / Organization | | | | | | | | | License |
|---|---|---|---|---|---|---|---|---|---|---|
| 1 | Claude Mythos Preview (unreleased), Anthropic | 69.3 | 72.9 | 58.2 | 47.9 | — | — | — | — | Proprietary |
| 2 | Anthropic | 69.0 | 66.2 | 52.7 | 44.2 | 1,634 | 1.0M | 70c/s | $7.22 | Proprietary |
| 3 | OpenAI | 63.1 | 62.5 | 51.2 | 42.5 | 2,105 | 1.1M | 149c/s | $7.78 | Proprietary |
| 4 | OpenAI | 61.4 | 56.9 | — | 28.6 | — | — | — | — | Proprietary |
| 5 | Anthropic | 61.0 | 62.8 | 49.3 | 40.3 | 1,920 | 1.0M | 105c/s | $7.22 | Proprietary |
| 6 | OpenAI | 60.4 | 57.8 | 42.9 | 36.6 | 1,733 | 1.0M | 126c/s | $3.89 | Proprietary |
| 7 | Google | 59.8 | 59.6 | 46.6 | 40.4 | 1,629 | 1.0M | 82c/s | $2.33 | Proprietary |
| 8 | Google | 58.8 | 59.7 | 43.4 | 33.7 | 2,101 | 1.0M | 164c/s | $3.89 | Proprietary |
| 9 | Moonshot AI | 57.7 | 58.3 | 43.7 | 37.2 | 1,556 | 262K | 58c/s | $1.29 | Open Source |
| 10 | Anthropic | 57.5 | 59.8 | 43.9 | 36.8 | 2,132 | 1.0M | 39c/s | $7.22 | Proprietary |
| 11 | Alibaba Cloud / Qwen Team | 56.5 | 60.5 | 48.2 | 39.1 | 1,634 | 1.0M | 120c/s | $1.53 | Proprietary |
| 12 | OpenAI | 56.4 | 53.8 | 34.8 | 25.0 | 1,530 | 400K | 216c/s | $3.11 | Proprietary |
| 13 | Google | 56.2 | 50.0 | 32.4 | 23.0 | 1,579 | — | — | — | Proprietary |
| 14 | ByteDance | 55.9 | 53.7 | 32.6 | 28.1 | — | — | — | — | Proprietary |
| 15 | Google | 55.0 | 49.6 | 30.8 | 24.4 | 1,714 | 1.0M | 247c/s | $0.78 | Proprietary |
Showing 1-15 of 309 models
Recent
New Models
Announced in the last 15 days.
Index
Performance Index
Composite TrueSkill ratings across published benchmarks.
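
For readers unfamiliar with TrueSkill, the sketch below shows one way a composite rating could be built from benchmark scores. It is not the site's actual pipeline, which is not described here. It assumes that each benchmark is treated as a set of pairwise "matches" in which the higher-scoring model wins, and it uses the standard two-player TrueSkill update without draws or the dynamics term. The model names, benchmark names, and scores are hypothetical, and the helper names (Rating, update, rate_models) are invented for illustration.

```python
import itertools
import math
from dataclasses import dataclass

# Conventional TrueSkill defaults; whether the site uses them is an assumption.
MU0 = 25.0
SIGMA0 = MU0 / 3
BETA = MU0 / 6  # per-match performance noise


@dataclass
class Rating:
    mu: float = MU0
    sigma: float = SIGMA0

    @property
    def conservative(self) -> float:
        # Common "mu - 3*sigma" display score.
        return self.mu - 3 * self.sigma


def _pdf(x: float) -> float:
    return math.exp(-x * x / 2) / math.sqrt(2 * math.pi)


def _cdf(x: float) -> float:
    return 0.5 * (1 + math.erf(x / math.sqrt(2)))


def update(winner: Rating, loser: Rating) -> tuple[Rating, Rating]:
    """Two-player TrueSkill update for a decisive result (no draw margin)."""
    c = math.sqrt(2 * BETA**2 + winner.sigma**2 + loser.sigma**2)
    t = (winner.mu - loser.mu) / c
    v = _pdf(t) / _cdf(t)   # mean shift factor
    w = v * (v + t)         # variance shrink factor, between 0 and 1
    new_winner = Rating(
        winner.mu + winner.sigma**2 / c * v,
        winner.sigma * math.sqrt(1 - winner.sigma**2 / c**2 * w),
    )
    new_loser = Rating(
        loser.mu - loser.sigma**2 / c * v,
        loser.sigma * math.sqrt(1 - loser.sigma**2 / c**2 * w),
    )
    return new_winner, new_loser


def rate_models(scores: dict[str, dict[str, float]]) -> dict[str, Rating]:
    """Treat every benchmark as pairwise matches; higher score wins, ties skipped."""
    ratings = {m: Rating() for bench in scores.values() for m in bench}
    for bench in scores.values():
        for a, b in itertools.combinations(sorted(bench), 2):
            if bench[a] == bench[b]:
                continue
            win, lose = (a, b) if bench[a] > bench[b] else (b, a)
            ratings[win], ratings[lose] = update(ratings[win], ratings[lose])
    return ratings


# Hypothetical scores, for illustration only.
SAMPLE_SCORES = {
    "bench_a": {"model_x": 0.90, "model_y": 0.80, "model_z": 0.70},
    "bench_b": {"model_x": 0.75, "model_y": 0.60, "model_z": 0.55},
}

ratings = rate_models(SAMPLE_SCORES)
ranking = sorted(ratings, key=lambda m: ratings[m].conservative, reverse=True)
for name in ranking:
    r = ratings[name]
    print(f"{name}: mu={r.mu:.2f} sigma={r.sigma:.2f}")
```

One property of this approach is that a model missing from a benchmark simply plays no matches there, so its uncertainty (sigma) stays higher instead of its score being penalized. A production system would likely also handle ties and rating drift, both of which are omitted here.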
FAQ
Quick answers for choosing, comparing and interpreting today's leading AI models.