LMArena Text Leaderboard
Progress Over Time
Interactive timeline showing model performance evolution on LMArena Text Leaderboard
LMArena Text Leaderboard Leaderboard
| Context | Cost | License | ||||
|---|---|---|---|---|---|---|
| 1 | — | — | — | |||
| 2 | xAI | — | — | — |
What is LMArena Text Leaderboard?
LMArena Text Leaderboard is a blind human preference evaluation benchmark that ranks models based on pairwise comparisons in real-world conversations. The leaderboard uses Elo ratings computed from user preferences in head-to-head model battles, providing a comprehensive measure of overall model capability and style.
LMArena Text Leaderboard is a text benchmark evaluating models on reasoning and general tasks. LLM Stats tracks 2 models on this benchmark, scored on a 0–2000 scale. The current average is 1474.0, with the leader at 1483.0.
Compare leaders on the best AI for reasoning and best AI for general leaderboards.
Current leaders
Grok-4.1 Thinking from xAI currently leads the LMArena Text Leaderboard leaderboard with a score of 1483.000 across 2 evaluated AI models.
FAQ
Common questions about the LMArena Text Leaderboard benchmark and leaderboard.