SWE-Review
Progress Over Time
Interactive timeline showing model performance evolution on SWE-Review
State-of-the-art frontier
Open
Proprietary
SWE-Review Leaderboard
1 models
| Context | Cost | License | ||||
|---|---|---|---|---|---|---|
| 1 | MiniMax | 230B | 1.0M | $0.30 / $1.20 |
Notice missing or incorrect data?
What is SWE-Review?
Software Engineering Review benchmark evaluating code review capabilities
SWE-Review is a text benchmark evaluating models on code tasks. LLM Stats tracks 1 models on this benchmark, scored on a 0–1 scale. The current average is 0.1, with the leader at 0.1.
Compare leaders on the best AI for code leaderboards.
Current leaders
MiniMax M2.1 from MiniMax currently leads the SWE-Review leaderboard with a score of 0.089 across 1 evaluated AI models.
FAQ
Common questions about the SWE-Review benchmark and leaderboard.