SWE-Review

Progress Over Time

Interactive timeline showing model performance evolution on SWE-Review

State-of-the-art frontier
Open
Proprietary

SWE-Review Leaderboard

1 models
ContextCostLicense
1230B1.0M$0.30 / $1.20
Notice missing or incorrect data?
About this benchmark

What is SWE-Review?

Software Engineering Review benchmark evaluating code review capabilities

SWE-Review is a text benchmark evaluating models on code tasks. LLM Stats tracks 1 models on this benchmark, scored on a 0–1 scale. The current average is 0.1, with the leader at 0.1.

Compare leaders on the best AI for code leaderboards.

Current leaders

MiniMax M2.1 from MiniMax currently leads the SWE-Review leaderboard with a score of 0.089 across 1 evaluated AI models.

1MiniMax M2.1MiniMax8.9%

FAQ

Common questions about the SWE-Review benchmark and leaderboard.

What is the SWE-Review benchmark?

Software Engineering Review benchmark evaluating code review capabilities

What is the SWE-Review leaderboard?

The SWE-Review leaderboard ranks 1 AI models based on their performance on this benchmark. Currently, MiniMax M2.1 by MiniMax leads with a score of 0.089. The average score across all models is 0.089.

What is the highest SWE-Review score?

The highest SWE-Review score is 0.089, achieved by MiniMax M2.1 from MiniMax.

How many models are evaluated on SWE-Review?

1 models have been evaluated on the SWE-Review benchmark, with 0 verified results and 1 self-reported results.

What categories does SWE-Review cover?

SWE-Review is categorized under code. The benchmark evaluates text models.

What is the best open-source model on SWE-Review?

MiniMax M2.1 by MiniMax is the top-ranked open-source model on SWE-Review, with a score of 0.089 (rank #1).

Which model offers the best value on SWE-Review?

Among models scoring within 10% of the leader, MiniMax M2.1 from MiniMax is the cheapest, at $0.30 per million input tokens with a score of 0.089.

How recent are the SWE-Review leaderboard results?

The SWE-Review leaderboard was last updated in July 2026 and currently includes 1 evaluated models.
SWE-Review Leaderboard