MLE-Bench Lite
MLE-Bench Lite Leaderboard
| Rank | Organization | | Context | Cost | License |
|---|---|---|---|---|---|
| 1 | MiniMax | — | 205K | $0.30 / $1.20 | |
What is MLE-Bench Lite?
MLE-Bench Lite evaluates AI agents on machine learning engineering tasks, testing their ability to build, train, and optimize ML models for Kaggle-style competitions in a lightweight evaluation format.
MLE-Bench Lite is a text benchmark that evaluates models on agentic and coding tasks. LLM Stats tracks 1 model on this benchmark, scored on a 0–1 scale. Because only one model is listed, the average and the leading score are the same: 0.666, which rounds to 0.7.
Compare leaders on the best AI for agents and best AI for coding leaderboards.
Current leaders
MiniMax M2.7 from MiniMax currently leads the MLE-Bench Lite leaderboard with a score of 0.666; it is the only model evaluated so far.
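The summary figures quoted on this page (model count, leader, average) can be reproduced from the leaderboard rows. The following is a minimal sketch, assuming a hypothetical in-memory dictionary of model names and scores; the single entry and its score come from this page, while the data structure and variable names are illustrative only and not how LLM Stats computes its figures.

```python
from statistics import mean

# Hypothetical copy of the leaderboard: model name -> score on the 0-1 scale.
# Only the one entry reported on this page is included.
scores = {"MiniMax M2.7": 0.666}

# The leader is the entry with the highest score.
leader, top_score = max(scores.items(), key=lambda item: item[1])

# With a single entry, the average equals the leader's score.
average = mean(scores.values())

print(f"{len(scores)} model(s) tracked")
print(f"leader: {leader} at {top_score:.3f}")
print(f"average: {average:.3f} (displayed as {average:.1f} at one decimal place)")
```

Rounding to one decimal place is what turns 0.666 into the 0.7 shown in the summary text.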