MLE-Bench Lite

Progress Over Time

Interactive timeline showing model performance evolution on MLE-Bench Lite

State-of-the-art frontier
Open
Proprietary

MLE-Bench Lite Leaderboard

1 models
ContextCostLicense
1205K$0.30 / $1.20
Notice missing or incorrect data?
About this benchmark

What is MLE-Bench Lite?

MLE-Bench Lite evaluates AI agents on machine learning engineering tasks, testing their ability to build, train, and optimize ML models for Kaggle-style competitions in a lightweight evaluation format.

MLE-Bench Lite is a text benchmark evaluating models on agents and coding tasks. LLM Stats tracks 1 models on this benchmark, scored on a 0–1 scale. The current average is 0.7, with the leader at 0.7.

Compare leaders on the best AI for agents and best AI for coding leaderboards.

Current leaders

MiniMax M2.7 from MiniMax currently leads the MLE-Bench Lite leaderboard with a score of 0.666 across 1 evaluated AI models.

1MiniMax M2.7MiniMax66.6%

FAQ

Common questions about the MLE-Bench Lite benchmark and leaderboard.

What is the MLE-Bench Lite benchmark?

MLE-Bench Lite evaluates AI agents on machine learning engineering tasks, testing their ability to build, train, and optimize ML models for Kaggle-style competitions in a lightweight evaluation format.

What is the MLE-Bench Lite leaderboard?

The MLE-Bench Lite leaderboard ranks 1 AI models based on their performance on this benchmark. Currently, MiniMax M2.7 by MiniMax leads with a score of 0.666. The average score across all models is 0.666.

What is the highest MLE-Bench Lite score?

The highest MLE-Bench Lite score is 0.666, achieved by MiniMax M2.7 from MiniMax.

How many models are evaluated on MLE-Bench Lite?

1 models have been evaluated on the MLE-Bench Lite benchmark, with 0 verified results and 1 self-reported results.

What categories does MLE-Bench Lite cover?

MLE-Bench Lite is categorized under agents and coding. The benchmark evaluates text models.

What is the best open-source model on MLE-Bench Lite?

MiniMax M2.7 by MiniMax is the top-ranked open-source model on MLE-Bench Lite, with a score of 0.666 (rank #1).

Which model offers the best value on MLE-Bench Lite?

Among models scoring within 10% of the leader, MiniMax M2.7 from MiniMax is the cheapest, at $0.30 per million input tokens with a score of 0.666.

How recent are the MLE-Bench Lite leaderboard results?

The MLE-Bench Lite leaderboard was last updated in July 2026 and currently includes 1 evaluated models.