VIBE-Pro

VIBE-Pro is an advanced version of the VIBE (Visual & Interactive Benchmark for Execution) benchmark that evaluates LLMs on professional-grade full-stack application development tasks. It measures model performance across complex real-world development scenarios including web, mobile, and backend applications with higher difficulty than the standard VIBE benchmark.

Progress Over Time

Interactive timeline showing model performance evolution on VIBE-Pro

State-of-the-art frontier
Open
Proprietary

VIBE-Pro Leaderboard

2 models
ContextCostLicense
1
2230B1.0M$0.30 / $1.20
Notice missing or incorrect data?

FAQ

Common questions about VIBE-Pro

VIBE-Pro is an advanced version of the VIBE (Visual & Interactive Benchmark for Execution) benchmark that evaluates LLMs on professional-grade full-stack application development tasks. It measures model performance across complex real-world development scenarios including web, mobile, and backend applications with higher difficulty than the standard VIBE benchmark.
The VIBE-Pro leaderboard ranks 2 AI models based on their performance on this benchmark. Currently, MiniMax M2.7 by MiniMax leads with a score of 0.556. The average score across all models is 0.549.
The highest VIBE-Pro score is 0.556, achieved by MiniMax M2.7 from MiniMax.
2 models have been evaluated on the VIBE-Pro benchmark, with 0 verified results and 2 self-reported results.
VIBE-Pro is categorized under agents and code. The benchmark evaluates text models.