VIBE-Pro
VIBE-Pro is an advanced version of the VIBE (Visual & Interactive Benchmark for Execution) benchmark that evaluates LLMs on professional-grade full-stack application development tasks. It measures model performance across complex real-world development scenarios including web, mobile, and backend applications with higher difficulty than the standard VIBE benchmark.
Progress Over Time
Interactive timeline showing model performance evolution on VIBE-Pro
State-of-the-art frontier
Open
Proprietary
VIBE-Pro Leaderboard
2 models
| Context | Cost | License | ||||
|---|---|---|---|---|---|---|
| 1 | MiniMax | — | — | — | ||
| 2 | MiniMax | 230B | 1.0M | $0.30 / $1.20 |
Notice missing or incorrect data?
FAQ
Common questions about VIBE-Pro
VIBE-Pro is an advanced version of the VIBE (Visual & Interactive Benchmark for Execution) benchmark that evaluates LLMs on professional-grade full-stack application development tasks. It measures model performance across complex real-world development scenarios including web, mobile, and backend applications with higher difficulty than the standard VIBE benchmark.
The VIBE-Pro leaderboard ranks 2 AI models based on their performance on this benchmark. Currently, MiniMax M2.7 by MiniMax leads with a score of 0.556. The average score across all models is 0.549.
The highest VIBE-Pro score is 0.556, achieved by MiniMax M2.7 from MiniMax.
2 models have been evaluated on the VIBE-Pro benchmark, with 0 verified results and 2 self-reported results.
VIBE-Pro is categorized under agents and code. The benchmark evaluates text models.