VIBE

Name: VIBE Leaderboard — AI Model Scores
Creator: LLM Stats
License: https://llm-stats.com/legal/terms-of-service

Progress Over Time

Interactive timeline showing model performance evolution on VIBE

State-of-the-art frontier

Open

Proprietary

VIBE Leaderboard

1 models

				Context	Cost	License
1	MiniMax M2.1 MiniMax		230B	1.0M	$0.30 / $1.20

Notice missing or incorrect data?

Sub-benchmarks

VIBE-Pro

VIBE-Pro is an advanced version of the VIBE (Visual & Interactive Benchmark for Execution) benchmark that evaluates LLMs on professional-grade full-stack application development tasks. It measures model performance across complex real-world development scenarios including web, mobile, and backend applications with higher difficulty than the standard VIBE benchmark.

text•Max 1

VIBE-V2

VIBE-V2 is an internal benchmark covering pure front-end and full-stack Web, Android, and iOS projects with build-from-scratch tasks. It uses an Agent-as-a-Verifier paradigm to automatically verify program interaction logic and visual output, scoring models through a unified pipeline that includes a requirement set, containerized deployment, and a dynamic interaction environment.

text•Max 1

About this benchmark

What is VIBE?

Visual Interface Building Evaluation benchmark for UI/app generation

VIBE is a text benchmark evaluating models on code tasks. LLM Stats tracks 1 models on this benchmark, scored on a 0–1 scale. The current average is 0.9, with the leader at 0.9.

Compare leaders on the best AI for code leaderboards.

Current leaders

MiniMax M2.1 from MiniMax currently leads the VIBE leaderboard with a score of 0.886 across 1 evaluated AI models.

MiniMax M2.1MiniMax88.6%

FAQ

Common questions about the VIBE benchmark and leaderboard.

What is the VIBE benchmark?

Visual Interface Building Evaluation benchmark for UI/app generation

What is the VIBE leaderboard?

The VIBE leaderboard ranks 1 AI models based on their performance on this benchmark. Currently, MiniMax M2.1 by MiniMax leads with a score of 0.886. The average score across all models is 0.886.

What is the highest VIBE score?

The highest VIBE score is 0.886, achieved by MiniMax M2.1 from MiniMax.

How many models are evaluated on VIBE?

1 models have been evaluated on the VIBE benchmark, with 0 verified results and 1 self-reported results.

What categories does VIBE cover?

VIBE is categorized under code. The benchmark evaluates text models.

Are there variants of VIBE?

Yes. VIBE has 2 related variants: VIBE-Pro, VIBE-V2.

What is the best open-source model on VIBE?

MiniMax M2.1 by MiniMax is the top-ranked open-source model on VIBE, with a score of 0.886 (rank #1).

Which model offers the best value on VIBE?

Among models scoring within 10% of the leader, MiniMax M2.1 from MiniMax is the cheapest, at $0.30 per million input tokens with a score of 0.886.

How recent are the VIBE leaderboard results?

The VIBE leaderboard was last updated in July 2026 and currently includes 1 evaluated models.