About this benchmark

What is VIBE?

Visual Interface Building Evaluation benchmark for UI/app generation

VIBE is a text benchmark evaluating models on code tasks. LLM Stats tracks 1 models on this benchmark, scored on a 0–1 scale. The current average is 0.9, with the leader at 0.9.

Compare leaders on the best AI for code leaderboards.

Current leaders

MiniMax M2.1 from MiniMax currently leads the VIBE leaderboard with a score of 0.886 across 1 evaluated AI models.

1MiniMax M2.1MiniMax88.6%

FAQ

Common questions about the VIBE benchmark and leaderboard.

What is the VIBE benchmark?

Visual Interface Building Evaluation benchmark for UI/app generation

What is the VIBE leaderboard?

The VIBE leaderboard ranks 1 AI models based on their performance on this benchmark. Currently, MiniMax M2.1 by MiniMax leads with a score of 0.886. The average score across all models is 0.886.

What is the highest VIBE score?

The highest VIBE score is 0.886, achieved by MiniMax M2.1 from MiniMax.

How many models are evaluated on VIBE?

1 models have been evaluated on the VIBE benchmark, with 0 verified results and 1 self-reported results.

What categories does VIBE cover?

VIBE is categorized under code. The benchmark evaluates text models.

Are there variants of VIBE?

Yes. VIBE has 2 related variants: VIBE-Pro, VIBE-V2.

What is the best open-source model on VIBE?

MiniMax M2.1 by MiniMax is the top-ranked open-source model on VIBE, with a score of 0.886 (rank #1).

Which model offers the best value on VIBE?

Among models scoring within 10% of the leader, MiniMax M2.1 from MiniMax is the cheapest, at $0.30 per million input tokens with a score of 0.886.

How recent are the VIBE leaderboard results?

The VIBE leaderboard was last updated in July 2026 and currently includes 1 evaluated models.