VLMsAreBiased
Progress Over Time
Interactive timeline showing model performance evolution on VLMsAreBiased
State-of-the-art frontier
Open
Proprietary
VLMsAreBiased Leaderboard
2 models
| Context | Cost | License | ||||
|---|---|---|---|---|---|---|
| 1 | Seed 2.1 ProNew ByteDance | — | — | — | ||
| 2 | ByteDance | — | — | — |
Notice missing or incorrect data?
What is VLMsAreBiased?
VLMsAreBiased evaluates whether vision-language models rely on visual evidence or fall back on language priors when answering.
VLMsAreBiased is a multimodal benchmark evaluating models on multimodal and vision tasks. LLM Stats tracks 2 models on this benchmark, scored on a 0–1 scale. The current average is 0.8, with the leader at 0.8.
Compare leaders on the best AI for multimodal and best AI for vision leaderboards.
Current leaders
Seed 2.1 Pro from ByteDance currently leads the VLMsAreBiased leaderboard with a score of 0.836 across 2 evaluated AI models.
FAQ
Common questions about the VLMsAreBiased benchmark and leaderboard.