MM-BrowserComp
Progress Over Time
Interactive timeline showing model performance evolution on MM-BrowserComp
MM-BrowserComp Leaderboard
| Context | Cost | License | ||||
|---|---|---|---|---|---|---|
| 1 | Xiaomi | — | — | — |
What is MM-BrowserComp?
MM-BrowserComp evaluates multimodal agents on web browsing and information retrieval tasks, testing a model's ability to perceive, navigate, and extract information from real web environments.
MM-BrowserComp is a multimodal benchmark evaluating models on multimodal, search, and agents tasks. LLM Stats tracks 1 models on this benchmark, scored on a 0–1 scale. The current average is 0.5, with the leader at 0.5.
Compare leaders on the best AI for multimodal, best AI for search and best AI for agents leaderboards.
Current leaders
MiMo-V2-Omni from Xiaomi currently leads the MM-BrowserComp leaderboard with a score of 0.520 across 1 evaluated AI models.
FAQ
Common questions about the MM-BrowserComp benchmark and leaderboard.