MobileWorld
Progress Over Time
Interactive timeline showing model performance evolution on MobileWorld
MobileWorld Leaderboard
| Context | Cost | License | ||||
|---|---|---|---|---|---|---|
| 1 | Seed 2.1 ProNew ByteDance | — | — | — | ||
| 2 | ByteDance | — | — | — |
What is MobileWorld?
MobileWorld is a benchmark for evaluating multimodal agents on real mobile-device tasks, testing GUI grounding, navigation, and multi-step task completion in mobile environments.
MobileWorld is a multimodal benchmark evaluating models on multimodal, agents, and vision tasks. LLM Stats tracks 2 models on this benchmark, scored on a 0–1 scale. The current average is 0.7, with the leader at 0.7.
Compare leaders on the best AI for multimodal, best AI for agents and best AI for vision leaderboards.
Current leaders
Seed 2.1 Pro from ByteDance currently leads the MobileWorld leaderboard with a score of 0.731 across 2 evaluated AI models.
FAQ
Common questions about the MobileWorld benchmark and leaderboard.