ContPhy
Progress Over Time
Interactive timeline showing model performance evolution on ContPhy
ContPhy Leaderboard
| Context | Cost | License | ||||
|---|---|---|---|---|---|---|
| 1 | Seed 2.1 ProNew ByteDance | — | — | — | ||
| 2 | ByteDance | — | — | — |
What is ContPhy?
ContPhy is a continuum physical-reasoning benchmark evaluating understanding of physical dynamics in video.
ContPhy is a multimodal benchmark evaluating models on multimodal, reasoning, video, and vision tasks. LLM Stats tracks 2 models on this benchmark, scored on a 0–1 scale. The current average is 0.6, with the leader at 0.6.
Compare leaders on the best AI for multimodal, best AI for reasoning, best AI for video and best AI for vision leaderboards.
Current leaders
Seed 2.1 Pro from ByteDance currently leads the ContPhy leaderboard with a score of 0.636 across 2 evaluated AI models.
FAQ
Common questions about the ContPhy benchmark and leaderboard.