SWE-Atlas
Progress Over Time
Interactive timeline showing model performance evolution on SWE-Atlas
State-of-the-art frontier
Open
Proprietary
SWE-Atlas Leaderboard
2 models
| Context | Cost | License | ||||
|---|---|---|---|---|---|---|
| 1 | Seed 2.1 ProNew ByteDance | — | — | — | ||
| 2 | ByteDance | — | — | — |
Notice missing or incorrect data?
What is SWE-Atlas?
SWE-Atlas is a software engineering benchmark focused on debugging, evaluating a model's ability to localize and fix bugs in real-world codebases.
SWE-Atlas is a text benchmark evaluating models on agents and coding tasks. LLM Stats tracks 2 models on this benchmark, scored on a 0–1 scale. The current average is 0.3, with the leader at 0.4.
Compare leaders on the best AI for agents and best AI for coding leaderboards.
Current leaders
Seed 2.1 Pro from ByteDance currently leads the SWE-Atlas leaderboard with a score of 0.352 across 2 evaluated AI models.
FAQ
Common questions about the SWE-Atlas benchmark and leaderboard.