ArcAGI2

Progress Over Time

[Interactive timeline: model performance on ArcAGI2 over time, showing the state-of-the-art frontier with open and proprietary models distinguished.]

ArcAGI2 Leaderboard

2 models

Rank  Model           Organization  Score
1     Seed 2.1 Pro    ByteDance     62.5%
2     Seed 2.1 Turbo  ByteDance     61.3%
About this benchmark

What is ArcAGI2?

ARC-AGI-2 is the second-generation Abstraction and Reasoning Corpus benchmark measuring fluid, general reasoning and abstraction.

ArcAGI2 is a text benchmark that evaluates models on reasoning tasks. LLM Stats tracks 2 models on this benchmark, scored on a 0–1 scale. The current average is 0.619, with the leader at 0.625.


Current leaders

Of the 2 AI models evaluated, Seed 2.1 Pro from ByteDance currently leads the ArcAGI2 leaderboard with a score of 0.625.

Rank  Model           Organization  Score
1     Seed 2.1 Pro    ByteDance     62.5%
2     Seed 2.1 Turbo  ByteDance     61.3%

FAQ

Common questions about the ArcAGI2 benchmark and leaderboard.

What is the ArcAGI2 benchmark?

ARC-AGI-2 is the second-generation Abstraction and Reasoning Corpus benchmark measuring fluid, general reasoning and abstraction.

What is the ArcAGI2 leaderboard?

The ArcAGI2 leaderboard ranks 2 AI models based on their performance on this benchmark. Currently, Seed 2.1 Pro by ByteDance leads with a score of 0.625. The average score across all models is 0.619.
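
The leader and average quoted above can be checked against the two listed scores. The following is a minimal sketch, not the site's actual method: the `summarize` helper and the `scores` dictionary are illustrative names, and it assumes the average is a plain arithmetic mean of the listed scores (62.5% and 61.3%, written on the 0–1 scale) rounded to three decimals.

```python
from statistics import mean

# Scores as listed on the leaderboard, converted to the 0-1 scale.
scores = {
    "Seed 2.1 Pro": 0.625,
    "Seed 2.1 Turbo": 0.613,
}

def summarize(scores):
    """Return (leader name, leader score, mean score rounded to 3 decimals).

    Hypothetical helper for illustration; assumes a simple unweighted mean.
    """
    leader = max(scores, key=scores.get)
    average = round(mean(list(scores.values())), 3)
    return leader, scores[leader], average

leader, top, average = summarize(scores)
print(f"{leader} leads with {top}; average across {len(scores)} models is {average}")
```

Under that assumption, the unweighted mean of the two scores matches the 0.619 average reported here.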

What is the highest ArcAGI2 score?

The highest ArcAGI2 score is 0.625, achieved by Seed 2.1 Pro from ByteDance.

How many models are evaluated on ArcAGI2?

Two models have been evaluated on the ArcAGI2 benchmark. Both results are self-reported; none has been verified.

What categories does ArcAGI2 cover?

ArcAGI2 is categorized under reasoning. The benchmark evaluates text models.

How recent are the ArcAGI2 leaderboard results?

The ArcAGI2 leaderboard was last updated in June 2026 and currently includes 2 evaluated models.