MCP Atlas

Progress Over Time

Interactive timeline showing model performance evolution on MCP Atlas

State-of-the-art frontier
Open
Proprietary

MCP Atlas Leaderboard

27 models
ContextCostLicense
1
ByteDance
ByteDance
21.0M$1.50 / $9.00
31.0M$5.00 / $25.00
4
51.0M$5.00 / $25.00
6
Zhipu AI
Zhipu AI
753B1.0M$0.95 / $3.00
7
Alibaba Cloud / Qwen Team
Alibaba Cloud / Qwen Team
1.0M$1.25 / $3.75
8
Moonshot AI
Moonshot AI
1.0T262K$0.74 / $3.50
9
OpenAI
OpenAI
1.1M$5.00 / $30.00
10
MiniMax
MiniMax
1.0M$0.30 / $1.20
11
Alibaba Cloud / Qwen Team
Alibaba Cloud / Qwen Team
1.0M$0.50 / $3.00
121.6T1.0M$1.60 / $3.20
13
Alibaba Cloud / Qwen Team
Alibaba Cloud / Qwen Team
1.0M$0.32 / $1.28
14
Zhipu AI
Zhipu AI
754B200K$1.40 / $4.40
151.0M$2.50 / $15.00
16284B1.0M$0.10 / $0.20
17
Zhipu AI
Zhipu AI
744B200K$1.00 / $3.20
18
OpenAI
OpenAI
1.0M$2.50 / $15.00
19
Alibaba Cloud / Qwen Team
Alibaba Cloud / Qwen Team
35B
201.0M$5.00 / $25.00
21
22200K$3.00 / $15.00
23
OpenAI
OpenAI
400K$1.75 / $14.00
24400K$0.75 / $4.50
251.0M$0.50 / $3.00
26400K$0.20 / $1.25
271.0M$0.30 / $2.50
Notice missing or incorrect data?
About this benchmark

What is MCP Atlas?

MCP Atlas is a benchmark for evaluating AI models on scaled tool use capabilities, measuring how well models can coordinate and utilize multiple tools across complex multi-step tasks.

MCP Atlas is a text benchmark evaluating models on reasoning, agents, code, and tool calling tasks. LLM Stats tracks 27 models on this benchmark, scored on a 0–1 scale. The current average is 0.7, with the leader at 0.8.

Compare leaders on the best AI for reasoning, best AI for agents, best AI for code and best AI for tool calling leaderboards.

Current leaders

Seed 2.1 Pro from ByteDance currently leads the MCP Atlas leaderboard with a score of 0.838 across 27 evaluated AI models.

1Seed 2.1 ProByteDance83.8%
2Gemini 3.5 FlashGoogle83.6%
3Claude Opus 4.8Anthropic82.2%
OSSGLM-5.2#6 open-weight76.8%

FAQ

Common questions about the MCP Atlas benchmark and leaderboard.

What is the MCP Atlas benchmark?

MCP Atlas is a benchmark for evaluating AI models on scaled tool use capabilities, measuring how well models can coordinate and utilize multiple tools across complex multi-step tasks.

What is the MCP Atlas leaderboard?

The MCP Atlas leaderboard ranks 27 AI models based on their performance on this benchmark. Currently, Seed 2.1 Pro by ByteDance leads with a score of 0.838. The average score across all models is 0.688.

What is the highest MCP Atlas score?

The highest MCP Atlas score is 0.838, achieved by Seed 2.1 Pro from ByteDance.

How many models are evaluated on MCP Atlas?

27 models have been evaluated on the MCP Atlas benchmark, with 0 verified results and 27 self-reported results.

What categories does MCP Atlas cover?

MCP Atlas is categorized under reasoning, agents, code, and tool calling. The benchmark evaluates text models.

What is the best open-source model on MCP Atlas?

GLM-5.2 by Zhipu AI is the top-ranked open-source model on MCP Atlas, with a score of 0.768 (rank #6).

Which model offers the best value on MCP Atlas?

Among models scoring within 10% of the leader, Kimi K2.7 Code from Moonshot AI is the cheapest, at $0.74 per million input tokens with a score of 0.760.

How recent are the MCP Atlas leaderboard results?

The MCP Atlas leaderboard was last updated in July 2026 and currently includes 27 evaluated models.