CFEval

Progress Over Time

[Interactive timeline of model performance on CFEval; series: state-of-the-art frontier, open models, proprietary models.]

CFEval Leaderboard

2 models

Rank | Model | Organization | Parameters
1 | Qwen3-235B-A22B-Thinking-2507 | Alibaba Cloud / Qwen Team | 235B
2 | Qwen3-Next-80B-A3B-Thinking | Alibaba Cloud / Qwen Team | 80B
About this benchmark

What is CFEval?

CFEval is a benchmark for evaluating code generation and problem-solving capabilities.

CFEval is a text benchmark evaluating models on code tasks. LLM Stats tracks 2 models on this benchmark, scored on a 0–10000 scale. The current average is 2102.5, with the leader at 2134.0.

Current leaders

Qwen3-235B-A22B-Thinking-2507 from Alibaba Cloud / Qwen Team currently leads the CFEval leaderboard with a score of 2134.000, ahead of the one other evaluated model.

Rank | Model | Organization | Score
1 | Qwen3-235B-A22B-Thinking-2507 | Alibaba Cloud / Qwen Team | 2134.000
2 | Qwen3-Next-80B-A3B-Thinking | Alibaba Cloud / Qwen Team | 2071.000
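
The summary figures quoted on this page (leader and average) can be reproduced from the two listed scores. The following is a minimal Python sketch under that assumption; the `scores` dictionary and the `summarize` helper are illustrative names, not part of any LLM Stats API.

```python
from statistics import mean

# Self-reported CFEval scores as listed on this page.
scores = {
    "Qwen3-235B-A22B-Thinking-2507": 2134.0,
    "Qwen3-Next-80B-A3B-Thinking": 2071.0,
}

def summarize(scores):
    """Return (leader_name, leader_score, average_score) for a score table."""
    leader = max(scores, key=scores.get)  # model with the highest score
    return leader, scores[leader], mean(scores.values())

leader, top, avg = summarize(scores)
print(f"{leader}: {top:.1f} (average {avg:.1f})")
```

With only two entries, the average is simply the midpoint of the two scores, so it will shift noticeably as soon as a third model is added.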

FAQ

Common questions about the CFEval benchmark and leaderboard.

What is the CFEval benchmark?

CFEval is a benchmark for evaluating code generation and problem-solving capabilities.

What is the CFEval leaderboard?

The CFEval leaderboard ranks 2 AI models based on their performance on this benchmark. Currently, Qwen3-235B-A22B-Thinking-2507 by Alibaba Cloud / Qwen Team leads with a score of 2134.000. The average score across all models is 2102.500.

What is the highest CFEval score?

The highest CFEval score is 2134.000, achieved by Qwen3-235B-A22B-Thinking-2507 from Alibaba Cloud / Qwen Team.

How many models are evaluated on CFEval?

2 models have been evaluated on the CFEval benchmark, with 0 verified results and 2 self-reported results.

What categories does CFEval cover?

CFEval is categorized under code. The benchmark evaluates text models.

What is the best open-source model on CFEval?

Qwen3-235B-A22B-Thinking-2507 by Alibaba Cloud / Qwen Team is the top-ranked open-source model on CFEval, with a score of 2134.000 (rank #1).

How recent are the CFEval leaderboard results?

The CFEval leaderboard was last updated in July 2026 and currently includes 2 evaluated models.