CL-bench

Name: CL-bench Leaderboard — AI Model Scores
Creator: LLM Stats
License: https://llm-stats.com/legal/terms-of-service

Progress Over Time

Interactive timeline showing model performance evolution on CL-bench

State-of-the-art frontier

Open

Proprietary

CL-bench Leaderboard

2 models

				Context	Cost	License
1	Hy3 Tencent		295B	—	—
2	MiniMax M3 MiniMax		—	1.0M	$0.30 / $1.20

Notice missing or incorrect data?

Sub-benchmarks

CL-bench (Life)

CL-bench Life variant.

text•Max 1

About this benchmark

What is CL-bench?

CL-bench is an open-source benchmark with its own data and rubrics for evaluating models on coding and agentic tasks, scored using a setup fully aligned with the official procedure.

CL-bench is a text benchmark evaluating models on agents and code tasks. LLM Stats tracks 2 models on this benchmark, scored on a 0–1 scale. The current average is 0.2, with the leader at 0.2.

Compare leaders on the best AI for agents and best AI for code leaderboards.

Current leaders

Hy3 from Tencent currently leads the CL-bench leaderboard with a score of 0.238 across 2 evaluated AI models.

Hy3Tencent23.8%

MiniMax M3MiniMax20.5%

FAQ

Common questions about the CL-bench benchmark and leaderboard.

What is the CL-bench benchmark?

CL-bench is an open-source benchmark with its own data and rubrics for evaluating models on coding and agentic tasks, scored using a setup fully aligned with the official procedure.

What is the CL-bench leaderboard?

The CL-bench leaderboard ranks 2 AI models based on their performance on this benchmark. Currently, Hy3 by Tencent leads with a score of 0.238. The average score across all models is 0.221.

What is the highest CL-bench score?

The highest CL-bench score is 0.238, achieved by Hy3 from Tencent.

How many models are evaluated on CL-bench?

2 models have been evaluated on the CL-bench benchmark, with 0 verified results and 2 self-reported results.

What categories does CL-bench cover?

CL-bench is categorized under agents and code. The benchmark evaluates text models.

Are there variants of CL-bench?

Yes. CL-bench has 1 related variant: CL-bench (Life).

What is the best open-source model on CL-bench?

Hy3 by Tencent is the top-ranked open-source model on CL-bench, with a score of 0.238 (rank #1).

How recent are the CL-bench leaderboard results?

The CL-bench leaderboard was last updated in July 2026 and currently includes 2 evaluated models.