Kimi Claw 24/7 Bench

Name: Kimi Claw 24/7 Bench Leaderboard — AI Model Scores
Creator: LLM Stats
License: https://llm-stats.com/legal/terms-of-service

Progress Over Time

Interactive timeline showing model performance evolution on Kimi Claw 24/7 Bench

State-of-the-art frontier

Open

Proprietary

Kimi Claw 24/7 Bench Leaderboard

1 models

				Context	Cost	License
1	Kimi K2.7 Code Moonshot AI		1.0T	262K	$0.74 / $3.50

Notice missing or incorrect data?

About this benchmark

What is Kimi Claw 24/7 Bench?

Kimi Claw 24/7 Bench is Moonshot AI's in-house benchmark for evaluating long-horizon agentic performance in persistent, multi-day coworking tasks. It spans 17 professional scenarios across 610 evaluation points, covering software engineering, ML research, recruiting, trading, and marketing tasks executed through the OpenClaw harness.

Kimi Claw 24/7 Bench is a text benchmark evaluating models on agents and code tasks. LLM Stats tracks 1 models on this benchmark, scored on a 0–1 scale. The current average is 0.5, with the leader at 0.5.

Compare leaders on the best AI for agents and best AI for code leaderboards.

Current leaders

Kimi K2.7 Code from Moonshot AI currently leads the Kimi Claw 24/7 Bench leaderboard with a score of 0.469 across 1 evaluated AI models.

Kimi K2.7 CodeMoonshot AI46.9%

FAQ

Common questions about the Kimi Claw 24/7 Bench benchmark and leaderboard.

What is the Kimi Claw 24/7 Bench benchmark?

What is the Kimi Claw 24/7 Bench leaderboard?

The Kimi Claw 24/7 Bench leaderboard ranks 1 AI models based on their performance on this benchmark. Currently, Kimi K2.7 Code by Moonshot AI leads with a score of 0.469. The average score across all models is 0.469.

What is the highest Kimi Claw 24/7 Bench score?

The highest Kimi Claw 24/7 Bench score is 0.469, achieved by Kimi K2.7 Code from Moonshot AI.

How many models are evaluated on Kimi Claw 24/7 Bench?

1 models have been evaluated on the Kimi Claw 24/7 Bench benchmark, with 0 verified results and 1 self-reported results.

What categories does Kimi Claw 24/7 Bench cover?

Kimi Claw 24/7 Bench is categorized under agents and code. The benchmark evaluates text models.

What's the difference between Kimi Claw 24/7 Bench and Claw-Eval?

Kimi Claw 24/7 Bench is a variant of Claw-Eval. See the Claw-Eval leaderboard for the broader benchmark and per-model comparison.

What is the best open-source model on Kimi Claw 24/7 Bench?

Kimi K2.7 Code by Moonshot AI is the top-ranked open-source model on Kimi Claw 24/7 Bench, with a score of 0.469 (rank #1).

Which model offers the best value on Kimi Claw 24/7 Bench?

Among models scoring within 10% of the leader, Kimi K2.7 Code from Moonshot AI is the cheapest, at $0.74 per million input tokens with a score of 0.469.

How recent are the Kimi Claw 24/7 Bench leaderboard results?

The Kimi Claw 24/7 Bench leaderboard was last updated in July 2026 and currently includes 1 evaluated models.