SWE Atlas - Test Writing

Name: SWE Atlas - Test Writing Leaderboard — AI Model Scores
Creator: LLM Stats
License: https://llm-stats.com/legal/terms-of-service

Progress Over Time

[Interactive timeline: model performance on SWE Atlas - Test Writing over time, showing the state-of-the-art frontier with open and proprietary models distinguished.]

SWE Atlas - Test Writing Leaderboard

1 model

Rank	Model	Organization		Context	Cost	License
1	MiniMax M3	MiniMax	—	1.0M	$0.30 / $1.20

About this benchmark

What is SWE Atlas - Test Writing?

SWE Atlas - Test Writing evaluates a model's ability to author meaningful tests for real-world software projects, measuring how well agents can understand code and produce correct, useful test coverage.

SWE Atlas - Test Writing is a text benchmark covering agentic and coding tasks. LLM Stats tracks 1 model on this benchmark, scored on a 0–1 scale. The current average is 0.308, which is also the leader's score.

Current leaders

MiniMax M3 from MiniMax currently leads the SWE Atlas - Test Writing leaderboard with a score of 0.308; it is the only model evaluated so far.

MiniMax M3 (MiniMax): 30.8%

FAQ

Common questions about the SWE Atlas - Test Writing benchmark and leaderboard.

What is the SWE Atlas - Test Writing benchmark?

What is the SWE Atlas - Test Writing leaderboard?

The SWE Atlas - Test Writing leaderboard ranks AI models by their performance on this benchmark. It currently lists 1 model, MiniMax M3 by MiniMax, which leads with a score of 0.308. With a single entry, the average score is also 0.308.
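
As a rough illustration of how the leader, the average, and the percentage form of a score relate, here is a minimal Python sketch. The `summarize` function and the `scores` dictionary are hypothetical names introduced for this example; only the MiniMax M3 score of 0.308 comes from the leaderboard, and this is not LLM Stats' actual code.

```python
from statistics import mean


def summarize(scores):
    """Return (leader, leader_score, average) for a {model: score} mapping.

    Scores are assumed to be on the 0-1 scale used by the leaderboard.
    """
    leader = max(scores, key=scores.get)       # model with the highest score
    average = mean(list(scores.values()))      # arithmetic mean over all models
    return leader, scores[leader], average


# The only entry currently on the leaderboard.
scores = {"MiniMax M3": 0.308}

leader, leader_score, average = summarize(scores)
print(f"{leader}: {leader_score} ({leader_score:.1%}), average {average}")
```

With one model on the board, the leader's score and the average coincide, which is why both are reported as 0.308.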

What is the highest SWE Atlas - Test Writing score?

The highest SWE Atlas - Test Writing score is 0.308, achieved by MiniMax M3 from MiniMax.

How many models are evaluated on SWE Atlas - Test Writing?

1 model has been evaluated on the SWE Atlas - Test Writing benchmark. Its result is self-reported; there are no verified results yet.

What categories does SWE Atlas - Test Writing cover?

SWE Atlas - Test Writing is categorized under agents and code. The benchmark evaluates text models.

What is the best open-source model on SWE Atlas - Test Writing?

MiniMax M3 by MiniMax is the top-ranked open-source model on SWE Atlas - Test Writing, with a score of 0.308 (rank #1).

Which model offers the best value on SWE Atlas - Test Writing?

Among models scoring within 10% of the leader, MiniMax M3 from MiniMax is the cheapest, at $0.30 per million input tokens with a score of 0.308. Because it is the only model evaluated, it is both the leader and the best-value pick.
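
The best-value rule described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: "within 10% of the leader" is read here as a score of at least 90% of the leader's score, and the `Entry` class and `best_value` function are hypothetical names, not part of any LLM Stats API. The single data row is taken from the leaderboard.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Entry:
    model: str
    score: float        # benchmark score on the 0-1 scale
    input_cost: float   # USD per million input tokens


def best_value(entries, tolerance=0.10):
    """Return the cheapest entry scoring within `tolerance` of the leader.

    Assumption: the tolerance is relative to the leader's score, so an
    entry qualifies when score >= leader_score * (1 - tolerance).
    """
    if not entries:
        return None
    top = max(e.score for e in entries)
    eligible = [e for e in entries if e.score >= top * (1 - tolerance)]
    # Among eligible entries, pick the lowest input-token price.
    return min(eligible, key=lambda e: e.input_cost)


entries = [Entry("MiniMax M3", 0.308, 0.30)]
pick = best_value(entries)
print(pick.model, pick.input_cost)
```

Ranking by input price alone mirrors the wording of the answer above; a rule that also weighed output-token price would need a different key function.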

How recent are the SWE Atlas - Test Writing leaderboard results?

The SWE Atlas - Test Writing leaderboard was last updated in July 2026 and currently includes 1 evaluated model.