LSAT

Paper

Progress Over Time

Interactive timeline showing model performance evolution on LSAT

State-of-the-art frontier
Open
Proprietary

LSAT Leaderboard

1 models
ContextCostLicense
1
OpenAI
OpenAI
Notice missing or incorrect data?
About this benchmark

What is LSAT?

LSAT (Law School Admission Test) benchmark evaluating complex reasoning capabilities across three challenging tasks: analytical reasoning, logical reasoning, and reading comprehension. The LSAT measures skills considered essential for success in law school including critical thinking, reading comprehension of complex texts, and analysis of arguments.

LSAT is a text benchmark evaluating models on legal, reasoning, and general tasks. LLM Stats tracks 1 models on this benchmark, scored on a 0–1 scale. The current average is 0.9, with the leader at 0.9.

Compare leaders on the best AI for legal, best AI for reasoning and best AI for general leaderboards.

Current leaders

GPT-4 from OpenAI currently leads the LSAT leaderboard with a score of 0.880 across 1 evaluated AI models.

1GPT-4OpenAI88.0%

Source paper

Title
From LSAT: The Progress and Challenges of Complex Reasoning
Authors
Siyuan Wang, Zhongkun Liu, Wanjun Zhong, Ming Zhou, and 3 others
Published
Abstract

Complex reasoning aims to draw a correct inference based on complex rules. As a hallmark of human intelligence, it involves a degree of explicit reading comprehension, interpretation of logical knowledge and complex rule application. In this paper, we take a step forward in complex reasoning by systematically studying the three challenging and domain-general tasks of the Law School Admission Test (LSAT), including analytical reasoning, logical reasoning and reading comprehension. We propose a hybrid reasoning system to integrate these three tasks and achieve impressive overall performance on the LSAT tests. The experimental results demonstrate that our system endows itself a certain complex reasoning ability, especially the fundamental reading comprehension and challenging logical reasoning capacities. Further analysis also shows the effectiveness of combining the pre-trained models with the task-specific reasoning module, and integrating symbolic knowledge into discrete interpretable reasoning steps in complex reasoning. We further shed a light on the potential future directions, like unsupervised symbolic knowledge extraction, model interpretability, few-shot learning and comprehensive benchmark for complex reasoning.

FAQ

Common questions about the LSAT benchmark and leaderboard.

What is the LSAT benchmark?

LSAT (Law School Admission Test) benchmark evaluating complex reasoning capabilities across three challenging tasks: analytical reasoning, logical reasoning, and reading comprehension. The LSAT measures skills considered essential for success in law school including critical thinking, reading comprehension of complex texts, and analysis of arguments.

What is the LSAT leaderboard?

The LSAT leaderboard ranks 1 AI models based on their performance on this benchmark. Currently, GPT-4 by OpenAI leads with a score of 0.880. The average score across all models is 0.880.

What is the highest LSAT score?

The highest LSAT score is 0.880, achieved by GPT-4 from OpenAI.

How many models are evaluated on LSAT?

1 models have been evaluated on the LSAT benchmark, with 0 verified results and 1 self-reported results.

Where can I find the LSAT paper?

The LSAT paper is available at https://arxiv.org/abs/2108.00648. The paper details the methodology, dataset construction, and evaluation criteria.

What categories does LSAT cover?

LSAT is categorized under legal, reasoning, and general. The benchmark evaluates text models.

How recent are the LSAT leaderboard results?

The LSAT leaderboard was last updated in July 2026 and currently includes 1 evaluated models.