LSAT
What is LSAT?
LSAT (Law School Admission Test) benchmark evaluating complex reasoning capabilities across three challenging tasks: analytical reasoning, logical reasoning, and reading comprehension. The LSAT measures skills considered essential for success in law school including critical thinking, reading comprehension of complex texts, and analysis of arguments.
LSAT is a text benchmark evaluating models on legal, reasoning, and general tasks. LLM Stats tracks 1 models on this benchmark, scored on a 0–1 scale. The current average is 0.9, with the leader at 0.9.
Compare leaders on the best AI for legal, best AI for reasoning and best AI for general leaderboards.
Current leaders
GPT-4 from OpenAI currently leads the LSAT leaderboard with a score of 0.880 across 1 evaluated AI models.
Source paper
- Title
- From LSAT: The Progress and Challenges of Complex Reasoning
- Authors
- Siyuan Wang, Zhongkun Liu, Wanjun Zhong, Ming Zhou, and 3 others
- Published
- arXiv
- 2108.00648
Abstract
Complex reasoning aims to draw a correct inference based on complex rules. As a hallmark of human intelligence, it involves a degree of explicit reading comprehension, interpretation of logical knowledge and complex rule application. In this paper, we take a step forward in complex reasoning by systematically studying the three challenging and domain-general tasks of the Law School Admission Test (LSAT), including analytical reasoning, logical reasoning and reading comprehension. We propose a hybrid reasoning system to integrate these three tasks and achieve impressive overall performance on the LSAT tests. The experimental results demonstrate that our system endows itself a certain complex reasoning ability, especially the fundamental reading comprehension and challenging logical reasoning capacities. Further analysis also shows the effectiveness of combining the pre-trained models with the task-specific reasoning module, and integrating symbolic knowledge into discrete interpretable reasoning steps in complex reasoning. We further shed a light on the potential future directions, like unsupervised symbolic knowledge extraction, model interpretability, few-shot learning and comprehensive benchmark for complex reasoning.
FAQ
Common questions about the LSAT benchmark and leaderboard.