@inproceedings{juvekar-etal-2025-llms,
title = "Are {LLM}s Court-Ready? Evaluating Frontier Models on {I}ndian Legal Reasoning",
author = "Juvekar, Kush and
Bhattacharya, Arghya and
Khadloya, Sai and
Saxena, Utkarsh",
editor = "Aletras, Nikolaos and
Chalkidis, Ilias and
Barrett, Leslie and
Goanță, Cătălina and
Preoțiuc-Pietro, Daniel and
Spanakis, Gerasimos",
booktitle = "Proceedings of the Natural Legal Language Processing Workshop 2025",
month = nov,
year = "2025",
address = "Suzhou, China",
publisher = "Association for Computational Linguistics",
url = "https://aclanthology.org/2025.nllp-1.26/",
pages = "359--369",
ISBN = "979-8-89176-338-8",
abstract = "Large language models (LLMs) are moving into legal workflows, yet we lack a jurisdiction-grounded way to gauge their basic competence therein. We use India{'}s public legal examinations as a transparent proxy. Our multi-year benchmark assembles objective screens from top national and state exams and evaluates open and frontier LLMs under real-world exam conditions. To probe beyond MCQs, we also include a lawyer-graded, paired-blinded study of long-form answers from the Supreme Court{'}s Advocate-on-Record exam. This is, to our knowledge, the first exam-grounded, India-specific yardstick for LLM court-readiness released with datasets and protocols. Our work shows that while frontier systems consistently clear historical cutoffs and often match or exceed recent top-scorer bands on objective exams, none surpasses the human topper on long-form reasoning. Grader notes converge on three reliability failure modes{---}procedural/format compliance, authority/citation discipline, and forum-appropriate voice/structure. These findings delineate where LLMs can assist (checks, cross-statute consistency, statute and precedent lookups) and where human leadership remains essential: forum-specific drafting and filing, procedural and relief strategy, reconciling authorities and exceptions, and ethical, accountable judgment."
}

<?xml version="1.0" encoding="UTF-8"?>
<modsCollection xmlns="http://www.loc.gov/mods/v3">
<mods ID="juvekar-etal-2025-llms">
<titleInfo>
<title>Are LLMs Court-Ready? Evaluating Frontier Models on Indian Legal Reasoning</title>
</titleInfo>
<name type="personal">
<namePart type="given">Kush</namePart>
<namePart type="family">Juvekar</namePart>
<role>
<roleTerm authority="marcrelator" type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Arghya</namePart>
<namePart type="family">Bhattacharya</namePart>
<role>
<roleTerm authority="marcrelator" type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Sai</namePart>
<namePart type="family">Khadloya</namePart>
<role>
<roleTerm authority="marcrelator" type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Utkarsh</namePart>
<namePart type="family">Saxena</namePart>
<role>
<roleTerm authority="marcrelator" type="text">author</roleTerm>
</role>
</name>
<originInfo>
<dateIssued>2025-11</dateIssued>
</originInfo>
<typeOfResource>text</typeOfResource>
<relatedItem type="host">
<titleInfo>
<title>Proceedings of the Natural Legal Language Processing Workshop 2025</title>
</titleInfo>
<name type="personal">
<namePart type="given">Nikolaos</namePart>
<namePart type="family">Aletras</namePart>
<role>
<roleTerm authority="marcrelator" type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Ilias</namePart>
<namePart type="family">Chalkidis</namePart>
<role>
<roleTerm authority="marcrelator" type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Leslie</namePart>
<namePart type="family">Barrett</namePart>
<role>
<roleTerm authority="marcrelator" type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Cătălina</namePart>
<namePart type="family">Goanță</namePart>
<role>
<roleTerm authority="marcrelator" type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Daniel</namePart>
<namePart type="family">Preoțiuc-Pietro</namePart>
<role>
<roleTerm authority="marcrelator" type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Gerasimos</namePart>
<namePart type="family">Spanakis</namePart>
<role>
<roleTerm authority="marcrelator" type="text">editor</roleTerm>
</role>
</name>
<originInfo>
<publisher>Association for Computational Linguistics</publisher>
<place>
<placeTerm type="text">Suzhou, China</placeTerm>
</place>
</originInfo>
<genre authority="marcgt">conference publication</genre>
<identifier type="isbn">979-8-89176-338-8</identifier>
</relatedItem>
<abstract>Large language models (LLMs) are moving into legal workflows, yet we lack a jurisdiction-grounded way to gauge their basic competence therein. We use India’s public legal examinations as a transparent proxy. Our multi-year benchmark assembles objective screens from top national and state exams and evaluates open and frontier LLMs under real-world exam conditions. To probe beyond MCQs, we also include a lawyer-graded, paired-blinded study of long-form answers from the Supreme Court’s Advocate-on-Record exam. This is, to our knowledge, the first exam-grounded, India-specific yardstick for LLM court-readiness released with datasets and protocols. Our work shows that while frontier systems consistently clear historical cutoffs and often match or exceed recent top-scorer bands on objective exams, none surpasses the human topper on long-form reasoning. Grader notes converge on three reliability failure modes—procedural/format compliance, authority/citation discipline, and forum-appropriate voice/structure. These findings delineate where LLMs can assist (checks, cross-statute consistency, statute and precedent lookups) and where human leadership remains essential: forum-specific drafting and filing, procedural and relief strategy, reconciling authorities and exceptions, and ethical, accountable judgment.</abstract>
<identifier type="citekey">juvekar-etal-2025-llms</identifier>
<location>
<url>https://aclanthology.org/2025.nllp-1.26/</url>
</location>
<part>
<date>2025-11</date>
<extent unit="page">
<start>359</start>
<end>369</end>
</extent>
</part>
</mods>
</modsCollection>
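The MODS record above is namespaced XML; below is a minimal sketch of pulling the title and author list out of it with Python's standard library (the filename is a hypothetical placeholder).

```python
import xml.etree.ElementTree as ET

# MODS v3 elements live in this namespace; ElementTree needs it spelled out.
NS = {"m": "http://www.loc.gov/mods/v3"}

# Hypothetical filename; save the <modsCollection> record above to it first.
tree = ET.parse("juvekar-etal-2025-llms.xml")
mods = tree.getroot().find("m:mods", NS)

title = mods.find("m:titleInfo/m:title", NS).text

# Direct children of <mods> are the authors; the editors sit one level
# down, under <relatedItem>, so they are not matched here.
authors = [
    " ".join(part.text for part in name.findall("m:namePart", NS))
    for name in mods.findall("m:name", NS)
    if name.find("m:role/m:roleTerm", NS).text == "author"
]

print(title)    # Are LLMs Court-Ready? Evaluating Frontier Models on Indian Legal Reasoning
print(authors)  # ['Kush Juvekar', 'Arghya Bhattacharya', 'Sai Khadloya', 'Utkarsh Saxena']
```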
%0 Conference Proceedings
%T Are LLMs Court-Ready? Evaluating Frontier Models on Indian Legal Reasoning
%A Juvekar, Kush
%A Bhattacharya, Arghya
%A Khadloya, Sai
%A Saxena, Utkarsh
%Y Aletras, Nikolaos
%Y Chalkidis, Ilias
%Y Barrett, Leslie
%Y Goanță, Cătălina
%Y Preoțiuc-Pietro, Daniel
%Y Spanakis, Gerasimos
%S Proceedings of the Natural Legal Language Processing Workshop 2025
%D 2025
%8 November
%I Association for Computational Linguistics
%C Suzhou, China
%@ 979-8-89176-338-8
%F juvekar-etal-2025-llms
%X Large language models (LLMs) are moving into legal workflows, yet we lack a jurisdiction-grounded way to gauge their basic competence therein. We use India’s public legal examinations as a transparent proxy. Our multi-year benchmark assembles objective screens from top national and state exams and evaluates open and frontier LLMs under real-world exam conditions. To probe beyond MCQs, we also include a lawyer-graded, paired-blinded study of long-form answers from the Supreme Court’s Advocate-on-Record exam. This is, to our knowledge, the first exam-grounded, India-specific yardstick for LLM court-readiness released with datasets and protocols. Our work shows that while frontier systems consistently clear historical cutoffs and often match or exceed recent top-scorer bands on objective exams, none surpasses the human topper on long-form reasoning. Grader notes converge on three reliability failure modes—procedural/format compliance, authority/citation discipline, and forum-appropriate voice/structure. These findings delineate where LLMs can assist (checks, cross-statute consistency, statute and precedent lookups) and where human leadership remains essential: forum-specific drafting and filing, procedural and relief strategy, reconciling authorities and exceptions, and ethical, accountable judgment.
%U https://aclanthology.org/2025.nllp-1.26/
%P 359-369
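The refer/EndNote tags above (%0, %T, %A, ...) are line-oriented and straightforward to parse by hand; here is a minimal, dependency-free sketch (the `refer_text` variable in the usage comment is a hypothetical name for the record stored as a string).

```python
def parse_refer(text):
    """Parse a refer/EndNote-style record into a tag -> values mapping.

    Repeatable tags such as %A (author) and %Y (editor) accumulate in
    order; single-valued tags end up as one-element lists.
    """
    record = {}
    for line in text.splitlines():
        if line.startswith("%") and len(line) >= 2:
            tag, _, value = line.partition(" ")
            record.setdefault(tag, []).append(value.strip())
    return record

# Usage sketch, with the record above stored in a string named refer_text:
# parse_refer(refer_text)["%A"] -> ['Juvekar, Kush', 'Bhattacharya, Arghya', ...]
# parse_refer(refer_text)["%T"] -> ['Are LLMs Court-Ready? ...']
```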
Markdown (Informal)
[Are LLMs Court-Ready? Evaluating Frontier Models on Indian Legal Reasoning](https://aclanthology.org/2025.nllp-1.26/) (Juvekar et al., NLLP 2025)
ACL
Kush Juvekar, Arghya Bhattacharya, Sai Khadloya, and Utkarsh Saxena. 2025. [Are LLMs Court-Ready? Evaluating Frontier Models on Indian Legal Reasoning](https://aclanthology.org/2025.nllp-1.26/). In *Proceedings of the Natural Legal Language Processing Workshop 2025*, pages 359–369, Suzhou, China. Association for Computational Linguistics.
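Finally, to consume the BibTeX record at the top of this page programmatically, a minimal sketch assuming the bibtexparser 1.x package (`pip install bibtexparser`); the filename is a hypothetical placeholder.

```python
import bibtexparser

# Hypothetical filename; save the @inproceedings entry above to it first.
with open("juvekar-etal-2025-llms.bib") as f:
    db = bibtexparser.load(f)

entry = db.entries[0]  # a dict of lowercase field names, plus ID and ENTRYTYPE
# Field values keep their LaTeX escapes (e.g. {LLM}s) unless post-processed,
# for instance with bibtexparser.customization.convert_to_unicode.
print(entry["ID"])     # juvekar-etal-2025-llms
print(entry["title"])
print(entry["pages"])  # 359--369
```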