Evaluating Customized vs. Generalist Transformer-based Models for Legal Contract Classification

Amrita Singh; H. Suhan Karaca; Aditya Joshi; Hye-young Paik; Jiaojiao Jiang

Evaluating Customized vs. Generalist Transformer-based Models for Legal Contract Classification

Amrita Singh, H. Suhan Karaca, Aditya Joshi, Hye-young Paik, Jiaojiao Jiang

Abstract

Despite advances in legal NLP, no comprehensive evaluation of Transformer-based models customized for legal tasks (referred to as ’legal-specific’ models in this paper) exists for contract classification tasks. To address this gap, we present an evaluation of 13 legal-specific transformer-based models on 3 English-language contract classification tasks and compare them with 9 generalist models. The results show that legal-specific models consistently outperform generalist models, especially on tasks requiring nuanced legal understanding. They also help reduce misclassification of rare classes in imbalanced datasets. Legal-BERT and Contracts-BERT establish new SOTAs on two of the three tasks, despite having 69% fewer parameters than the best-performing generalist models. We also identify CaseLaw-BERT and LexLM as strong additional baselines for contract classification. Our results highlight the shortcomings of generalist models, emphasizing the need for domain-specific customization, particularly in the context of legal applications.

Anthology ID:: 2026.customnlp4u-1.5
Volume:: Proceedings of the Second Workshop on Customizable NLP: Progress and Challenges in Customizing NLP for a Domain, Application, Group, or Individual (CustomNLP4U)
Month:: July
Year:: 2026
Address:: San Diego, California, USA
Editors:: Sheshera Mysore, Sachin Kumar, Vidhisha Balachandran, Shirley Anugrah Hayati, Faeze Brahman, Hanane Nour Moussa, Alireza Salemi
Venues:: CustomNLP4U | WS
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 44–54
Language:
URL:: https://aclanthology.org/2026.customnlp4u-1.5/
DOI:
Bibkey:
Cite (ACL):: Amrita Singh, H. Suhan Karaca, Aditya Joshi, Hye-young Paik, and Jiaojiao Jiang. 2026. Evaluating Customized vs. Generalist Transformer-based Models for Legal Contract Classification. In Proceedings of the Second Workshop on Customizable NLP: Progress and Challenges in Customizing NLP for a Domain, Application, Group, or Individual (CustomNLP4U), pages 44–54, San Diego, California, USA. Association for Computational Linguistics.
Cite (Informal):: Evaluating Customized vs. Generalist Transformer-based Models for Legal Contract Classification (Singh et al., CustomNLP4U 2026)
Copy Citation:
PDF:: https://aclanthology.org/2026.customnlp4u-1.5.pdf

PDF Cite Search Fix data