NESTLE: a No-Code Tool for Statistical Analysis of Legal Corpus

Kyoungyeon Cho, Seungkum Han, Young Rok Choi, Wonseok Hwang


Abstract
Statistical analysis of a large-scale legal corpus can provide valuable legal insights. Such analysis requires one to (1) select a subset of the corpus using document retrieval tools, (2) structure the text using information extraction (IE) systems, and (3) visualize the data for statistical analysis. Each step demands either specialized tools or programming skills, yet no comprehensive, unified “no-code” tool has been available. Here we present NESTLE, a no-code tool for large-scale statistical analysis of legal corpora. Powered by a Large Language Model (LLM) and an internal custom end-to-end IE system, NESTLE can extract any type of information, even if it has not been predefined in the IE system, opening up the possibility of unlimited, customizable statistical analysis of the corpus without writing a single line of code. We validate our system on 15 Korean precedent IE tasks and 3 legal text classification tasks from LexGLUE. Comprehensive experiments reveal that NESTLE can achieve performance comparable to GPT-4 by training the internal IE module with 4 human-labeled and 192 LLM-labeled examples.
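The three steps listed in the abstract (select, structure, analyze) can be illustrated with a toy sketch. This is not NESTLE's implementation: the corpus, the function names, the keyword match standing in for document retrieval, the regular expression standing in for the IE system, and the summary dictionary standing in for visualization are all assumptions made for illustration.

```python
import re

# Toy corpus: hypothetical case summaries, not real precedents.
CORPUS = [
    "Fraud case. The defendant was sentenced to a fine of 500 dollars.",
    "Fraud case. The defendant was sentenced to a fine of 1200 dollars.",
    "Assault case. The defendant was sentenced to a fine of 300 dollars.",
    "Fraud case. The defendant was acquitted.",
]

# Hypothetical pattern for one field of interest (the fine amount).
FINE_PATTERN = re.compile(r"fine of (\d+) dollars")


def select_subset(corpus, keyword):
    """Step 1: keyword match as a stand-in for a document retrieval tool."""
    return [doc for doc in corpus if keyword.lower() in doc.lower()]


def extract_fine(doc):
    """Step 2: regex as a stand-in for an IE system; None if no fine is found."""
    match = FINE_PATTERN.search(doc)
    return int(match.group(1)) if match else None


def summarize(values):
    """Step 3: simple aggregates as a stand-in for visualization."""
    found = [v for v in values if v is not None]
    return {
        "documents": len(values),
        "with_fine": len(found),
        "mean_fine": sum(found) / len(found) if found else None,
    }


def run_pipeline(corpus, keyword):
    """Chain the three steps: select, structure, then aggregate."""
    subset = select_subset(corpus, keyword)
    return summarize([extract_fine(doc) for doc in subset])


if __name__ == "__main__":
    print(run_pipeline(CORPUS, "fraud"))
```

In this sketch every step is hand-written code; the abstract's point is that a no-code tool would let a user perform the same kind of analysis without writing any of it.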
Anthology ID:
2024.eacl-demo.7
Volume:
Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics: System Demonstrations
Month:
March
Year:
2024
Address:
St. Julians, Malta
Editors:
Nikolaos Aletras, Orphee De Clercq
Venue:
EACL
Publisher:
Association for Computational Linguistics
Pages:
52–61
URL:
https://aclanthology.org/2024.eacl-demo.7
Cite (ACL):
Kyoungyeon Cho, Seungkum Han, Young Rok Choi, and Wonseok Hwang. 2024. NESTLE: a No-Code Tool for Statistical Analysis of Legal Corpus. In Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics: System Demonstrations, pages 52–61, St. Julians, Malta. Association for Computational Linguistics.
Cite (Informal):
NESTLE: a No-Code Tool for Statistical Analysis of Legal Corpus (Cho et al., EACL 2024)
PDF:
https://aclanthology.org/2024.eacl-demo.7.pdf