The CLC-UKET Dataset: Benchmarking Case Outcome Prediction for the UK Employment Tribunal

Huiyuan Xie, Felix Steffek, Joana De Faria, Christine Carter, Jonathan Rutherford


Abstract
This paper explores the intersection of technological innovation and access to justice by developing a benchmark for predicting case outcomes in the UK Employment Tribunal (UKET). To address the challenge of extensive manual annotation, the study employs a large language model (LLM) for automatic annotation, resulting in the creation of the CLC-UKET dataset. The dataset consists of approximately 19,000 UKET cases and their metadata. Comprehensive legal annotations cover facts, claims, precedent references, statutory references, case outcomes, reasons and jurisdiction codes. Facilitated by the CLC-UKET data, we examine a multi-class case outcome prediction task in the UKET. Human predictions are collected to establish a performance reference for model comparison. Empirical results from baseline models indicate that finetuned transformer models outperform zero-shot and few-shot LLMs on the UKET prediction task. The performance of zero-shot LLMs can be enhanced by integrating task-related information into few-shot examples. We hope that the CLC-UKET dataset, along with human annotations and empirical findings, can serve as a valuable benchmark for employment-related dispute resolution.
Anthology ID:
2024.nllp-1.7
Volume:
Proceedings of the Natural Legal Language Processing Workshop 2024
Month:
November
Year:
2024
Address:
Miami, FL, USA
Editors:
Nikolaos Aletras, Ilias Chalkidis, Leslie Barrett, Cătălina Goanță, Daniel Preoțiuc-Pietro, Gerasimos Spanakis
Venue:
NLLP
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
81–96
Language:
URL:
https://aclanthology.org/2024.nllp-1.7
DOI:
Bibkey:
Cite (ACL):
Huiyuan Xie, Felix Steffek, Joana De Faria, Christine Carter, and Jonathan Rutherford. 2024. The CLC-UKET Dataset: Benchmarking Case Outcome Prediction for the UK Employment Tribunal. In Proceedings of the Natural Legal Language Processing Workshop 2024, pages 81–96, Miami, FL, USA. Association for Computational Linguistics.
Cite (Informal):
The CLC-UKET Dataset: Benchmarking Case Outcome Prediction for the UK Employment Tribunal (Xie et al., NLLP 2024)
Copy Citation:
PDF:
https://aclanthology.org/2024.nllp-1.7.pdf