@inproceedings{singh-etal-2025-findings,
title = "Findings of the {JUST}-{NLP} 2025 Shared Task on {E}nglish-to-{H}indi Legal Machine Translation",
author = "Singh, Kshetrimayum Boynao and
Kumar, Sandeep and
Datta, Debtanu and
Joshi, Abhinav and
Mishra, Shivani and
Paul, Shounak and
Goyal, Pawan and
Jain, Sarika and
Ghosh, Saptarshi and
Modi, Ashutosh and
Ekbal, Asif",
editor = "Modi, Ashutosh and
Ghosh, Saptarshi and
Ekbal, Asif and
Goyal, Pawan and
Jain, Sarika and
Joshi, Abhinav and
Mishra, Shivani and
Datta, Debtanu and
Paul, Shounak and
Singh, Kshetrimayum Boynao and
Kumar, Sandeep",
booktitle = "Proceedings of the 1st Workshop on NLP for Empowering Justice (JUST-NLP 2025)",
month = dec,
year = "2025",
address = "Mumbai, India",
publisher = "Association for Computational Linguistics",
url = "https://aclanthology.org/2025.justnlp-main.3/",
pages = "12--17",
ISBN = "979-8-89176-312-8",
abstract = "This paper provides an overview of the Shared Task on Legal Machine Translation (L-MT), organized as part of the JUST-NLP 2025 Workshop at IJCNLP-AACL 2025, aimed at improving the translation of legal texts, a domain where precision, structural faithfulness, and terminology preservation are essential. The training set comprises 50,000 sentences, with 5,000 sentences each for the validation and test sets. The submissions employed strategies such as domain-adaptive fine-tuning of multilingual models, QLoRA-based parameter-efficient adaptation, curriculum-guided supervised training, reinforcement learning with verifiable MT metrics, and from-scratch Transformer training. The systems were evaluated using the BLEU, METEOR, TER, chrF++, BERTScore, and COMET metrics. We also combine these metric scores into an average score (AutoRank). The top-performing system, based on a fine-tuned distilled NLLB-200 model, achieved the highest AutoRank score of 72.1. Domain adaptation consistently yielded substantial improvements over baseline models, and precision-focused rewards proved especially effective for legal MT. The findings also highlight that large multilingual Transformers can deliver accurate and reliable English-to-Hindi legal translations when carefully fine-tuned on legal data, advancing the broader goal of improving access to justice in multilingual settings."
}