Why should only High-Resource-Languages have all the fun? Pivot Based Evaluation in Low Resource Setting

Ananya Mukherjee, Saumitra Yadav, Manish Shrivastava


Abstract
Evaluating machine translation (MT) systems for low-resource languages has long been a challenge due to the limited availability of evaluation metrics and resources. As a result, researchers in this space have relied primarily on lexical-based metrics like BLEU, TER, and ChrF, which lack semantic evaluation. In this first-of-its-kind work, we propose a novel pivot-based evaluation framework that addresses these limitations; after translating low-resource language outputs into a related high-resource language, we leverage advanced neural and embedding-based metrics for more meaningful evaluation. Through a series of experiments using five low-resource languages: Assamese, Manipuri, Kannada, Bhojpuri, and Nepali, we demonstrate how this method extends the coverage of both lexical-based and embedding-based metrics, even for languages not directly supported by advanced metrics. Our results show that the differences between direct and pivot-based evaluation scores are minimal, proving that this approach is a viable and effective solution for evaluating translations in endangered and low-resource languages. This work paves the way for more inclusive, accurate, and scalable MT evaluation for underrepresented languages, marking a significant step forward in this under-explored area of research. The code and data will be made available at https://github.com/AnanyaCoder/PivotBasedEvaluation.
Anthology ID:
2025.coling-main.320
Volume:
Proceedings of the 31st International Conference on Computational Linguistics
Month:
January
Year:
2025
Address:
Abu Dhabi, UAE
Editors:
Owen Rambow, Leo Wanner, Marianna Apidianaki, Hend Al-Khalifa, Barbara Di Eugenio, Steven Schockaert
Venue:
COLING
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
4779–4788
Language:
URL:
https://aclanthology.org/2025.coling-main.320/
DOI:
Bibkey:
Cite (ACL):
Ananya Mukherjee, Saumitra Yadav, and Manish Shrivastava. 2025. Why should only High-Resource-Languages have all the fun? Pivot Based Evaluation in Low Resource Setting. In Proceedings of the 31st International Conference on Computational Linguistics, pages 4779–4788, Abu Dhabi, UAE. Association for Computational Linguistics.
Cite (Informal):
Why should only High-Resource-Languages have all the fun? Pivot Based Evaluation in Low Resource Setting (Mukherjee et al., COLING 2025)
Copy Citation:
PDF:
https://aclanthology.org/2025.coling-main.320.pdf