@inproceedings{pistotti-etal-2025-benefits,
title = "The Benefits of Being Uncertain: Perplexity as a Signal for Naturalness in Multilingual Machine Translation",
author = "Pistotti, Timothy and
Witbrock, Michael J. and
O{'}Leary, Padriac Amato Tahua and
Brown, Jason",
booktitle = "Proceedings of the 2nd Workshop on Uncertainty-Aware NLP (UncertaiNLP 2025)",
month = nov,
year = "2025",
address = "Suzhou, China",
publisher = "Association for Computational Linguistics",
url = "https://aclanthology.org/2025.uncertainlp-main.7/",
pages = "61--65",
ISBN = "979-8-89176-349-4",
abstract = "Model-internal uncertainty metrics like perplexity potentially offer low-cost signals for Machine Translation Quality Estimation (TQE). This paper analyses perplexity in the No Language Left Behind (NLLB) multilingual model. We quantify a significant model-human perplexity gap, where the model is consistently more confident in its own, often literal, machine-generated translation than in diverse, high-quality human versions. We then demonstrate that the utility of perplexity as a TQE signal is highly context-dependent, being strongest for low-resource pairs. Finally, we present an illustrative case study where a flawed translation is refined by providing potentially useful information in a targeted prompt, simulating a knowledge-based repair. We show that as the translation{'}s quality and naturalness improve (a +0.15 COMET score increase), its perplexity also increases, challenging the simple assumption that lower perplexity indicates higher quality and motivating a more nuanced view of uncertainty as signalling a text{'}s departure from rigid translationese."
}
<?xml version="1.0" encoding="UTF-8"?>
<modsCollection xmlns="http://www.loc.gov/mods/v3">
<mods ID="pistotti-etal-2025-benefits">
<titleInfo>
<title>The Benefits of Being Uncertain: Perplexity as a Signal for Naturalness in Multilingual Machine Translation</title>
</titleInfo>
<name type="personal">
<namePart type="given">Timothy</namePart>
<namePart type="family">Pistotti</namePart>
<role>
<roleTerm authority="marcrelator" type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Michael</namePart>
<namePart type="family">J. Witbrock</namePart>
<role>
<roleTerm authority="marcrelator" type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Dr</namePart>
<namePart type="family">Padriac Amato Tahua O’Leary</namePart>
<role>
<roleTerm authority="marcrelator" type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Jason</namePart>
<namePart type="family">Brown</namePart>
<role>
<roleTerm authority="marcrelator" type="text">author</roleTerm>
</role>
</name>
<originInfo>
<dateIssued>2025-11</dateIssued>
</originInfo>
<typeOfResource>text</typeOfResource>
<relatedItem type="host">
<titleInfo>
<title>Proceedings of the 2nd Workshop on Uncertainty-Aware NLP (UncertaiNLP 2025)</title>
</titleInfo>
<name type="personal">
<namePart type="given">Noidea</namePart>
<namePart type="family">Noidea</namePart>
<role>
<roleTerm authority="marcrelator" type="text">editor</roleTerm>
</role>
</name>
<originInfo>
<publisher>Association for Computational Linguistics</publisher>
<place>
<placeTerm type="text">Suzhou, China</placeTerm>
</place>
</originInfo>
<genre authority="marcgt">conference publication</genre>
<identifier type="isbn">979-8-89176-349-4</identifier>
</relatedItem>
<abstract>Model-internal uncertainty metrics like perplexity potentially offer low-cost signals for Machine Translation Quality Estimation (TQE). This paper analyses perplexity in the No Language Left Behind (NLLB) multilingual model. We quantify a significant model-human perplexity gap, where the model is consistently more confident in its own, often literal, machine-generated translation than in diverse, high-quality human versions. We then demonstrate that the utility of perplexity as a TQE signal is highly context-dependent, being strongest for low-resource pairs. Finally, we present an illustrative case study where a flawed translation is refined by providing potentially useful information in a targeted prompt, simulating a knowledge-based repair. We show that as the translation’s quality and naturalness improve (a +0.15 COMET score increase), its perplexity also increases, challenging the simple assumption that lower perplexity indicates higher quality and motivating a more nuanced view of uncertainty as signalling a text’s departure from rigid translationese.</abstract>
<identifier type="citekey">pistotti-etal-2025-benefits</identifier>
<location>
<url>https://aclanthology.org/2025.uncertainlp-main.7/</url>
</location>
<part>
<date>2025-11</date>
<extent unit="page">
<start>61</start>
<end>65</end>
</extent>
</part>
</mods>
</modsCollection>
%0 Conference Proceedings
%T The Benefits of Being Uncertain: Perplexity as a Signal for Naturalness in Multilingual Machine Translation
%A Pistotti, Timothy
%A Witbrock, Michael J.
%A O’Leary, Padriac Amato Tahua
%A Brown, Jason
%S Proceedings of the 2nd Workshop on Uncertainty-Aware NLP (UncertaiNLP 2025)
%D 2025
%8 November
%I Association for Computational Linguistics
%C Suzhou, China
%@ 979-8-89176-349-4
%F pistotti-etal-2025-benefits
%X Model-internal uncertainty metrics like perplexity potentially offer low-cost signals for Machine Translation Quality Estimation (TQE). This paper analyses perplexity in the No Language Left Behind (NLLB) multilingual model. We quantify a significant model-human perplexity gap, where the model is consistently more confident in its own, often literal, machine-generated translation than in diverse, high-quality human versions. We then demonstrate that the utility of perplexity as a TQE signal is highly context-dependent, being strongest for low-resource pairs. Finally, we present an illustrative case study where a flawed translation is refined by providing potentially useful information in a targeted prompt, simulating a knowledge-based repair. We show that as the translation’s quality and naturalness improve (a +0.15 COMET score increase), its perplexity also increases, challenging the simple assumption that lower perplexity indicates higher quality and motivating a more nuanced view of uncertainty as signalling a text’s departure from rigid translationese.
%U https://aclanthology.org/2025.uncertainlp-main.7/
%P 61-65
Markdown (Informal)
[The Benefits of Being Uncertain: Perplexity as a Signal for Naturalness in Multilingual Machine Translation](https://aclanthology.org/2025.uncertainlp-main.7/) (Pistotti et al., UncertaiNLP 2025)
ACL
Timothy Pistotti, Michael J. Witbrock, Padriac Amato Tahua O’Leary, and Jason Brown. 2025. The Benefits of Being Uncertain: Perplexity as a Signal for Naturalness in Multilingual Machine Translation. In Proceedings of the 2nd Workshop on Uncertainty-Aware NLP (UncertaiNLP 2025), pages 61–65, Suzhou, China. Association for Computational Linguistics.
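The abstract's central quantity is the perplexity that a multilingual MT model assigns to a candidate translation. As a rough illustration of how such a score can be obtained (a minimal sketch, not the paper's own code), the snippet below force-decodes a translation under an NLLB checkpoint via Hugging Face transformers and returns the exponential of the mean per-token negative log-likelihood. The checkpoint name, language codes, and example sentences are assumptions chosen for illustration; the paper's exact scoring setup may differ.

import torch
from transformers import AutoModelForSeq2SeqLM, AutoTokenizer

# Assumed checkpoint and language pair; the paper may use a different NLLB variant.
checkpoint = "facebook/nllb-200-distilled-600M"
tokenizer = AutoTokenizer.from_pretrained(
    checkpoint, src_lang="eng_Latn", tgt_lang="fra_Latn"
)
model = AutoModelForSeq2SeqLM.from_pretrained(checkpoint).eval()

def perplexity(source: str, translation: str) -> float:
    """Score `translation` given `source`: exp of the mean token NLL."""
    enc = tokenizer(source, return_tensors="pt")
    labels = tokenizer(text_target=translation, return_tensors="pt").input_ids
    with torch.no_grad():
        # out.loss is the mean cross-entropy over target tokens.
        out = model(**enc, labels=labels)
    return torch.exp(out.loss).item()

# Comparing the model's own literal output against a freer human rendering
# would exhibit the model-human perplexity gap the abstract describes:
# the lower score marks the version the model is more "confident" in.
print(perplexity("The weather is nice today.", "Il fait beau aujourd'hui."))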