Detecting and Translating Language Ambiguity with Multilingual LLMs

Behrang Mehrparvar, Sandro Pezzelle


Abstract
Most languages could be ambiguous, which means the same conveyed text or speech, results in different actions by different readers or listeners. In this project, we propose a method to detect the ambiguity of a sentence using translation by multilingual LLMs. In particular, we hypothesize that a good machine translator should preserve the ambiguity of sentences in all target languages. Therefore, we investigate whether ambiguity is encoded in the hidden representation of a translation model or, instead, if only a single meaning is encoded. In our experiments, we have been able to predict ambiguity of sentences with high accuracy using machine translation without direct use of semantics and only based on the reconstruction error of a function that maps the forward and backward translation hidden representations to each other. The potential applications of the proposed approach span i) detecting ambiguous sentences, ii) fine-tuning existing multilingual LLMs to preserve ambiguous information, and iii) developing AI systems that can generate ambiguity-free languages when needed.
Anthology ID:
2024.mrl-1.26
Volume:
Proceedings of the Fourth Workshop on Multilingual Representation Learning (MRL 2024)
Month:
November
Year:
2024
Address:
Miami, Florida, USA
Editors:
Jonne Sälevä, Abraham Owodunni
Venue:
MRL
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
310–323
Language:
URL:
https://aclanthology.org/2024.mrl-1.26
DOI:
Bibkey:
Cite (ACL):
Behrang Mehrparvar and Sandro Pezzelle. 2024. Detecting and Translating Language Ambiguity with Multilingual LLMs. In Proceedings of the Fourth Workshop on Multilingual Representation Learning (MRL 2024), pages 310–323, Miami, Florida, USA. Association for Computational Linguistics.
Cite (Informal):
Detecting and Translating Language Ambiguity with Multilingual LLMs (Mehrparvar & Pezzelle, MRL 2024)
Copy Citation:
PDF:
https://aclanthology.org/2024.mrl-1.26.pdf