Dictionary-Aided Translation for Handling Multi-Word Expressions in Low-Resource Languages

Antonios Dimakis, Stella Markantonatou, Antonios Anastasopoulos


Abstract
Multi-word expressions (MWEs) present unique challenges in natural language processing (NLP), particularly within the context of translation systems, due to their inherent scarcity, non-compositional nature, and other distinct lexical and morphosyntactic characteristics, issues that are exacerbated in low-resource settings.In this study, we elucidate and attempt to address these challenges by leveraging a substantial corpus of human-annotated Greek MWEs. To address the complexity of translating such phrases, we propose a novel method leveraging an available out-of-context lexicon.We assess the translation capabilities of current state-of-the-art systems on this task, employing both automated metrics and human evaluators.We find that by using our method when applicable, the performance of current systems can be significantly improved, however these models are still unable to produce translations comparable to those of a human speaker.
Anthology ID:
2024.findings-acl.152
Volume:
Findings of the Association for Computational Linguistics ACL 2024
Month:
August
Year:
2024
Address:
Bangkok, Thailand and virtual meeting
Editors:
Lun-Wei Ku, Andre Martins, Vivek Srikumar
Venue:
Findings
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
2588–2595
Language:
URL:
https://aclanthology.org/2024.findings-acl.152
DOI:
Bibkey:
Cite (ACL):
Antonios Dimakis, Stella Markantonatou, and Antonios Anastasopoulos. 2024. Dictionary-Aided Translation for Handling Multi-Word Expressions in Low-Resource Languages. In Findings of the Association for Computational Linguistics ACL 2024, pages 2588–2595, Bangkok, Thailand and virtual meeting. Association for Computational Linguistics.
Cite (Informal):
Dictionary-Aided Translation for Handling Multi-Word Expressions in Low-Resource Languages (Dimakis et al., Findings 2024)
Copy Citation:
PDF:
https://aclanthology.org/2024.findings-acl.152.pdf