Quantifying the Contribution of MWEs and Polysemy in Translation Errors for English–Igbo MT

Adaeze Ohuoba, Serge Sharoff, Callum Walker


Abstract
In spite of recent successes in improving Machine Translation (MT) quality overall, MT engines require a large amount of resources, which leads to markedly lower quality for lesser-resourced languages. This study explores the case of translation from English into Igbo, a very low-resource language spoken by about 45 million people. With the aim of improving MT quality in this scenario, we investigate methods for guided detection of critical or harmful MT errors, specifically those caused by non-compositional multi-word expressions (MWEs) and polysemy. We have designed diagnostic tests for these cases and applied them to collections of medical texts from CDC, Cochrane, NCDC, NHS and WHO.
Anthology ID:
2024.eamt-1.43
Volume:
Proceedings of the 25th Annual Conference of the European Association for Machine Translation (Volume 1)
Month:
June
Year:
2024
Address:
Sheffield, UK
Editors:
Carolina Scarton, Charlotte Prescott, Chris Bayliss, Chris Oakley, Joanna Wright, Stuart Wrigley, Xingyi Song, Edward Gow-Smith, Rachel Bawden, Víctor M Sánchez-Cartagena, Patrick Cadwell, Ekaterina Lapshinova-Koltunski, Vera Cabarrão, Konstantinos Chatzitheodorou, Mary Nurminen, Diptesh Kanojia, Helena Moniz
Venue:
EAMT
Publisher:
European Association for Machine Translation (EAMT)
Pages:
537–547
URL:
https://aclanthology.org/2024.eamt-1.43
Cite (ACL):
Adaeze Ohuoba, Serge Sharoff, and Callum Walker. 2024. Quantifying the Contribution of MWEs and Polysemy in Translation Errors for English–Igbo MT. In Proceedings of the 25th Annual Conference of the European Association for Machine Translation (Volume 1), pages 537–547, Sheffield, UK. European Association for Machine Translation (EAMT).
Cite (Informal):
Quantifying the Contribution of MWEs and Polysemy in Translation Errors for English–Igbo MT (Ohuoba et al., EAMT 2024)
PDF:
https://aclanthology.org/2024.eamt-1.43.pdf