Instance-Based Domain Adaptation for Improving Terminology Translation

Prashanth Nayak, John Kelleher, Rejwanul Haque, Andy Way


Abstract
Terms are essential indicators of a domain, and domain term translation is dealt with priority in any translation workflow. Translation service providers who use machine translation (MT) expect term translation to be unambiguous and consistent with the context and domain in question. Although current state-of-the-art neural MT (NMT) models are able to produce high-quality translations for many languages, they are still not at the level required when it comes to translating domain-specific terms. This study presents a terminology-aware instance- based adaptation method for improving terminology translation in NMT. We conducted our experiments for French-to-English and found that our proposed approach achieves a statistically significant improvement over the baseline NMT system in translating domain-specific terms. Specifically, the translation of multi-word terms is improved by 6.7% compared to the strong baseline.
Anthology ID:
2023.mtsummit-research.19
Volume:
Proceedings of Machine Translation Summit XIX, Vol. 1: Research Track
Month:
September
Year:
2023
Address:
Macau SAR, China
Editors:
Masao Utiyama, Rui Wang
Venue:
MTSummit
SIG:
Publisher:
Asia-Pacific Association for Machine Translation
Note:
Pages:
222–234
Language:
URL:
https://aclanthology.org/2023.mtsummit-research.19
DOI:
Bibkey:
Cite (ACL):
Prashanth Nayak, John Kelleher, Rejwanul Haque, and Andy Way. 2023. Instance-Based Domain Adaptation for Improving Terminology Translation. In Proceedings of Machine Translation Summit XIX, Vol. 1: Research Track, pages 222–234, Macau SAR, China. Asia-Pacific Association for Machine Translation.
Cite (Informal):
Instance-Based Domain Adaptation for Improving Terminology Translation (Nayak et al., MTSummit 2023)
Copy Citation:
PDF:
https://aclanthology.org/2023.mtsummit-research.19.pdf