Coming to Terms with Glossary Enforcement: A Study of Three Approaches to Enforcing Terminology in NMT

Fred Bane, Anna Zaretskaya, Tània Blanch Miró, Celia Soler Uguet, João Torres


Abstract
Enforcing terminology constraints is less straight-forward in neural machine translation (NMT) than statistical machine translation. Current methods, such as alignment-based insertion or the use of factors or special tokens, each have their strengths and drawbacks. We describe the current state of research on terminology enforcement in transformer-based NMT models, and present the results of our investigation into the performance of three different approaches. In addition to reference based quality metrics, we also evaluate the linguistic quality of the translations thus produced. Our results show that each approach is effective, though a negative impact on translation fluency remains evident.
Anthology ID:
2023.eamt-1.34
Volume:
Proceedings of the 24th Annual Conference of the European Association for Machine Translation
Month:
June
Year:
2023
Address:
Tampere, Finland
Editors:
Mary Nurminen, Judith Brenner, Maarit Koponen, Sirkku Latomaa, Mikhail Mikhailov, Frederike Schierl, Tharindu Ranasinghe, Eva Vanmassenhove, Sergi Alvarez Vidal, Nora Aranberri, Mara Nunziatini, Carla Parra Escartín, Mikel Forcada, Maja Popovic, Carolina Scarton, Helena Moniz
Venue:
EAMT
SIG:
Publisher:
European Association for Machine Translation
Note:
Pages:
345–353
Language:
URL:
https://aclanthology.org/2023.eamt-1.34
DOI:
Bibkey:
Cite (ACL):
Fred Bane, Anna Zaretskaya, Tània Blanch Miró, Celia Soler Uguet, and João Torres. 2023. Coming to Terms with Glossary Enforcement: A Study of Three Approaches to Enforcing Terminology in NMT. In Proceedings of the 24th Annual Conference of the European Association for Machine Translation, pages 345–353, Tampere, Finland. European Association for Machine Translation.
Cite (Informal):
Coming to Terms with Glossary Enforcement: A Study of Three Approaches to Enforcing Terminology in NMT (Bane et al., EAMT 2023)
Copy Citation:
PDF:
https://aclanthology.org/2023.eamt-1.34.pdf