On Translating Technical Terminology: A Translation Workflow for Machine-Translated Acronyms

Richard Yue; John Ortega; Kenneth Church

Abstract

The typical workflow of a professional translator who translates a document from its source language (SL) into a target language (TL) is not always focused on what many language models in natural language processing (NLP) do: predict the next word in a series of words. While translation between high-resource languages like English and French is reported to achieve near human parity on common metrics such as BLEU and COMET, we find that an important step is being missed: the translation of technical terms, specifically acronyms. Some publicly available state-of-the-art machine translation (MT) systems, such as Google Translate, can be erroneous when dealing with acronyms, with errors in as much as 50% of cases in our findings. This article addresses acronym disambiguation for MT systems by proposing an additional step in the SL-TL (FR-EN) translation workflow: we first offer a new acronym corpus for public consumption and then experiment with a search-based thresholding algorithm that achieves a nearly 10% increase when compared to Google Translate and OpusMT.

Anthology ID: 2024.amta-research.6
Volume: Proceedings of the 16th Conference of the Association for Machine Translation in the Americas (Volume 1: Research Track)
Month: September
Year: 2024
Address: Chicago, USA
Editors: Rebecca Knowles, Akiko Eriguchi, Shivali Goel
Venue: AMTA
Publisher: Association for Machine Translation in the Americas
Pages: 48–54
URL: https://aclanthology.org/2024.amta-research.6/
Cite (ACL): Richard Yue, John Ortega, and Kenneth Church. 2024. On Translating Technical Terminology: A Translation Workflow for Machine-Translated Acronyms. In Proceedings of the 16th Conference of the Association for Machine Translation in the Americas (Volume 1: Research Track), pages 48–54, Chicago, USA. Association for Machine Translation in the Americas.
Cite (Informal): On Translating Technical Terminology: A Translation Workflow for Machine-Translated Acronyms (Yue et al., AMTA 2024)
PDF: https://aclanthology.org/2024.amta-research.6.pdf
