Exploring the Effectiveness of LLM Domain Adaptation for Business IT Machine Translation

Johannes Eschbach-Dymanus; Frank Essenberger; Bianka Buschbeck; Miriam Exel

Exploring the Effectiveness of LLM Domain Adaptation for Business IT Machine Translation

Johannes Eschbach-Dymanus, Frank Essenberger, Bianka Buschbeck, Miriam Exel

Abstract

In this paper, we study the translation abilities of Large Language Models (LLMs) for business IT texts.We are strongly interested in domain adaptation of translation systems, which is essential for accurate and lexically appropriate translation of such texts.Among the open-source models evaluated in a zero- and few-shot setting, we find Llama-2 13B the most promising for domain-specific translation fine-tuning.We investigate the full range of adaptation techniques for LLMs: from prompting, over parameter-efficient fine-tuning to full fine-tuning, and compare to classic neural machine translation (MT) models trained internally at SAP.We provide guidance how to use training budget most effectively for different fine-tuning approaches.We observe that while LLMs can translate on-par with SAP’s MT models on general domain data, it is difficult to close the gap on SAP’s domain-specific data, even with extensive training and carefully curated data.

Anthology ID:: 2024.eamt-1.51
Volume:: Proceedings of the 25th Annual Conference of the European Association for Machine Translation (Volume 1)
Month:: June
Year:: 2024
Address:: Sheffield, UK
Editors:: Carolina Scarton, Charlotte Prescott, Chris Bayliss, Chris Oakley, Joanna Wright, Stuart Wrigley, Xingyi Song, Edward Gow-Smith, Rachel Bawden, Víctor M Sánchez-Cartagena, Patrick Cadwell, Ekaterina Lapshinova-Koltunski, Vera Cabarrão, Konstantinos Chatzitheodorou, Mary Nurminen, Diptesh Kanojia, Helena Moniz
Venue:: EAMT
SIG:
Publisher:: European Association for Machine Translation (EAMT)
Note:
Pages:: 610–622
Language:
URL:: https://aclanthology.org/2024.eamt-1.51
DOI:
Bibkey:
Cite (ACL):: Johannes Eschbach-Dymanus, Frank Essenberger, Bianka Buschbeck, and Miriam Exel. 2024. Exploring the Effectiveness of LLM Domain Adaptation for Business IT Machine Translation. In Proceedings of the 25th Annual Conference of the European Association for Machine Translation (Volume 1), pages 610–622, Sheffield, UK. European Association for Machine Translation (EAMT).
Cite (Informal):: Exploring the Effectiveness of LLM Domain Adaptation for Business IT Machine Translation (Eschbach-Dymanus et al., EAMT 2024)
Copy Citation:
PDF:: https://aclanthology.org/2024.eamt-1.51.pdf

PDF Cite Search