Exploring In-context Example Generation for Machine Translation

Dohyun Lee; Seungil Chad Lee; Chanwoo Yang; Yujin Baek; Jaegul Choo

doi:10.18653/v1/2025.findings-acl.1362

Exploring In-context Example Generation for Machine Translation

Dohyun Lee, Seungil Chad Lee, Chanwoo Yang, Yujin Baek, Jaegul Choo

Abstract

Large language models (LLMs) have demonstrated strong performance across various tasks, leveraging their exceptional in-context learning ability with only a few examples.Accordingly, the selection of optimal in-context examples has been actively studied in the field of machine translation.However, these studies presuppose the presence of a demonstration pool with human-annotated pairs, making them less applicable to low-resource languages where such an assumption is challenging to meet.To overcome this limitation, this paper explores the research direction of in-context example generation for machine translation.Specifically, we propose Demonstration Augmentation for Translation (DAT), a simple yet effective approach that generates example pairs without relying on any external resources.This method builds upon two prior criteria, relevance and diversity, which have been highlighted in previous work as key factors for in-context example selection.Through experiments and analysis on low-resource languages where human-annotated pairs are scarce, we show that DAT achieves superior translation quality compared to the baselines.Furthermore, we investigate the potential of progressively accumulating generated pairs during test time to build and reuse a demonstration pool. Our implementation is publicly available at https://github.com/aiclaudev/DAT.

Anthology ID:: 2025.findings-acl.1362
Volume:: Findings of the Association for Computational Linguistics: ACL 2025
Month:: July
Year:: 2025
Address:: Vienna, Austria
Editors:: Wanxiang Che, Joyce Nabende, Ekaterina Shutova, Mohammad Taher Pilehvar
Venue:: Findings
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 26554–26568
Language:
URL:: https://aclanthology.org/2025.findings-acl.1362/
DOI:: 10.18653/v1/2025.findings-acl.1362
Bibkey:
Cite (ACL):: Dohyun Lee, Seungil Chad Lee, Chanwoo Yang, Yujin Baek, and Jaegul Choo. 2025. Exploring In-context Example Generation for Machine Translation. In Findings of the Association for Computational Linguistics: ACL 2025, pages 26554–26568, Vienna, Austria. Association for Computational Linguistics.
Cite (Informal):: Exploring In-context Example Generation for Machine Translation (Lee et al., Findings 2025)
Copy Citation:
PDF:: https://aclanthology.org/2025.findings-acl.1362.pdf

PDF Cite Search Fix data