Dissecting In-Context Learning of Translations in GPT-3

Vikas Raunak, Arul Menezes, Hany Awadalla


Abstract
Most of the recent work in leveraging Large Language Models (LLMs) such as GPT-3 for Machine Translation (MT) has focused on selecting the few-shot samples for prompting. In this work, we try to better understand the role of demonstration attributes for the in-context learning of translations through perturbations of high-quality, in-domain demonstrations. We find that asymmetric perturbation of the source-target mappings yield vastly different results. We show that the perturbation of the source side has surprisingly little impact, while target perturbation can drastically reduce translation quality, suggesting that it is the output text distribution that provides the most important learning signal during in-context learning of translations. We propose a method named Zero-Shot-Context to add this signal automatically in Zero-Shot prompting. We demonstrate that it improves upon the zero-shot translation performance of GPT-3, even making it competitive with few-shot prompted translations.
Anthology ID:
2023.findings-emnlp.61
Volume:
Findings of the Association for Computational Linguistics: EMNLP 2023
Month:
December
Year:
2023
Address:
Singapore
Editors:
Houda Bouamor, Juan Pino, Kalika Bali
Venue:
Findings
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
866–872
Language:
URL:
https://aclanthology.org/2023.findings-emnlp.61
DOI:
10.18653/v1/2023.findings-emnlp.61
Bibkey:
Cite (ACL):
Vikas Raunak, Arul Menezes, and Hany Awadalla. 2023. Dissecting In-Context Learning of Translations in GPT-3. In Findings of the Association for Computational Linguistics: EMNLP 2023, pages 866–872, Singapore. Association for Computational Linguistics.
Cite (Informal):
Dissecting In-Context Learning of Translations in GPT-3 (Raunak et al., Findings 2023)
Copy Citation:
PDF:
https://aclanthology.org/2023.findings-emnlp.61.pdf