Enhancing Few-shot Cross-lingual Transfer with Target Language Peculiar Examples

Hwichan Kim; Mamoru Komachi

doi:10.18653/v1/2023.findings-acl.47

Abstract

Few-shot cross-lingual transfer, fine-tuning a Multilingual Masked Language Model (MMLM) with source language labeled data and a small amount of target language labeled data, provides excellent performance in the target language. However, if no labeled data in the target language are available, they need to be created through human annotations. In this study, we devise a metric to select annotation candidates from an unlabeled data pool that efficiently enhance accuracy for few-shot cross-lingual transfer. It is known that training a model with hard examples is important to improve the model’s performance. Therefore, we first identify examples that the MMLM cannot solve in a zero-shot cross-lingual transfer setting and demonstrate that it is hard to predict peculiar examples in the target language, i.e., the examples distant from the source language examples in the cross-lingual semantic space of the MMLM. We then choose high-peculiarity examples as annotation candidates and perform few-shot cross-lingual transfer. In comprehensive experiments with 20 languages and 6 tasks, we demonstrate that the high-peculiarity examples improve the target language accuracy compared to other candidate selection methods proposed in previous studies.
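The selection idea in the abstract, ranking unlabeled target language examples by how far they lie from source language examples in the embedding space, can be illustrated with a minimal sketch. The abstract does not give the exact formula, so the scoring used here (mean cosine distance to the `k` nearest source embeddings) is an assumption, as are the function names `peculiarity_scores` and `select_annotation_candidates` and the parameters `k` and `budget`. The embeddings are assumed to be precomputed sentence vectors from the MMLM.

```python
import numpy as np


def peculiarity_scores(target_emb, source_emb, k=3):
    """Score each target example by its distance from the source examples.

    Assumption: peculiarity is approximated as the mean cosine distance to
    the k nearest source embeddings. The abstract only says "distant from
    the source language examples", so this is one plausible reading.
    """
    target_emb = np.asarray(target_emb, dtype=float)
    source_emb = np.asarray(source_emb, dtype=float)
    # L2-normalize rows so that a dot product equals cosine similarity.
    t = target_emb / np.linalg.norm(target_emb, axis=1, keepdims=True)
    s = source_emb / np.linalg.norm(source_emb, axis=1, keepdims=True)
    dist = 1.0 - t @ s.T  # shape: (n_target, n_source)
    k = min(k, s.shape[0])
    # Sort each row ascending and keep the k smallest distances.
    nearest = np.sort(dist, axis=1)[:, :k]
    return nearest.mean(axis=1)


def select_annotation_candidates(target_emb, source_emb, budget, k=3):
    """Return indices of the `budget` highest-scoring target examples."""
    scores = peculiarity_scores(target_emb, source_emb, k=k)
    order = np.argsort(-scores, kind="stable")  # highest score first
    return order[:budget].tolist()


if __name__ == "__main__":
    # Toy 2-d vectors standing in for MMLM sentence embeddings.
    source = [[1.0, 0.0], [0.9, 0.1]]
    target = [[1.0, 0.0], [0.0, 1.0], [-1.0, 0.0]]
    print(select_annotation_candidates(target, source, budget=2, k=1))
```

In this toy setup the first target vector coincides with a source vector and is therefore never selected, while the two vectors pointing away from the source cluster are chosen for annotation. In practice the selected indices would be sent to human annotators and the resulting labels used for the few-shot fine-tuning step.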

Anthology ID: 2023.findings-acl.47
Volume: Findings of the Association for Computational Linguistics: ACL 2023
Month: July
Year: 2023
Address: Toronto, Canada
Editors: Anna Rogers, Jordan Boyd-Graber, Naoaki Okazaki
Venue: Findings
Publisher: Association for Computational Linguistics
Pages: 747–767
URL: https://aclanthology.org/2023.findings-acl.47
DOI: 10.18653/v1/2023.findings-acl.47
Cite (ACL): Hwichan Kim and Mamoru Komachi. 2023. Enhancing Few-shot Cross-lingual Transfer with Target Language Peculiar Examples. In Findings of the Association for Computational Linguistics: ACL 2023, pages 747–767, Toronto, Canada. Association for Computational Linguistics.
Cite (Informal): Enhancing Few-shot Cross-lingual Transfer with Target Language Peculiar Examples (Kim & Komachi, Findings 2023)
PDF: https://aclanthology.org/2023.findings-acl.47.pdf