Getting More from Less: Large Language Models are Good Spontaneous Multilingual Learners

Shimao Zhang; Changjiang Gao; Wenhao Zhu; Jiajun Chen; Xin Huang; Xue Han; Junlan Feng; Chao Deng; Shujian Huang

Abstract

Recently, Large Language Models (LLMs) have shown impressive language capabilities, but most of them perform very unevenly across languages. Multilingual alignment based on translation parallel data is an effective method for enhancing LLMs’ multilingual capabilities. In this work, we first discover and comprehensively investigate the spontaneous multilingual alignment of LLMs. We find that instruction-tuning LLMs on question translation data (i.e., without annotated answers) encourages alignment between English and a wide range of languages, including languages unseen during instruction-tuning. Additionally, we use different settings and mechanistic interpretability methods to comprehensively analyze LLMs’ performance in multilingual scenarios. Our work suggests that LLMs have enormous potential for improving multilingual alignment efficiently, with strong generalization across both languages and tasks.

Anthology ID: 2024.emnlp-main.457
Volume: Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing
Month: November
Year: 2024
Address: Miami, Florida, USA
Editors: Yaser Al-Onaizan, Mohit Bansal, Yun-Nung Chen
Venue: EMNLP
Publisher: Association for Computational Linguistics
Pages: 8037–8051
URL: https://aclanthology.org/2024.emnlp-main.457
Cite (ACL): Shimao Zhang, Changjiang Gao, Wenhao Zhu, Jiajun Chen, Xin Huang, Xue Han, Junlan Feng, Chao Deng, and Shujian Huang. 2024. Getting More from Less: Large Language Models are Good Spontaneous Multilingual Learners. In Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, pages 8037–8051, Miami, Florida, USA. Association for Computational Linguistics.
Cite (Informal): Getting More from Less: Large Language Models are Good Spontaneous Multilingual Learners (Zhang et al., EMNLP 2024)
PDF: https://aclanthology.org/2024.emnlp-main.457.pdf
