Empowering cross-lingual abilities of instruction-tuned large language models by translation-following demonstrations

Leonardo Ranaldi, Giulia Pucci, Andre Freitas


Abstract
The language ability of Large Language Models (LLMs) is often unbalanced towards English because of the imbalance in the distribution of the pre-training data. This disparity carries over into further fine-tuning and affects the cross-lingual abilities of LLMs. In this paper, we propose to empower Instruction-tuned LLMs (It-LLMs) in languages other than English by building semantic alignment between languages. Hence, we propose CrossAlpaca, an It-LLM tuned with cross-lingual Instruction-following and Translation-following demonstrations to improve semantic alignment between languages. We validate our approach on the multilingual Question Answering (QA) benchmarks XQUAD and MLQA and on adapted versions of MMLU and BBH. Our models, tested over six different languages, outperform the It-LLMs tuned on monolingual data. The final results show that instruction tuning on non-English data is not enough and that semantic alignment can be further improved by Translation-following demonstrations.
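To illustrate the kind of training data the abstract refers to, a translation-following demonstration can be written in the same instruction/input/response format used for ordinary instruction-following examples. The sketch below is only an assumption for illustration: the Alpaca-style template, the field names, and the `format_demo` helper are hypothetical and not taken from the paper.

```python
# Hypothetical sketch: pairing an instruction-following demonstration with a
# translation-following demonstration in an Alpaca-style format (template and
# field names assumed, not taken from the paper).

def format_demo(instruction: str, input_text: str, output: str) -> str:
    """Render one supervised demonstration as a single training prompt."""
    return (
        "### Instruction:\n" + instruction + "\n\n"
        "### Input:\n" + input_text + "\n\n"
        "### Response:\n" + output
    )

# Instruction-following demonstration in the target language (e.g., Italian).
instruction_demo = format_demo(
    instruction="Elenca tre fonti di energia rinnovabile.",
    input_text="",
    output="Solare, eolica e idroelettrica.",
)

# Translation-following demonstration: the model is asked to translate between
# English and the target language, which is intended to encourage semantic
# alignment between the two languages.
translation_demo = format_demo(
    instruction="Translate the following sentence from English to Italian.",
    input_text="Renewable energy sources include solar and wind power.",
    output="Le fonti di energia rinnovabile includono l'energia solare ed eolica.",
)

print(instruction_demo)
print(translation_demo)
```

Both kinds of demonstrations would then be mixed into the supervised fine-tuning data, so the model sees target-language instructions alongside explicit English-to-target translation tasks.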
Anthology ID:
2024.findings-acl.473
Volume:
Findings of the Association for Computational Linguistics ACL 2024
Month:
August
Year:
2024
Address:
Bangkok, Thailand and virtual meeting
Editors:
Lun-Wei Ku, Andre Martins, Vivek Srikumar
Venue:
Findings
Publisher:
Association for Computational Linguistics
Pages:
7961–7973
URL:
https://aclanthology.org/2024.findings-acl.473
Cite (ACL):
Leonardo Ranaldi, Giulia Pucci, and Andre Freitas. 2024. Empowering cross-lingual abilities of instruction-tuned large language models by translation-following demonstrations. In Findings of the Association for Computational Linguistics ACL 2024, pages 7961–7973, Bangkok, Thailand and virtual meeting. Association for Computational Linguistics.
Cite (Informal):
Empowering cross-lingual abilities of instruction-tuned large language models by translation-following demonstrations (Ranaldi et al., Findings 2024)
PDF:
https://aclanthology.org/2024.findings-acl.473.pdf