BibTeX
@inproceedings{lee-etal-2025-prom,
title = "{PROM}: Pivoted and Regulated Optimization for Multilingual Instruction Learning",
author = "Lee, Jaeseong and
Hwang, Seung-won and
Lee, Hojin and
Bak, Yunju and
Lee, Changmin",
editor = "Chiruzzo, Luis and
Ritter, Alan and
Wang, Lu",
booktitle = "Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 2: Short Papers)",
month = apr,
year = "2025",
address = "Albuquerque, New Mexico",
publisher = "Association for Computational Linguistics",
url = "https://aclanthology.org/2025.naacl-short.19/",
doi = "10.18653/v1/2025.naacl-short.19",
pages = "222--228",
ISBN = "979-8-89176-190-2",
abstract = "Large language models (LLMs) have become standard for natural language generation tasks, with instruction-tuning enhancing their capabilities. However, the lack of instruction-tuning datasets in languages other than English limits their application to diverse languages. To address this, researchers have adapted English-centric LLMs to other languages by appending English tuning data with its translated pair, from which we observe negative interference between the two. To resolve this, our contribution is identifying English as an internal pivot language, based on which we disentangle the roles of English and target language data in training. Specifically, we first design two roles as pivoted objectives, and also propose to regulate between the two, to better generalize for under-represented languages. Experiments across various languages demonstrate the effectiveness of our approach on multiple benchmarks. The code is publicly available for further exploration."
}

MODS XML
<?xml version="1.0" encoding="UTF-8"?>
<modsCollection xmlns="http://www.loc.gov/mods/v3">
  <mods ID="lee-etal-2025-prom">
    <titleInfo>
      <title>PROM: Pivoted and Regulated Optimization for Multilingual Instruction Learning</title>
    </titleInfo>
    <name type="personal">
      <namePart type="given">Jaeseong</namePart>
      <namePart type="family">Lee</namePart>
      <role>
        <roleTerm authority="marcrelator" type="text">author</roleTerm>
      </role>
    </name>
    <name type="personal">
      <namePart type="given">Seung-won</namePart>
      <namePart type="family">Hwang</namePart>
      <role>
        <roleTerm authority="marcrelator" type="text">author</roleTerm>
      </role>
    </name>
    <name type="personal">
      <namePart type="given">Hojin</namePart>
      <namePart type="family">Lee</namePart>
      <role>
        <roleTerm authority="marcrelator" type="text">author</roleTerm>
      </role>
    </name>
    <name type="personal">
      <namePart type="given">Yunju</namePart>
      <namePart type="family">Bak</namePart>
      <role>
        <roleTerm authority="marcrelator" type="text">author</roleTerm>
      </role>
    </name>
    <name type="personal">
      <namePart type="given">Changmin</namePart>
      <namePart type="family">Lee</namePart>
      <role>
        <roleTerm authority="marcrelator" type="text">author</roleTerm>
      </role>
    </name>
    <originInfo>
      <dateIssued>2025-04</dateIssued>
    </originInfo>
    <typeOfResource>text</typeOfResource>
    <relatedItem type="host">
      <titleInfo>
        <title>Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 2: Short Papers)</title>
      </titleInfo>
      <name type="personal">
        <namePart type="given">Luis</namePart>
        <namePart type="family">Chiruzzo</namePart>
        <role>
          <roleTerm authority="marcrelator" type="text">editor</roleTerm>
        </role>
      </name>
      <name type="personal">
        <namePart type="given">Alan</namePart>
        <namePart type="family">Ritter</namePart>
        <role>
          <roleTerm authority="marcrelator" type="text">editor</roleTerm>
        </role>
      </name>
      <name type="personal">
        <namePart type="given">Lu</namePart>
        <namePart type="family">Wang</namePart>
        <role>
          <roleTerm authority="marcrelator" type="text">editor</roleTerm>
        </role>
      </name>
      <originInfo>
        <publisher>Association for Computational Linguistics</publisher>
        <place>
          <placeTerm type="text">Albuquerque, New Mexico</placeTerm>
        </place>
      </originInfo>
      <genre authority="marcgt">conference publication</genre>
      <identifier type="isbn">979-8-89176-190-2</identifier>
    </relatedItem>
    <abstract>Large language models (LLMs) have become standard for natural language generation tasks, with instruction-tuning enhancing their capabilities. However, the lack of instruction-tuning datasets in languages other than English limits their application to diverse languages. To address this, researchers have adapted English-centric LLMs to other languages by appending English tuning data with its translated pair, from which we observe negative interference between the two. To resolve this, our contribution is identifying English as an internal pivot language, based on which we disentangle the roles of English and target language data in training. Specifically, we first design two roles as pivoted objectives, and also propose to regulate between the two, to better generalize for under-represented languages. Experiments across various languages demonstrate the effectiveness of our approach on multiple benchmarks. The code is publicly available for further exploration.</abstract>
    <identifier type="citekey">lee-etal-2025-prom</identifier>
    <identifier type="doi">10.18653/v1/2025.naacl-short.19</identifier>
    <location>
      <url>https://aclanthology.org/2025.naacl-short.19/</url>
    </location>
    <part>
      <date>2025-04</date>
      <extent unit="page">
        <start>222</start>
        <end>228</end>
      </extent>
    </part>
  </mods>
</modsCollection>

Endnote
%0 Conference Proceedings
%T PROM: Pivoted and Regulated Optimization for Multilingual Instruction Learning
%A Lee, Jaeseong
%A Hwang, Seung-won
%A Lee, Hojin
%A Bak, Yunju
%A Lee, Changmin
%Y Chiruzzo, Luis
%Y Ritter, Alan
%Y Wang, Lu
%S Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 2: Short Papers)
%D 2025
%8 April
%I Association for Computational Linguistics
%C Albuquerque, New Mexico
%@ 979-8-89176-190-2
%F lee-etal-2025-prom
%X Large language models (LLMs) have become standard for natural language generation tasks, with instruction-tuning enhancing their capabilities. However, the lack of instruction-tuning datasets in languages other than English limits their application to diverse languages. To address this, researchers have adapted English-centric LLMs to other languages by appending English tuning data with its translated pair, from which we observe negative interference between the two. To resolve this, our contribution is identifying English as an internal pivot language, based on which we disentangle the roles of English and target language data in training. Specifically, we first design two roles as pivoted objectives, and also propose to regulate between the two, to better generalize for under-represented languages. Experiments across various languages demonstrate the effectiveness of our approach on multiple benchmarks. The code is publicly available for further exploration.
%R 10.18653/v1/2025.naacl-short.19
%U https://aclanthology.org/2025.naacl-short.19/
%U https://doi.org/10.18653/v1/2025.naacl-short.19
%P 222-228

Markdown (Informal)
[PROM: Pivoted and Regulated Optimization for Multilingual Instruction Learning](https://aclanthology.org/2025.naacl-short.19/) (Lee et al., NAACL 2025)

ACL
Jaeseong Lee, Seung-won Hwang, Hojin Lee, Yunju Bak, and Changmin Lee. 2025. PROM: Pivoted and Regulated Optimization for Multilingual Instruction Learning. In Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 2: Short Papers), pages 222–228, Albuquerque, New Mexico. Association for Computational Linguistics.