Curriculum Consistency Learning for Conditional Sentence Generation

Liangxin Liu; Xuebo Liu; Lian Lian; Shengjun Cheng; Jun Rao; Tengfei Yu; Hexuan Deng; Min Zhang

doi:10.18653/v1/2024.emnlp-main.768

Curriculum Consistency Learning for Conditional Sentence Generation

Liangxin Liu, Xuebo Liu, Lian Lian, Shengjun Cheng, Jun Rao, Tengfei Yu, Hexuan Deng, Min Zhang

Abstract

Consistency learning (CL) has proven to be a valuable technique for improving the robustness of models in conditional sentence generation (CSG) tasks by ensuring stable predictions across various input data forms. However, models augmented with CL often face challenges in optimizing consistency features, which can detract from their efficiency and effectiveness. To address these challenges, we introduce Curriculum Consistency Learning (CCL), a novel strategy that guides models to learn consistency in alignment with their current capacity to differentiate between features. CCL is designed around the inherent aspects of CL-related losses, promoting task independence and simplifying implementation. Implemented across four representative CSG tasks, including instruction tuning (IT) for large language models and machine translation (MT) in three modalities (text, speech, and vision), CCL demonstrates marked improvements. Specifically, it delivers +2.0 average accuracy point improvement compared with vanilla IT and an average increase of +0.7 in COMET scores over traditional CL methods in MT tasks. Our comprehensive analysis further indicates that models utilizing CCL are particularly adept at managing complex instances, showcasing the effectiveness and efficiency of CCL in improving CSG models. Code and scripts are available at https://github.com/xinxinxing/Curriculum-Consistency-Learning.

Anthology ID:: 2024.emnlp-main.768
Volume:: Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing
Month:: November
Year:: 2024
Address:: Miami, Florida, USA
Editors:: Yaser Al-Onaizan, Mohit Bansal, Yun-Nung Chen
Venue:: EMNLP
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 13865–13881
Language:
URL:: https://aclanthology.org/2024.emnlp-main.768/
DOI:: 10.18653/v1/2024.emnlp-main.768
Bibkey:
Cite (ACL):: Liangxin Liu, Xuebo Liu, Lian Lian, Shengjun Cheng, Jun Rao, Tengfei Yu, Hexuan Deng, and Min Zhang. 2024. Curriculum Consistency Learning for Conditional Sentence Generation. In Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, pages 13865–13881, Miami, Florida, USA. Association for Computational Linguistics.
Cite (Informal):: Curriculum Consistency Learning for Conditional Sentence Generation (Liu et al., EMNLP 2024)
Copy Citation:
PDF:: https://aclanthology.org/2024.emnlp-main.768.pdf
Software:: 2024.emnlp-main.768.software.zip

PDF Cite Search Software Fix data