Liangxin Liu
2024
Curriculum Consistency Learning for Conditional Sentence Generation
Liangxin Liu
|
Xuebo Liu
|
Lian Lian
|
Shengjun Cheng
|
Jun Rao
|
Tengfei Yu
|
Hexuan Deng
|
Min Zhang
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing
Consistency learning (CL) has proven to be a valuable technique for improving the robustness of models in conditional sentence generation (CSG) tasks by ensuring stable predictions across various input data forms. However, models augmented with CL often face challenges in optimizing consistency features, which can detract from their efficiency and effectiveness. To address these challenges, we introduce Curriculum Consistency Learning (CCL), a novel strategy that guides models to learn consistency in alignment with their current capacity to differentiate between features. CCL is designed around the inherent aspects of CL-related losses, promoting task independence and simplifying implementation. Implemented across four representative CSG tasks, including instruction tuning (IT) for large language models and machine translation (MT) in three modalities (text, speech, and vision), CCL demonstrates marked improvements. Specifically, it delivers +2.0 average accuracy point improvement compared with vanilla IT and an average increase of +0.7 in COMET scores over traditional CL methods in MT tasks. Our comprehensive analysis further indicates that models utilizing CCL are particularly adept at managing complex instances, showcasing the effectiveness and efficiency of CCL in improving CSG models. Code and scripts are available at https://github.com/xinxinxing/Curriculum-Consistency-Learning.
Search
Fix data
Co-authors
- Shengjun Cheng 1
- Hexuan Deng 1
- Lian Lian 1
- Xuebo Liu 1
- Jun Rao 1
- show all...