Lifelong Sequence Generation with Dynamic Module Expansion and Adaptation

Chengwei Qin; Chen Chen; Shafiq Joty

doi:10.18653/v1/2023.emnlp-main.414

Lifelong Sequence Generation with Dynamic Module Expansion and Adaptation

Abstract

Lifelong sequence generation (LSG), a problem in continual learning, aims to continually train a model on a sequence of generation tasks to learn constantly emerging new generation patterns while avoiding the forgetting of previous knowledge. Existing LSG methods mainly focus on maintaining old knowledge while paying little attention to knowledge transfer across tasks. In contrast, humans can better learn new tasks by leveraging previously acquired knowledge from similar tasks. Inspired by the learning paradigm of humans, we propose Dynamic Module Expansion and Adaptation (DMEA), which enables the model to dynamically determine the architecture for acquiring new knowledge based on task correlation and select the most similar previous tasks to facilitate adaptation to new tasks. In addition, as the learning process can easily be biased towards the current task which might cause more severe forgetting of previously learned knowledge, we propose dynamic gradient scaling to balance the learning of the current task and replayed tasks. With extensive experiments, we demonstrate that DMEA can consistently outperform existing methods in different LSG settings.

Anthology ID:: 2023.emnlp-main.414
Volume:: Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing
Month:: December
Year:: 2023
Address:: Singapore
Editors:: Houda Bouamor, Juan Pino, Kalika Bali
Venue:: EMNLP
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 6701–6714
Language:
URL:: https://aclanthology.org/2023.emnlp-main.414/
DOI:: 10.18653/v1/2023.emnlp-main.414
Bibkey:
Cite (ACL):: Chengwei Qin, Chen Chen, and Shafiq Joty. 2023. Lifelong Sequence Generation with Dynamic Module Expansion and Adaptation. In Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, pages 6701–6714, Singapore. Association for Computational Linguistics.
Cite (Informal):: Lifelong Sequence Generation with Dynamic Module Expansion and Adaptation (Qin et al., EMNLP 2023)
Copy Citation:
PDF:: https://aclanthology.org/2023.emnlp-main.414.pdf
Video:: https://aclanthology.org/2023.emnlp-main.414.mp4

PDF Cite Search Video Fix data