Plug-and-Play Document Modules for Pre-trained Models

Chaojun Xiao; Zhengyan Zhang; Xu Han; Chi-Min Chan; Yankai Lin; Zhiyuan Liu; Xiangyang Li; Zhonghua Li; Zhao Cao; Maosong Sun

doi:10.18653/v1/2023.acl-long.875

Plug-and-Play Document Modules for Pre-trained Models

Chaojun Xiao, Zhengyan Zhang, Xu Han, Chi-Min Chan, Yankai Lin, Zhiyuan Liu, Xiangyang Li, Zhonghua Li, Zhao Cao, Maosong Sun

Abstract

Large-scale pre-trained models (PTMs) have been widely used in document-oriented NLP tasks, such as question answering. However, the encoding-task coupling requirement results in the repeated encoding of the same documents for different tasks and queries, which is highly computationally inefficient. To this end, we target to decouple document encoding from downstream tasks, and propose to represent each document as a plug-and-play document module, i.e., a document plugin, for PTMs (PlugD). By inserting document plugins into the backbone PTM for downstream tasks, we can encode a document one time to handle multiple tasks, which is more efficient than conventional encoding-task coupling methods that simultaneously encode documents and input queries using task-specific encoders. Extensive experiments on 8 datasets of 4 typical NLP tasks show that PlugD enables models to encode documents once and for all across different scenarios. Especially, PlugD can save 69% computational costs while achieving comparable performance to state-of-the-art encoding-task coupling methods. Additionally, we show that PlugD can serve as an effective post-processing way to inject knowledge into task-specific models, improving model performance without any additional model training. Our code and checkpoints can be found in https://github.com/thunlp/Document-Plugin.

Anthology ID:: 2023.acl-long.875
Volume:: Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Month:: July
Year:: 2023
Address:: Toronto, Canada
Editors:: Anna Rogers, Jordan Boyd-Graber, Naoaki Okazaki
Venue:: ACL
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 15713–15729
Language:
URL:: https://aclanthology.org/2023.acl-long.875
DOI:: 10.18653/v1/2023.acl-long.875
Bibkey:
Cite (ACL):: Chaojun Xiao, Zhengyan Zhang, Xu Han, Chi-Min Chan, Yankai Lin, Zhiyuan Liu, Xiangyang Li, Zhonghua Li, Zhao Cao, and Maosong Sun. 2023. Plug-and-Play Document Modules for Pre-trained Models. In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 15713–15729, Toronto, Canada. Association for Computational Linguistics.
Cite (Informal):: Plug-and-Play Document Modules for Pre-trained Models (Xiao et al., ACL 2023)
Copy Citation:
PDF:: https://aclanthology.org/2023.acl-long.875.pdf
Video:: https://aclanthology.org/2023.acl-long.875.mp4

PDF Cite Search Video