Domain Aligned Prefix Averaging for Domain Generalization in Abstractive Summarization

Pranav Nair, Sukomal Pal, Pradeepika Verma


Abstract
Domain generalization is hitherto an underexplored area applied in abstractive summarization. Moreover, most existing works on domain generalization have sophisticated training algorithms. In this paper, we propose a lightweight, weight averaging based, Domain Aligned Prefix Averaging approach to domain generalization for abstractive summarization. Given a number of source domains, our method first trains a prefix for each one of them. These source prefixes generate summaries for a small number of target domain documents. The similarity of the generated summaries to their corresponding source documents is used for calculating weights required to average source prefixes. In DAPA, prefix tuning allows for lightweight finetuning, and weight averaging allows for the computationally efficient addition of new source domains. When evaluated on four diverse summarization domains, DAPA shows comparable or better performance against the baselines demonstrating the effectiveness of its prefix averaging scheme.
Anthology ID:
2023.findings-acl.288
Volume:
Findings of the Association for Computational Linguistics: ACL 2023
Month:
July
Year:
2023
Address:
Toronto, Canada
Editors:
Anna Rogers, Jordan Boyd-Graber, Naoaki Okazaki
Venue:
Findings
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
4696–4710
Language:
URL:
https://aclanthology.org/2023.findings-acl.288
DOI:
10.18653/v1/2023.findings-acl.288
Bibkey:
Cite (ACL):
Pranav Nair, Sukomal Pal, and Pradeepika Verma. 2023. Domain Aligned Prefix Averaging for Domain Generalization in Abstractive Summarization. In Findings of the Association for Computational Linguistics: ACL 2023, pages 4696–4710, Toronto, Canada. Association for Computational Linguistics.
Cite (Informal):
Domain Aligned Prefix Averaging for Domain Generalization in Abstractive Summarization (Nair et al., Findings 2023)
Copy Citation:
PDF:
https://aclanthology.org/2023.findings-acl.288.pdf