Abstractive Text Summarization Using the BRIO Training Paradigm

Khang Lam, Thieu Doan, Khang Pham, Jugal Kalita


Abstract
Summary sentences produced by abstractive summarization models may be coherent and comprehensive, but they lack control and rely heavily on reference summaries. The BRIO training paradigm assumes a non-deterministic distribution to reduce the model’s dependence on reference summaries, and improve model performance during inference. This paper presents a straightforward but effective technique to improve abstractive summaries by fine-tuning pre-trained language models, and training them with the BRIO paradigm. We build a text summarization dataset for Vietnamese, called VieSum. We perform experiments with abstractive summarization models trained with the BRIO paradigm on the CNNDM and the VieSum datasets. The results show that the models, trained on basic hardware, outperform all existing abstractive summarization models, especially for Vietnamese.
Anthology ID:
2023.findings-acl.7
Volume:
Findings of the Association for Computational Linguistics: ACL 2023
Month:
July
Year:
2023
Address:
Toronto, Canada
Editors:
Anna Rogers, Jordan Boyd-Graber, Naoaki Okazaki
Venue:
Findings
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
92–99
Language:
URL:
https://aclanthology.org/2023.findings-acl.7
DOI:
10.18653/v1/2023.findings-acl.7
Bibkey:
Cite (ACL):
Khang Lam, Thieu Doan, Khang Pham, and Jugal Kalita. 2023. Abstractive Text Summarization Using the BRIO Training Paradigm. In Findings of the Association for Computational Linguistics: ACL 2023, pages 92–99, Toronto, Canada. Association for Computational Linguistics.
Cite (Informal):
Abstractive Text Summarization Using the BRIO Training Paradigm (Lam et al., Findings 2023)
Copy Citation:
PDF:
https://aclanthology.org/2023.findings-acl.7.pdf