Efficient Unsupervised Sentence Compression by Fine-tuning Transformers with Reinforcement Learning

Demian Ghalandari, Chris Hokamp, Georgiana Ifrim


Abstract
Sentence compression reduces the length of text by removing non-essential content while preserving important facts and grammaticality. Unsupervised objective driven methods for sentence compression can be used to create customized models without the need for ground-truth training data, while allowing flexibility in the objective function(s) that are used for learning and inference. Recent unsupervised sentence compression approaches use custom objectives to guide discrete search; however, guided search is expensive at inference time. In this work, we explore the use of reinforcement learning to train effective sentence compression models that are also fast when generating predictions. In particular, we cast the task as binary sequence labelling and fine-tune a pre-trained transformer using a simple policy gradient approach. Our approach outperforms other unsupervised models while also being more efficient at inference time.
Anthology ID:
2022.acl-long.90
Volume:
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Month:
May
Year:
2022
Address:
Dublin, Ireland
Venue:
ACL
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
1267–1280
Language:
URL:
https://aclanthology.org/2022.acl-long.90
DOI:
10.18653/v1/2022.acl-long.90
Bibkey:
Cite (ACL):
Demian Ghalandari, Chris Hokamp, and Georgiana Ifrim. 2022. Efficient Unsupervised Sentence Compression by Fine-tuning Transformers with Reinforcement Learning. In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 1267–1280, Dublin, Ireland. Association for Computational Linguistics.
Cite (Informal):
Efficient Unsupervised Sentence Compression by Fine-tuning Transformers with Reinforcement Learning (Ghalandari et al., ACL 2022)
Copy Citation:
PDF:
https://aclanthology.org/2022.acl-long.90.pdf
Code
 complementizer/rl-sentence-compression
Data
NEWSROOMSentence Compression