Reflective Decoding: Beyond Unidirectional Generation with Off-the-Shelf Language Models

Peter West; Ximing Lu; Ari Holtzman; Chandra Bhagavatula; Jena D. Hwang; Yejin Choi

doi:10.18653/v1/2021.acl-long.114

Abstract

Publicly available, large pretrained Language Models (LMs) generate text with remarkable quality, but only sequentially from left to right. As a result, they are not immediately applicable to generation tasks that break the unidirectional assumption, such as paraphrasing or text-infilling, necessitating task-specific supervision. In this paper, we present Reflective Decoding, a novel unsupervised algorithm that allows for direct application of unidirectional LMs to non-sequential tasks. Our 2-step approach requires no supervision or even parallel corpora, only two off-the-shelf pretrained LMs in opposite directions: forward and backward. First, in the contextualization step, we use LMs to generate ensembles of past and future contexts which collectively capture the input (e.g. the source sentence for paraphrasing). Second, in the reflection step, we condition on these “context ensembles”, generating outputs that are compatible with them. Comprehensive empirical results demonstrate that Reflective Decoding outperforms strong unsupervised baselines on both paraphrasing and abductive text infilling, significantly narrowing the gap between unsupervised and supervised methods. Reflective Decoding surpasses multiple supervised baselines on various metrics including human evaluation.
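
The two steps described in the abstract can be illustrated with a toy sketch. This is not the authors' implementation: the lookup-table "LMs", the names `make_lm`, `contextualize`, `reflect`, `reflective_decode`, the `UNSEEN_LOGPROB` penalty, and all example sentences and scores are assumptions made for illustration. As a further simplification, the sketch ranks a fixed list of candidates by summed log-probability instead of generating text.

```python
from typing import Callable, Dict, List, Sequence

# Hypothetical stand-in for a pretrained LM: maps a conditioning text to
# {neighbouring text: log-probability}. A forward LM proposes what follows
# the text; a backward LM proposes what precedes it.
ToyLM = Callable[[str], Dict[str, float]]

UNSEEN_LOGPROB = -20.0  # assumed penalty for text the toy LM never proposes


def make_lm(table: Dict[str, Dict[str, float]]) -> ToyLM:
    """Wrap a lookup table as a toy LM (illustration only)."""
    return lambda text: table.get(text, {})


def contextualize(lm: ToyLM, source: str, k: int) -> List[str]:
    """Step 1: keep the k most probable contexts the LM gives for the source."""
    scored = lm(source)
    return sorted(scored, key=lambda ctx: scored[ctx], reverse=True)[:k]


def reflect(lm: ToyLM, contexts: Sequence[str], candidate: str) -> float:
    """Step 2: sum the candidate's log-probability over the context ensemble."""
    return sum(lm(ctx).get(candidate, UNSEEN_LOGPROB) for ctx in contexts)


def reflective_decode(forward: ToyLM, backward: ToyLM, source: str,
                      candidates: Sequence[str], k: int = 2) -> str:
    """Pick the candidate most compatible with both context ensembles."""
    future = contextualize(forward, source, k)  # texts that may follow the source
    past = contextualize(backward, source, k)   # texts that may precede the source

    # The backward LM conditions on future contexts, the forward LM on past ones.
    def score(candidate: str) -> float:
        return reflect(backward, future, candidate) + reflect(forward, past, candidate)

    return max(candidates, key=score)


# Made-up tables standing in for two off-the-shelf LMs in opposite directions.
forward_lm = make_lm({
    "it is raining": {"take an umbrella": -0.5, "the streets are wet": -1.0},
    "yesterday was cloudy": {"it is raining": -1.0, "rain is falling": -1.2,
                             "it is sunny": -3.0},
})
backward_lm = make_lm({
    "it is raining": {"yesterday was cloudy": -0.7},
    "take an umbrella": {"it is raining": -0.8, "rain is falling": -1.0,
                         "it is sunny": -4.0},
    "the streets are wet": {"it is raining": -0.9, "rain is falling": -0.6,
                            "it is sunny": -5.0},
})

best = reflective_decode(forward_lm, backward_lm, "it is raining",
                         candidates=["rain is falling", "it is sunny"])
print(best)
```

With these made-up tables, the candidate "rain is falling" is selected because it fits both the sampled future contexts and the sampled past context, while "it is sunny" fits neither. Summing log-probabilities across contexts is one simple way to combine an ensemble; the abstract does not specify how the ensembles are combined, so this choice is an assumption.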

Anthology ID: 2021.acl-long.114
Volume: Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers)
Month: August
Year: 2021
Address: Online
Editors: Chengqing Zong, Fei Xia, Wenjie Li, Roberto Navigli
Venues: ACL | IJCNLP
Publisher: Association for Computational Linguistics
Pages: 1435–1450
URL: https://aclanthology.org/2021.acl-long.114/
DOI: 10.18653/v1/2021.acl-long.114
Cite (ACL): Peter West, Ximing Lu, Ari Holtzman, Chandra Bhagavatula, Jena D. Hwang, and Yejin Choi. 2021. Reflective Decoding: Beyond Unidirectional Generation with Off-the-Shelf Language Models. In Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers), pages 1435–1450, Online. Association for Computational Linguistics.
Cite (Informal): Reflective Decoding: Beyond Unidirectional Generation with Off-the-Shelf Language Models (West et al., ACL-IJCNLP 2021)
PDF: https://aclanthology.org/2021.acl-long.114.pdf
Video: https://aclanthology.org/2021.acl-long.114.mp4