Learning to Recover from Multi-Modality Errors for Non-Autoregressive Neural Machine Translation

Qiu Ran; Yankai Lin (林衍凯); Peng Li; Jie Zhou

doi:10.18653/v1/2020.acl-main.277

Learning to Recover from Multi-Modality Errors for Non-Autoregressive Neural Machine Translation

Abstract

Non-autoregressive neural machine translation (NAT) predicts the entire target sequence simultaneously and significantly accelerates inference process. However, NAT discards the dependency information in a sentence, and thus inevitably suffers from the multi-modality problem: the target tokens may be provided by different possible translations, often causing token repetitions or missing. To alleviate this problem, we propose a novel semi-autoregressive model RecoverSAT in this work, which generates a translation as a sequence of segments. The segments are generated simultaneously while each segment is predicted token-by-token. By dynamically determining segment length and deleting repetitive segments, RecoverSAT is capable of recovering from repetitive and missing token errors. Experimental results on three widely-used benchmark datasets show that our proposed model achieves more than 4 times speedup while maintaining comparable performance compared with the corresponding autoregressive model.

Anthology ID:: 2020.acl-main.277
Volume:: Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics
Month:: July
Year:: 2020
Address:: Online
Editors:: Dan Jurafsky, Joyce Chai, Natalie Schluter, Joel Tetreault
Venue:: ACL
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 3059–3069
Language:
URL:: https://aclanthology.org/2020.acl-main.277/
DOI:: 10.18653/v1/2020.acl-main.277
Bibkey:
Cite (ACL):: Qiu Ran, Yankai Lin, Peng Li, and Jie Zhou. 2020. Learning to Recover from Multi-Modality Errors for Non-Autoregressive Neural Machine Translation. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, pages 3059–3069, Online. Association for Computational Linguistics.
Cite (Informal):: Learning to Recover from Multi-Modality Errors for Non-Autoregressive Neural Machine Translation (Ran et al., ACL 2020)
Copy Citation:
PDF:: https://aclanthology.org/2020.acl-main.277.pdf
Video:: http://slideslive.com/38929246

PDF Cite Search Video Fix data