Modeling Bilingual Conversational Characteristics for Neural Chat Translation

Yunlong Liang, Fandong Meng, Yufeng Chen, Jinan Xu, Jie Zhou


Abstract
Neural chat translation aims to translate bilingual conversational text, a task with broad applications in international exchange and cooperation. Despite the impressive performance of sentence-level and context-aware Neural Machine Translation (NMT), translating bilingual conversational text remains challenging because of its inherent characteristics, such as role preference, dialogue coherence, and translation consistency. In this paper, we aim to improve the translation quality of conversational text by modeling these properties. Specifically, we design three latent variational modules to learn the distributions of bilingual conversational characteristics. Latent variables sampled from these learned distributions, tailored for role preference, dialogue coherence, and translation consistency, are incorporated into the NMT model for better translation. We evaluate our approach on the benchmark dataset BConTrasT (English<->German) and a self-collected bilingual dialogue corpus named BMELD (English<->Chinese). Extensive experiments show that our approach outperforms strong baselines by a large margin and significantly surpasses some state-of-the-art context-aware NMT models in terms of BLEU and TER. Additionally, we make the BMELD dataset publicly available to the research community.
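The sampling step described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes each of the three modules outputs the mean and log-variance of a diagonal Gaussian, draws a sample with the standard reparameterization trick, and fuses the samples with a decoder state by simple concatenation. The function names (`sample_latent`, `incorporate`), the vector sizes, and all numeric values are hypothetical.

```python
import math
import random


def sample_latent(mean, log_var, rng):
    """Reparameterized draw z = mean + sigma * eps, with eps ~ N(0, 1).

    Assumes a diagonal Gaussian; sigma = exp(0.5 * log_var).
    """
    return [m + math.exp(0.5 * lv) * rng.gauss(0.0, 1.0)
            for m, lv in zip(mean, log_var)]


def incorporate(hidden, latents):
    """Hypothetical fusion: append every sampled latent vector to the state."""
    fused = list(hidden)
    for z in latents.values():
        fused.extend(z)
    return fused


rng = random.Random(0)

# Assumed (mean, log-variance) outputs of the three variational modules.
params = {
    "role_preference": ([0.1, -0.2], [-1.0, -1.0]),
    "dialogue_coherence": ([0.0, 0.3], [-2.0, -0.5]),
    "translation_consistency": ([0.5, 0.5], [-1.5, -1.5]),
}

# One sampled latent vector per conversational characteristic.
latents = {name: sample_latent(mu, lv, rng) for name, (mu, lv) in params.items()}

hidden = [0.2, 0.4, 0.6]  # stand-in for a decoder hidden state
fused = incorporate(hidden, latents)
print(len(fused))  # 3 state dimensions + 3 latents of size 2 = 9
```

Concatenation is used here only because it is the simplest way to show the latent variables conditioning the translation model; the actual fusion mechanism is described in the paper itself.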
Anthology ID:
2021.acl-long.444
Volume:
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers)
Month:
August
Year:
2021
Address:
Online
Venues:
ACL | IJCNLP
Publisher:
Association for Computational Linguistics
Pages:
5711–5724
URL:
https://aclanthology.org/2021.acl-long.444
DOI:
10.18653/v1/2021.acl-long.444
Cite (ACL):
Yunlong Liang, Fandong Meng, Yufeng Chen, Jinan Xu, and Jie Zhou. 2021. Modeling Bilingual Conversational Characteristics for Neural Chat Translation. In Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers), pages 5711–5724, Online. Association for Computational Linguistics.
Cite (Informal):
Modeling Bilingual Conversational Characteristics for Neural Chat Translation (Liang et al., ACL 2021)
PDF:
https://aclanthology.org/2021.acl-long.444.pdf
Video:
https://aclanthology.org/2021.acl-long.444.mp4
Code:
XL2248/CPCC
Data:
BMELD, MELD, Taskmaster-1