Knowledge Graph Enhanced Neural Machine Translation via Multi-task Learning on Sub-entity Granularity

Yang Zhao; Lu Xiang; Junnan Zhu; Jiajun Zhang; Yu Zhou; Chengqing Zong

doi:10.18653/v1/2020.coling-main.397

Abstract

Previous studies combining knowledge graphs (KGs) with neural machine translation (NMT) have two problems: i) Knowledge under-utilization: they focus only on the entities that appear in both the KG and the training sentence pairs, leaving much of the knowledge in the KG unused. ii) Granularity mismatch: current KG methods use the entity as the basic granularity, while NMT uses the sub-word as its granularity, making the KG difficult to use in NMT. To alleviate these problems, we propose a multi-task learning method on sub-entity granularity. Specifically, we first split the entities in the KG and in the sentence pairs into sub-entity granularity using joint BPE. We then use multi-task learning to combine the machine translation task and the knowledge reasoning task. Extensive experiments on various translation tasks demonstrate that our method significantly outperforms the baseline models in both translation quality and entity handling.
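The sub-entity splitting step described in the abstract can be illustrated with a minimal sketch. It assumes a single ordered BPE merge list shared by KG entities and sentence tokens; the merge table, the example triple, and the helper names (`apply_bpe`, `segment_triple`, `segment_sentence`) are hypothetical and not taken from the paper. Leaving the relation unsplit is also an assumption made only for this illustration.

```python
from typing import List, Tuple

Merge = Tuple[str, str]

# Hypothetical toy merge table, listed in priority order.
# In practice the merges would be learned jointly from the corpus and the KG.
TOY_MERGES: List[Merge] = [
    ("p", "a"), ("r", "i"), ("pa", "ri"),
    ("f", "r"), ("a", "n"), ("fr", "an"),
]


def apply_bpe(word: str, merges: List[Merge]) -> List[str]:
    """Segment one word by applying BPE merges in priority order."""
    symbols = list(word)
    for left, right in merges:
        merged: List[str] = []
        i = 0
        while i < len(symbols):
            # Merge an adjacent (left, right) pair into one symbol.
            if i + 1 < len(symbols) and symbols[i] == left and symbols[i + 1] == right:
                merged.append(left + right)
                i += 2
            else:
                merged.append(symbols[i])
                i += 1
        symbols = merged
    return symbols


def segment_sentence(sentence: str, merges: List[Merge]) -> List[List[str]]:
    """Split every whitespace-separated token of a sentence into sub-words."""
    return [apply_bpe(token, merges) for token in sentence.split()]


def segment_triple(
    triple: Tuple[str, str, str], merges: List[Merge]
) -> Tuple[List[str], str, List[str]]:
    """Split the head and tail entities of a KG triple with the same merges.

    The relation is left unsplit here (an assumption of this sketch).
    """
    head, relation, tail = triple
    return apply_bpe(head, merges), relation, apply_bpe(tail, merges)


if __name__ == "__main__":
    print(segment_sentence("paris is large", TOY_MERGES))
    print(segment_triple(("paris", "capital_of", "france"), TOY_MERGES))
```

Because both functions use the same merge list, an entity is segmented identically whether it appears in a sentence or in a KG triple, which is the property the sub-entity granularity relies on.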

Anthology ID: 2020.coling-main.397
Volume: Proceedings of the 28th International Conference on Computational Linguistics
Month: December
Year: 2020
Address: Barcelona, Spain (Online)
Editors: Donia Scott, Nuria Bel, Chengqing Zong
Venue: COLING
Publisher: International Committee on Computational Linguistics
Pages: 4495–4505
URL: https://aclanthology.org/2020.coling-main.397/
DOI: 10.18653/v1/2020.coling-main.397
Cite (ACL): Yang Zhao, Lu Xiang, Junnan Zhu, Jiajun Zhang, Yu Zhou, and Chengqing Zong. 2020. Knowledge Graph Enhanced Neural Machine Translation via Multi-task Learning on Sub-entity Granularity. In Proceedings of the 28th International Conference on Computational Linguistics, pages 4495–4505, Barcelona, Spain (Online). International Committee on Computational Linguistics.
Cite (Informal): Knowledge Graph Enhanced Neural Machine Translation via Multi-task Learning on Sub-entity Granularity (Zhao et al., COLING 2020)
PDF: https://aclanthology.org/2020.coling-main.397.pdf
