Cross-lingual Knowledge Projection Using Machine Translation and Target-side Knowledge Base Completion

Naoki Otani, Hirokazu Kiyomaru, Daisuke Kawahara, Sadao Kurohashi


Abstract
Considerable effort has been devoted to building commonsense knowledge bases. However, they are not available in many languages because the construction of KBs is expensive. To bridge the gap between languages, this paper addresses the problem of projecting the knowledge in English, a resource-rich language, into other languages, where the main challenge lies in projection ambiguity. This ambiguity is partially solved by machine translation and target-side knowledge base completion, but neither of them is adequately reliable by itself. We show their combination can project English commonsense knowledge into Japanese and Chinese with high precision. Our method also achieves a top-10 accuracy of 90% on the crowdsourced English–Japanese benchmark. Furthermore, we use our method to obtain 18,747 facts of accurate Japanese commonsense within a very short period.
Anthology ID:
C18-1128
Volume:
Proceedings of the 27th International Conference on Computational Linguistics
Month:
August
Year:
2018
Address:
Santa Fe, New Mexico, USA
Editors:
Emily M. Bender, Leon Derczynski, Pierre Isabelle
Venue:
COLING
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
1508–1520
Language:
URL:
https://aclanthology.org/C18-1128
DOI:
Bibkey:
Cite (ACL):
Naoki Otani, Hirokazu Kiyomaru, Daisuke Kawahara, and Sadao Kurohashi. 2018. Cross-lingual Knowledge Projection Using Machine Translation and Target-side Knowledge Base Completion. In Proceedings of the 27th International Conference on Computational Linguistics, pages 1508–1520, Santa Fe, New Mexico, USA. Association for Computational Linguistics.
Cite (Informal):
Cross-lingual Knowledge Projection Using Machine Translation and Target-side Knowledge Base Completion (Otani et al., COLING 2018)
Copy Citation:
PDF:
https://aclanthology.org/C18-1128.pdf
Code
 notani/CLKP-MTKBC
Data
ConceptNet