Chinese Relation Extraction with Multi-Grained Information and External Linguistic Knowledge

Ziran Li, Ning Ding, Zhiyuan Liu, Haitao Zheng, Ying Shen


Abstract
Chinese relation extraction is conducted using neural networks with either character-based or word-based inputs, and most existing methods typically suffer from segmentation errors and ambiguity of polysemy. To address the issues, we propose a multi-grained lattice framework (MG lattice) for Chinese relation extraction to take advantage of multi-grained language information and external linguistic knowledge. In this framework, (1) we incorporate word-level information into character sequence inputs so that segmentation errors can be avoided. (2) We also model multiple senses of polysemous words with the help of external linguistic knowledge, so as to alleviate polysemy ambiguity. Experiments on three real-world datasets in distinct domains show consistent and significant superiority and robustness of our model, as compared with other baselines. We will release the source code of this paper in the future.
Anthology ID:
P19-1430
Volume:
Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics
Month:
July
Year:
2019
Address:
Florence, Italy
Editors:
Anna Korhonen, David Traum, Lluís Màrquez
Venue:
ACL
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
4377–4386
Language:
URL:
https://aclanthology.org/P19-1430/
DOI:
10.18653/v1/P19-1430
Bibkey:
Cite (ACL):
Ziran Li, Ning Ding, Zhiyuan Liu, Haitao Zheng, and Ying Shen. 2019. Chinese Relation Extraction with Multi-Grained Information and External Linguistic Knowledge. In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, pages 4377–4386, Florence, Italy. Association for Computational Linguistics.
Cite (Informal):
Chinese Relation Extraction with Multi-Grained Information and External Linguistic Knowledge (Li et al., ACL 2019)
Copy Citation:
PDF:
https://aclanthology.org/P19-1430.pdf
Video:
 https://aclanthology.org/P19-1430.mp4
Code
 thunlp/Chinese_NRE