Cross-lingual Inference with A Chinese Entailment Graph

Tianyi Li, Sabine Weber, Mohammad Javad Hosseini, Liane Guillou, Mark Steedman


Abstract
Predicate entailment detection is a crucial task for question-answering from text, where previous work has explored unsupervised learning of entailment graphs from typed open relation triples. In this paper, we present the first pipeline for building Chinese entailment graphs, which involves a novel high-recall open relation extraction (ORE) method and the first Chinese fine-grained entity typing dataset under the FIGER type ontology. Through experiments on the Levy-Holt dataset, we verify the strength of our Chinese entailment graph, and reveal the cross-lingual complementarity: on the parallel Levy-Holt dataset, an ensemble of Chinese and English entailment graphs outperforms both monolingual graphs, and raises unsupervised SOTA by 4.7 AUC points.
Anthology ID:
2022.findings-acl.96
Volume:
Findings of the Association for Computational Linguistics: ACL 2022
Month:
May
Year:
2022
Address:
Dublin, Ireland
Editors:
Smaranda Muresan, Preslav Nakov, Aline Villavicencio
Venue:
Findings
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
1214–1233
Language:
URL:
https://aclanthology.org/2022.findings-acl.96
DOI:
10.18653/v1/2022.findings-acl.96
Bibkey:
Cite (ACL):
Tianyi Li, Sabine Weber, Mohammad Javad Hosseini, Liane Guillou, and Mark Steedman. 2022. Cross-lingual Inference with A Chinese Entailment Graph. In Findings of the Association for Computational Linguistics: ACL 2022, pages 1214–1233, Dublin, Ireland. Association for Computational Linguistics.
Cite (Informal):
Cross-lingual Inference with A Chinese Entailment Graph (Li et al., Findings 2022)
Copy Citation:
PDF:
https://aclanthology.org/2022.findings-acl.96.pdf
Software:
 2022.findings-acl.96.software.zip
Video:
 https://aclanthology.org/2022.findings-acl.96.mp4
Code
 teddy-li/chineseentgraph
Data
CLUEFIGER