CompactIE: Compact Facts in Open Information Extraction

Farima Fatahi Bayat; Nikita Bhutani; H. Jagadish

doi:10.18653/v1/2022.naacl-main.65

CompactIE: Compact Facts in Open Information Extraction

Farima Fatahi Bayat, Nikita Bhutani, H. Jagadish

Abstract

A major drawback of modern neural OpenIE systems and benchmarks is that they prioritize high coverage of information in extractions over compactness of their constituents. This severely limits the usefulness of OpenIE extractions in many downstream tasks. The utility of extractions can be improved if extractions are compact and share constituents. To this end, we study the problem of identifying compact extractions with neural-based methods. We propose CompactIE, an OpenIE system that uses a novel pipelined approach to produce compact extractions with overlapping constituents. It first detects constituents of the extractions and then links them to build extractions. We train our system on compact extractions obtained by processing existing benchmarks. Our experiments on CaRB and Wire57 datasets indicate that CompactIE finds 1.5x-2x more compact extractions than previous systems, with high precision, establishing a new state-of-the-art performance in OpenIE.

Anthology ID:: 2022.naacl-main.65
Volume:: Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
Month:: July
Year:: 2022
Address:: Seattle, United States
Editors:: Marine Carpuat, Marie-Catherine de Marneffe, Ivan Vladimir Meza Ruiz
Venue:: NAACL
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 900–910
Language:
URL:: https://aclanthology.org/2022.naacl-main.65
DOI:: 10.18653/v1/2022.naacl-main.65
Bibkey:
Cite (ACL):: Farima Fatahi Bayat, Nikita Bhutani, and H. Jagadish. 2022. CompactIE: Compact Facts in Open Information Extraction. In Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pages 900–910, Seattle, United States. Association for Computational Linguistics.
Cite (Informal):: CompactIE: Compact Facts in Open Information Extraction (Fatahi Bayat et al., NAACL 2022)
Copy Citation:
PDF:: https://aclanthology.org/2022.naacl-main.65.pdf
Video:: https://aclanthology.org/2022.naacl-main.65.mp4
Code: farimafatahi/compactie
Data: BenchIE, WiRe57

PDF Cite Search Code Video