MinIE: Minimizing Facts in Open Information Extraction

Kiril Gashteovski, Rainer Gemulla, Luciano del Corro


Abstract
The goal of Open Information Extraction (OIE) is to extract surface relations and their arguments from natural-language text in an unsupervised, domain-independent manner. In this paper, we propose MinIE, an OIE system that aims to provide useful, compact extractions with high precision and recall. MinIE approaches these goals by (1) representing information about polarity, modality, attribution, and quantities with semantic annotations instead of in the actual extraction, and (2) identifying and removing parts that are considered overly specific. We conducted an experimental study with several real-world datasets and found that MinIE achieves competitive or higher precision and recall than most prior systems, while at the same time producing shorter, semantically enriched extractions.
Anthology ID:
D17-1278
Volume:
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing
Month:
September
Year:
2017
Address:
Copenhagen, Denmark
Editors:
Martha Palmer, Rebecca Hwa, Sebastian Riedel
Venue:
EMNLP
SIG:
SIGDAT
Publisher:
Association for Computational Linguistics
Note:
Pages:
2630–2640
Language:
URL:
https://aclanthology.org/D17-1278
DOI:
10.18653/v1/D17-1278
Bibkey:
Cite (ACL):
Kiril Gashteovski, Rainer Gemulla, and Luciano del Corro. 2017. MinIE: Minimizing Facts in Open Information Extraction. In Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, pages 2630–2640, Copenhagen, Denmark. Association for Computational Linguistics.
Cite (Informal):
MinIE: Minimizing Facts in Open Information Extraction (Gashteovski et al., EMNLP 2017)
Copy Citation:
PDF:
https://aclanthology.org/D17-1278.pdf
Data
New York Times Annotated Corpus