Open Information Extraction via Chunks

Kuicai Dong, Aixin Sun, Jung-jae Kim, Xiaoli Li


Abstract
Open Information Extraction (OIE) aims to extract relational tuples from open-domain sentences. Existing OIE systems split a sentence into tokens and recognize token spans as tuple relations and arguments. We instead propose Sentence as Chunk sequence (SaC) and recognize chunk spans as tuple relations and arguments. We argue that SaC has better properties for OIE than sentence as token sequence, and evaluate four choices of chunks (i.e., CoNLL chunks, OIA simple phrases, noun phrases, and spans from SpanOIE). Also, we propose a simple end-to-end BERT-based model, Chunk-OIE, for sentence chunking and tuple extraction on top of SaC. Chunk-OIE achieves state-of-the-art results on multiple OIE datasets, showing that SaC benefits the OIE task.
Anthology ID:
2023.emnlp-main.951
Volume:
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing
Month:
December
Year:
2023
Address:
Singapore
Editors:
Houda Bouamor, Juan Pino, Kalika Bali
Venue:
EMNLP
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
15390–15404
Language:
URL:
https://aclanthology.org/2023.emnlp-main.951
DOI:
10.18653/v1/2023.emnlp-main.951
Bibkey:
Cite (ACL):
Kuicai Dong, Aixin Sun, Jung-jae Kim, and Xiaoli Li. 2023. Open Information Extraction via Chunks. In Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, pages 15390–15404, Singapore. Association for Computational Linguistics.
Cite (Informal):
Open Information Extraction via Chunks (Dong et al., EMNLP 2023)
Copy Citation:
PDF:
https://aclanthology.org/2023.emnlp-main.951.pdf
Video:
 https://aclanthology.org/2023.emnlp-main.951.mp4