Improving Zero-Shot Event Extraction via Sentence Simplification

Sneha Mehta, Huzefa Rangwala, Naren Ramakrishnan


Abstract
The success of sites such as ACLED and Our World in Data have demonstrated the massive utility of extracting events in structured formats from large volumes of textual data in the formof news, social media, blogs and discussion forums. Event extraction can provide a window into ongoing geopolitical crises and yield actionable intelligence. In this work, we cast socio-political event extraction as a machine reading comprehension (MRC) task. % With the proliferation of large pretrained language models Machine Reading Comprehension (MRC) has emerged as a new paradigm for event extraction in recent times. In this approach, extraction of social-political actors and targets from a sentence is framed as an extractive question-answering problem conditioned on an event type. There are several advantages of using MRC for this task including the ability to leverage large pretrained multilingual language models and their ability to perform zero-shot extraction. Moreover, we find that the problem of long-range dependencies, i.e., large lexical distance between trigger and argument words and the difficulty of processing syntactically complex sentences plague MRC-based approaches. To address this, we present a general approach to improve the performance of MRC-based event extraction by performing unsupervised sentence simplification guided by the MRC model itself. We evaluate our approach on the ICEWS geopolitical event extraction dataset, with specific attention to ‘Actor’ and ‘Target’ argument roles. We show how such context simplification can improve the performance of MRC-based event extraction by more than 5% for actor extraction and more than 10% for target extraction.
Anthology ID:
2022.case-1.5
Volume:
Proceedings of the 5th Workshop on Challenges and Applications of Automated Extraction of Socio-political Events from Text (CASE)
Month:
December
Year:
2022
Address:
Abu Dhabi, United Arab Emirates (Hybrid)
Editors:
Ali Hürriyetoğlu, Hristo Tanev, Vanni Zavarella, Erdem Yörük
Venue:
CASE
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
32–43
Language:
URL:
https://aclanthology.org/2022.case-1.5
DOI:
10.18653/v1/2022.case-1.5
Bibkey:
Cite (ACL):
Sneha Mehta, Huzefa Rangwala, and Naren Ramakrishnan. 2022. Improving Zero-Shot Event Extraction via Sentence Simplification. In Proceedings of the 5th Workshop on Challenges and Applications of Automated Extraction of Socio-political Events from Text (CASE), pages 32–43, Abu Dhabi, United Arab Emirates (Hybrid). Association for Computational Linguistics.
Cite (Informal):
Improving Zero-Shot Event Extraction via Sentence Simplification (Mehta et al., CASE 2022)
Copy Citation:
PDF:
https://aclanthology.org/2022.case-1.5.pdf
Video:
 https://aclanthology.org/2022.case-1.5.mp4