Contextualized Cross-Lingual Event Trigger Extraction with Minimal Resources

Meryem M’hamdi; Marjorie Freedman; Jonathan May

doi:10.18653/v1/K19-1061

Abstract

Event trigger extraction is an information extraction task of practical utility, yet it is challenging due to the difficulty of disambiguating word senses. Previous approaches rely extensively on hand-crafted language-specific features and are applied mainly to English, for which annotated datasets and Natural Language Processing (NLP) tools are available. However, the availability of such resources varies from one language to another. Recently, contextualized Bidirectional Encoder Representations from Transformers (BERT) models have established state-of-the-art performance for a variety of NLP tasks. However, there has not been much effort in exploring language transfer using BERT for event extraction. In this work, we treat event trigger extraction as a sequence tagging problem and propose a cross-lingual framework for training it without any hand-crafted features. We experiment with different flavors of transfer learning from high-resource to low-resource languages and compare the performance of different multilingual embeddings for event trigger extraction. Our results show that training in a multilingual setting outperforms language-specific models for both English and Chinese. Our work is the first to experiment with two event architecture variants in a cross-lingual setting, to show the effectiveness of contextualized embeddings obtained using BERT, and to explore and analyze its performance on Arabic.
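
To make the sequence tagging formulation concrete, the following is a minimal sketch of how trigger annotations might be turned into per-token labels. The BIO scheme, the `tag_triggers` helper, the example sentence, and the `Attack` event type are all illustrative assumptions; the abstract does not specify the labeling scheme or label set the authors use.

```python
from typing import List, Tuple


def tag_triggers(tokens: List[str],
                 triggers: List[Tuple[int, int, str]]) -> List[str]:
    """Convert trigger spans into one BIO tag per token.

    Each trigger is (start, end_exclusive, event_type). This is a
    hypothetical helper for illustration, not the authors' code.
    """
    tags = ["O"] * len(tokens)  # every token starts as "outside"
    for start, end, event_type in triggers:
        tags[start] = f"B-{event_type}"       # first token of the trigger
        for i in range(start + 1, end):
            tags[i] = f"I-{event_type}"       # continuation tokens
    return tags


# Assumed example: "opened fire" is a two-token trigger of type "Attack".
tokens = ["Troops", "opened", "fire", "on", "the", "convoy"]
tags = tag_triggers(tokens, [(1, 3, "Attack")])
for token, tag in zip(tokens, tags):
    print(f"{token}\t{tag}")
```

Under this framing, a tagger only has to predict one label per token, which is what lets a contextualized encoder such as BERT be applied without hand-crafted features.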

Anthology ID: K19-1061
Volume: Proceedings of the 23rd Conference on Computational Natural Language Learning (CoNLL)
Month: November
Year: 2019
Address: Hong Kong, China
Editors: Mohit Bansal, Aline Villavicencio
Venue: CoNLL
SIG: SIGNLL
Publisher: Association for Computational Linguistics
Pages: 656–665
URL: https://aclanthology.org/K19-1061/
DOI: 10.18653/v1/K19-1061
Cite (ACL): Meryem M’hamdi, Marjorie Freedman, and Jonathan May. 2019. Contextualized Cross-Lingual Event Trigger Extraction with Minimal Resources. In Proceedings of the 23rd Conference on Computational Natural Language Learning (CoNLL), pages 656–665, Hong Kong, China. Association for Computational Linguistics.
Cite (Informal): Contextualized Cross-Lingual Event Trigger Extraction with Minimal Resources (M’hamdi et al., CoNLL 2019)
PDF: https://aclanthology.org/K19-1061.pdf
Attachment: K19-1061.Attachment.pdf
Supplementary material: K19-1061.Supplementary_Material.pdf
