Leveraging syntactic parsing to improve event annotation matching

Camiel Colruyt, Orphée De Clercq, Véronique Hoste


Abstract
Detecting event mentions is the first step in event extraction from text and annotating them is a notoriously difficult task. Evaluating annotator consistency is crucial when building datasets for mention detection. When event mentions are allowed to cover many tokens, annotators may disagree on their span, which means that overlapping annotations may then refer to the same event or to different events. This paper explores different fuzzy-matching functions which aim to resolve this ambiguity. The functions extract the sets of syntactic heads present in the annotations, use the Dice coefficient to measure the similarity between sets and return a judgment based on a given threshold. The functions are tested against the judgment of a human evaluator and a comparison is made between sets of tokens and sets of syntactic heads. The best-performing function is a head-based function that is found to agree with the human evaluator in 89% of cases.
Anthology ID:
D19-5903
Volume:
Proceedings of the First Workshop on Aggregating and Analysing Crowdsourced Annotations for NLP
Month:
November
Year:
2019
Address:
Hong Kong
Editors:
Silviu Paun, Dirk Hovy
Venue:
WS
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
15–23
Language:
URL:
https://aclanthology.org/D19-5903
DOI:
10.18653/v1/D19-5903
Bibkey:
Cite (ACL):
Camiel Colruyt, Orphée De Clercq, and Véronique Hoste. 2019. Leveraging syntactic parsing to improve event annotation matching. In Proceedings of the First Workshop on Aggregating and Analysing Crowdsourced Annotations for NLP, pages 15–23, Hong Kong. Association for Computational Linguistics.
Cite (Informal):
Leveraging syntactic parsing to improve event annotation matching (Colruyt et al., 2019)
Copy Citation:
PDF:
https://aclanthology.org/D19-5903.pdf