A Neural Model for Aggregating Coreference Annotation in Crowdsourcing

Maolin Li, Hiroya Takamura, Sophia Ananiadou


Abstract
Coreference resolution is the task of identifying all mentions in a text that refer to the same real-world entity. Collecting sufficient labelled data from expert annotators to train a high-performance coreference resolution system is time-consuming and expensive. Crowdsourcing makes it possible to obtain the required amounts of data rapidly and cost-effectively. However, crowd-sourced labels can be noisy. To ensure high-quality data, it is crucial to infer the correct labels by aggregating the noisy labels. In this paper, we split the aggregation into two subtasks, i.e, mention classification and coreference chain inference. Firstly, we predict the general class of each mention using an autoencoder, which incorporates contextual information about each mention, while at the same time taking into account the mention’s annotation complexity and annotators’ reliability at different levels. Secondly, to determine the coreference chain of each mention, we use weighted voting which takes into account the learned reliability in the first subtask. Experimental results demonstrate the effectiveness of our method in predicting the correct labels. We also illustrate our model’s interpretability through a comprehensive analysis of experimental results.
Anthology ID:
2020.coling-main.507
Volume:
Proceedings of the 28th International Conference on Computational Linguistics
Month:
December
Year:
2020
Address:
Barcelona, Spain (Online)
Editors:
Donia Scott, Nuria Bel, Chengqing Zong
Venue:
COLING
SIG:
Publisher:
International Committee on Computational Linguistics
Note:
Pages:
5760–5773
Language:
URL:
https://aclanthology.org/2020.coling-main.507
DOI:
10.18653/v1/2020.coling-main.507
Bibkey:
Cite (ACL):
Maolin Li, Hiroya Takamura, and Sophia Ananiadou. 2020. A Neural Model for Aggregating Coreference Annotation in Crowdsourcing. In Proceedings of the 28th International Conference on Computational Linguistics, pages 5760–5773, Barcelona, Spain (Online). International Committee on Computational Linguistics.
Cite (Informal):
A Neural Model for Aggregating Coreference Annotation in Crowdsourcing (Li et al., COLING 2020)
Copy Citation:
PDF:
https://aclanthology.org/2020.coling-main.507.pdf