University at Buffalo at SemEval-2023 Task 11: MASDA–Modelling Annotator Sensibilities through DisAggregation

Michael Sullivan, Mohammed Yasin, Cassandra L. Jacobs


Abstract
Modeling the most likely label when an annotation task is perspective-dependent discards relevant sources of variation that come from the annotators themselves. We present three approaches to modeling the controversiality of a particular text. First, we explicitly represented annotators using annotator embeddings to predict the training signals of each annotator’s selections in addition to a majority class label. This method leads to reduction in error relative to models without these features, allowing the overall result to influence the weights of each annotator on the final prediction. In a second set of experiments, annotators were not modeled individually but instead annotator judgments were combined in a pairwise fashion that allowed us to implicitly combine annotators. Overall, we found that aggregating and explicitly comparing annotators’ responses to a static document representation produced high-quality predictions in all datasets, though some systems struggle to account for large or variable numbers of annotators.
Anthology ID:
2023.semeval-1.135
Volume:
Proceedings of the 17th International Workshop on Semantic Evaluation (SemEval-2023)
Month:
July
Year:
2023
Address:
Toronto, Canada
Editors:
Atul Kr. Ojha, A. Seza Doğruöz, Giovanni Da San Martino, Harish Tayyar Madabushi, Ritesh Kumar, Elisa Sartori
Venue:
SemEval
SIG:
SIGLEX
Publisher:
Association for Computational Linguistics
Note:
Pages:
978–985
Language:
URL:
https://aclanthology.org/2023.semeval-1.135
DOI:
10.18653/v1/2023.semeval-1.135
Bibkey:
Cite (ACL):
Michael Sullivan, Mohammed Yasin, and Cassandra L. Jacobs. 2023. University at Buffalo at SemEval-2023 Task 11: MASDA–Modelling Annotator Sensibilities through DisAggregation. In Proceedings of the 17th International Workshop on Semantic Evaluation (SemEval-2023), pages 978–985, Toronto, Canada. Association for Computational Linguistics.
Cite (Informal):
University at Buffalo at SemEval-2023 Task 11: MASDA–Modelling Annotator Sensibilities through DisAggregation (Sullivan et al., SemEval 2023)
Copy Citation:
PDF:
https://aclanthology.org/2023.semeval-1.135.pdf
Video:
 https://aclanthology.org/2023.semeval-1.135.mp4