Natural Language Inference with Mixed Effects

William Gantt, Benjamin Kane, Aaron Steven White


Abstract
There is growing evidence that the prevalence of disagreement in the raw annotations used to construct natural language inference datasets makes the common practice of aggregating those annotations to a single label problematic. We propose a generic method that allows one to skip the aggregation step and train on the raw annotations directly without subjecting the model to unwanted noise that can arise from annotator response biases. We demonstrate that this method, which generalizes the notion of a mixed effects model by incorporating annotator random effects into any existing neural model, improves performance over models that do not incorporate such effects.
Anthology ID:
2020.starsem-1.9
Original:
2020.starsem-1.9v1
Version 2:
2020.starsem-1.9v2
Volume:
Proceedings of the Ninth Joint Conference on Lexical and Computational Semantics
Month:
December
Year:
2020
Address:
Barcelona, Spain (Online)
Editors:
Iryna Gurevych, Marianna Apidianaki, Manaal Faruqui
Venue:
*SEM
SIG:
SIGLEX
Publisher:
Association for Computational Linguistics
Note:
Pages:
81–87
Language:
URL:
https://aclanthology.org/2020.starsem-1.9
DOI:
Bibkey:
Cite (ACL):
William Gantt, Benjamin Kane, and Aaron Steven White. 2020. Natural Language Inference with Mixed Effects. In Proceedings of the Ninth Joint Conference on Lexical and Computational Semantics, pages 81–87, Barcelona, Spain (Online). Association for Computational Linguistics.
Cite (Informal):
Natural Language Inference with Mixed Effects (Gantt et al., *SEM 2020)
Copy Citation:
PDF:
https://aclanthology.org/2020.starsem-1.9.pdf
Code
 wgantt/nli-mixed-models
Data
MegaNegRaisingMegaVeridicality