Agree, Disagree, Explain: Decomposing Human Label Variation in NLI through the Lens of Explanations

Pingjun Hong; Beiduo Chen; Siyao Peng; Marie-Catherine de Marneffe; Benjamin Roth; Barbara Plank

Agree, Disagree, Explain: Decomposing Human Label Variation in NLI through the Lens of Explanations

Pingjun Hong, Beiduo Chen, Siyao Peng, Marie-Catherine de Marneffe, Benjamin Roth, Barbara Plank

Abstract

Natural Language Inference (NLI) datasets often exhibit human label variation. To better understand these variations, explanation-based approaches analyze the underlying reasoning behind annotators’ decisions. One such approach is the LiTEx taxonomy, which categorizes free-text explanations in English into reasoning categories. However, previous work applying LiTEx has focused on within-label variation: cases where annotators agree on the NLI label but provide different explanations. This paper broadens the scope by examining how annotators may diverge not only in the reasoning category but also in the labeling. We use explanations as a lens to analyze variation in NLI annotations and to examine individual differences in reasoning. We apply LiTEx to two NLI datasets and align annotation variation from multiple aspects: NLI label agreement, explanation similarity, and taxonomy agreement, with an additional compounding factor of annotators’ selection bias. We observe instances where annotators disagree on the label but provide similar explanations, suggesting that surface-level disagreement may mask underlying agreement in interpretation. Moreover, our analysis reveals individual preferences in explanation strategies and label choices. These findings highlight that agreement in reasoning categories better reflects the semantic similarity of explanations than label agreement alone. Our findings underscore the richness of reasoning-based explanations and the need for caution in treating labels as ground truth.

Anthology ID:: 2026.findings-acl.1342
Volume:: Findings of the Association for Computational Linguistics: ACL 2026
Month:: July
Year:: 2026
Address:: San Diego, California, United States
Editors:: Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:: Findings
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 26922–26934
Language:
URL:: https://aclanthology.org/2026.findings-acl.1342/
DOI:
Bibkey:
Cite (ACL):: Pingjun Hong, Beiduo Chen, Siyao Peng, Marie-Catherine de Marneffe, Benjamin Roth, and Barbara Plank. 2026. Agree, Disagree, Explain: Decomposing Human Label Variation in NLI through the Lens of Explanations. In Findings of the Association for Computational Linguistics: ACL 2026, pages 26922–26934, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):: Agree, Disagree, Explain: Decomposing Human Label Variation in NLI through the Lens of Explanations (Hong et al., Findings 2026)
Copy Citation:
PDF:: https://aclanthology.org/2026.findings-acl.1342.pdf
Checklist:: 2026.findings-acl.1342.checklist.pdf

PDF Cite Search Checklist Fix data