Explaining Why: How Instructions and User Interfaces Impact Annotator Rationales When Labeling Text Data

Jamar Sullivan Jr., Will Brackenbury, Andrew McNutt, Kevin Bryson, Kwam Byll, Yuxin Chen, Michael Littman, Chenhao Tan, Blase Ur


Abstract
In the context of data labeling, NLP researchers are increasingly interested in having humans select rationales, a subset of input tokens relevant to the chosen label. We conducted a 332-participant online user study to understand how humans select rationales, especially how different instructions and user interface affordances impact the rationales chosen. Participants labeled ten movie reviews as positive or negative, selecting words and phrases supporting their label as rationales. We varied the instructions given, the rationale-selection task, and the user interface. Participants often selected about 12% of input tokens as rationales, but selected fewer if unable to drag over multiple tokens at once. Whereas participants were near unanimous in their data labels, they were far less consistent in their rationales. The user interface affordances and task greatly impacted the types of rationales chosen. We also observed large variance across participants.
Anthology ID:
2022.naacl-main.38
Volume:
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
Month:
July
Year:
2022
Address:
Seattle, United States
Editors:
Marine Carpuat, Marie-Catherine de Marneffe, Ivan Vladimir Meza Ruiz
Venue:
NAACL
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
521–531
Language:
URL:
https://aclanthology.org/2022.naacl-main.38
DOI:
10.18653/v1/2022.naacl-main.38
Bibkey:
Cite (ACL):
Jamar Sullivan Jr., Will Brackenbury, Andrew McNutt, Kevin Bryson, Kwam Byll, Yuxin Chen, Michael Littman, Chenhao Tan, and Blase Ur. 2022. Explaining Why: How Instructions and User Interfaces Impact Annotator Rationales When Labeling Text Data. In Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pages 521–531, Seattle, United States. Association for Computational Linguistics.
Cite (Informal):
Explaining Why: How Instructions and User Interfaces Impact Annotator Rationales When Labeling Text Data (Sullivan Jr. et al., NAACL 2022)
Copy Citation:
PDF:
https://aclanthology.org/2022.naacl-main.38.pdf
Software:
 2022.naacl-main.38.software.zip
Video:
 https://aclanthology.org/2022.naacl-main.38.mp4