Human Rationales as Attribution Priors for Explainable Stance Detection

Sahil Jayaram, Emily Allaway


Abstract
As NLP systems become better at detecting opinions and beliefs from text, it is important to ensure not only that models are accurate but also that they arrive at their predictions in ways that align with human reasoning. In this work, we present a method for imparting human-like rationalization to a stance detection model using crowdsourced annotations on a small fraction of the training data. We show that in a data-scarce setting, our approach can improve the reasoning of a state-of-the-art classifier—particularly for inputs containing challenging phenomena such as sarcasm—at no cost in predictive performance. Furthermore, we demonstrate that attention weights surpass a leading attribution method in providing faithful explanations of our model’s predictions, thus serving as a computationally cheap and reliable source of attributions for our model.
Anthology ID:
2021.emnlp-main.450
Volume:
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing
Month:
November
Year:
2021
Address:
Online and Punta Cana, Dominican Republic
Editors:
Marie-Francine Moens, Xuanjing Huang, Lucia Specia, Scott Wen-tau Yih
Venue:
EMNLP
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
5540–5554
Language:
URL:
https://aclanthology.org/2021.emnlp-main.450
DOI:
10.18653/v1/2021.emnlp-main.450
Bibkey:
Cite (ACL):
Sahil Jayaram and Emily Allaway. 2021. Human Rationales as Attribution Priors for Explainable Stance Detection. In Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, pages 5540–5554, Online and Punta Cana, Dominican Republic. Association for Computational Linguistics.
Cite (Informal):
Human Rationales as Attribution Priors for Explainable Stance Detection (Jayaram & Allaway, EMNLP 2021)
Copy Citation:
PDF:
https://aclanthology.org/2021.emnlp-main.450.pdf
Video:
 https://aclanthology.org/2021.emnlp-main.450.mp4
Code
 sahilj97/explainable-stance-detection