Mitigating Spurious Correlation in Natural Language Understanding with Counterfactual Inference

Can Udomcharoenchaikit; Wuttikorn Ponwitayarat; Patomporn Payoungkhamdee; Kanruethai Masuk; Weerayut Buaphet; Ekapol Chuangsuwanich; Sarana Nutanong

doi:10.18653/v1/2022.emnlp-main.777

Mitigating Spurious Correlation in Natural Language Understanding with Counterfactual Inference

Can Udomcharoenchaikit, Wuttikorn Ponwitayarat, Patomporn Payoungkhamdee, Kanruethai Masuk, Weerayut Buaphet, Ekapol Chuangsuwanich, Sarana Nutanong

Abstract

Despite their promising results on standard benchmarks, NLU models are still prone to make predictions based on shortcuts caused by unintended bias in the dataset. For example, an NLI model may use lexical overlap as a shortcut to make entailment predictions due to repetitive data generation patterns from annotators, also called annotation artifacts. In this paper, we propose a causal analysis framework to help debias NLU models. We show that (1) by defining causal relationships, we can introspect how much annotation artifacts affect the outcomes. (2) We can utilize counterfactual inference to mitigate bias with this knowledge. We found that viewing a model as a treatment can mitigate bias more effectively than viewing annotation artifacts as treatment. (3) In addition to bias mitigation, we can interpret how much each debiasing strategy is affected by annotation artifacts. Our experimental results show that using counterfactual inference can improve out-of-distribution performance in all settings while maintaining high in-distribution performance.

Anthology ID:: 2022.emnlp-main.777
Volume:: Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing
Month:: December
Year:: 2022
Address:: Abu Dhabi, United Arab Emirates
Editors:: Yoav Goldberg, Zornitsa Kozareva, Yue Zhang
Venue:: EMNLP
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 11308–11321
Language:
URL:: https://aclanthology.org/2022.emnlp-main.777
DOI:: 10.18653/v1/2022.emnlp-main.777
Bibkey:
Cite (ACL):: Can Udomcharoenchaikit, Wuttikorn Ponwitayarat, Patomporn Payoungkhamdee, Kanruethai Masuk, Weerayut Buaphet, Ekapol Chuangsuwanich, and Sarana Nutanong. 2022. Mitigating Spurious Correlation in Natural Language Understanding with Counterfactual Inference. In Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, pages 11308–11321, Abu Dhabi, United Arab Emirates. Association for Computational Linguistics.
Cite (Informal):: Mitigating Spurious Correlation in Natural Language Understanding with Counterfactual Inference (Udomcharoenchaikit et al., EMNLP 2022)
Copy Citation:
PDF:: https://aclanthology.org/2022.emnlp-main.777.pdf

PDF Cite Search