Types of Out-of-Distribution Texts and How to Detect Them

Udit Arora, William Huang, He He


Abstract
Despite agreement on the importance of detecting out-of-distribution (OOD) examples, there is little consensus on the formal definition of the distribution shifts of OOD examples and how to best detect them. We categorize these examples as exhibiting a background shift or semantic shift, and find that the two major approaches to OOD detection, calibration and density estimation (language modeling for text), have distinct behavior on these types of OOD data. Across 14 pairs of in-distribution and OOD English natural language understanding datasets, we find that density estimation methods consistently beat calibration methods in background shift settings and perform worse in semantic shift settings. In addition, we find that both methods generally fail to detect examples from challenge data, indicating that these examples constitute a different type of OOD data. Overall, while the categorization we apply explains many of the differences between the two methods, our results call for a more explicit definition of OOD to create better benchmarks and build detectors that can target the type of OOD data expected at test time.
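To make the two families of detectors in the abstract concrete, here is a minimal sketch. It assumes that the calibration score is derived from a classifier's maximum softmax probability and that the density estimation score is the perplexity a language model assigns to the text. Both are common choices, but the abstract does not spell them out, so the function names and scoring conventions below are illustrative assumptions rather than the authors' implementation.

```python
import math


def max_softmax_probability(logits):
    """Confidence of a classifier: the largest softmax probability."""
    m = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(x - m) for x in logits]
    return max(exps) / sum(exps)


def perplexity(token_log_probs):
    """Perplexity from per-token natural-log probabilities under a language model."""
    return math.exp(-sum(token_log_probs) / len(token_log_probs))


def calibration_ood_score(logits):
    # Assumed convention: lower classifier confidence means "more OOD".
    return 1.0 - max_softmax_probability(logits)


def density_ood_score(token_log_probs):
    # Assumed convention: higher perplexity (lower density) means "more OOD".
    return perplexity(token_log_probs)
```

In this sketch, `logits` would come from a task classifier and `token_log_probs` from a language model trained on in-distribution text. An input is flagged as OOD when its score exceeds a threshold chosen on held-out data.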
Anthology ID:
2021.emnlp-main.835
Volume:
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing
Month:
November
Year:
2021
Address:
Online and Punta Cana, Dominican Republic
Editors:
Marie-Francine Moens, Xuanjing Huang, Lucia Specia, Scott Wen-tau Yih
Venue:
EMNLP
Publisher:
Association for Computational Linguistics
Pages:
10687–10701
URL:
https://aclanthology.org/2021.emnlp-main.835
DOI:
10.18653/v1/2021.emnlp-main.835
Cite (ACL):
Udit Arora, William Huang, and He He. 2021. Types of Out-of-Distribution Texts and How to Detect Them. In Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, pages 10687–10701, Online and Punta Cana, Dominican Republic. Association for Computational Linguistics.
Cite (Informal):
Types of Out-of-Distribution Texts and How to Detect Them (Arora et al., EMNLP 2021)
PDF:
https://aclanthology.org/2021.emnlp-main.835.pdf
Video:
https://aclanthology.org/2021.emnlp-main.835.mp4
Code:
uditarora/ood-text-emnlp
Data:
Civil Comments, DBpedia, GLUE, IMDb Movie Reviews, MultiNLI, RTE, SNLI, SST, SST-2