Whose Language Counts as High Quality? Measuring Language Ideologies in Text Data Selection Suchin Gururangan author Dallas Card author Sarah Dreier author Emily Gade author Leroy Wang author Zeyu Wang author Luke Zettlemoyer author Noah A Smith author 2022-12 text Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing Yoav Goldberg editor Zornitsa Kozareva editor Yue Zhang editor Association for Computational Linguistics Abu Dhabi, United Arab Emirates conference publication gururangan-etal-2022-whose 10.18653/v1/2022.emnlp-main.165 https://aclanthology.org/2022.emnlp-main.165/ 2022-12 2562 2580