Title: Detoxifying Language Models Risks Marginalizing Minority Voices
Authors: Albert Xu; Eshaan Pathak; Eric Wallace; Suchin Gururangan; Maarten Sap; Dan Klein
Date: 2021-06
Resource type: text
Published in: Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
Editors: Kristina Toutanova; Anna Rumshisky; Luke Zettlemoyer; Dilek Hakkani-Tur; Iz Beltagy; Steven Bethard; Ryan Cotterell; Tanmoy Chakraborty; Yichao Zhou
Publisher: Association for Computational Linguistics
Place: Online
Genre: conference publication
Citation key: xu-etal-2021-detoxifying
DOI: 10.18653/v1/2021.naacl-main.190
URL: https://aclanthology.org/2021.naacl-main.190/
Pages: 2390-2397