Title: MICo: Preventative Detoxification of Large Language Models through Inhibition Control
Authors: Roy Siegelmann, Ninareh Mehrabi, Palash Goyal, Prasoon Goyal, Lisa Bauer, Jwala Dhamala, Aram Galstyan, Rahul Gupta, Reza Ghanadan
Date: 2024-06
Type: text (conference publication)
Published in: Findings of the Association for Computational Linguistics: NAACL 2024
Editors: Kevin Duh, Helena Gomez, Steven Bethard
Publisher: Association for Computational Linguistics
Location: Mexico City, Mexico
Pages: 1696-1703
Citation key: siegelmann-etal-2024-mico
DOI: 10.18653/v1/2024.findings-naacl.110
URL: https://aclanthology.org/2024.findings-naacl.110/