Diversity-Aware Annotation for Conversational AI Safety

Alicia Parrish, Vinodkumar Prabhakaran, Lora Aroyo, Mark Díaz, Christopher M. Homan, Greg Serapio-García, Alex S. Taylor, Ding Wang


Abstract
How people interpret content is deeply influenced by their socio-cultural backgrounds and lived experiences. This is especially crucial when evaluating AI systems for safety, where accounting for such diversity in interpretations and potential impacts on human users will make them both more successful and more inclusive. While recent work has demonstrated the importance of diversity in the human ratings that underlie AI pipelines, effective and efficient ways to incorporate diverse perspectives into human data annotation pipelines are still largely elusive. In this paper, we discuss the primary challenges faced in incorporating diversity into model evaluations, and propose a practical diversity-aware annotation approach. Using an existing dataset with highly parallel safety annotations, we take as a test case a policy that prioritizes recall of safety issues, and demonstrate that our diversity-aware approach can efficiently obtain a higher recall of safety issues flagged by minoritized rater groups without hurting overall precision.
Anthology ID:
2024.safety4convai-1.2
Volume:
Proceedings of Safety4ConvAI: The Third Workshop on Safety for Conversational AI @ LREC-COLING 2024
Month:
May
Year:
2024
Address:
Torino, Italia
Editors:
Tanvi Dinkar, Giuseppe Attanasio, Amanda Cercas Curry, Ioannis Konstas, Dirk Hovy, Verena Rieser
Venues:
Safety4ConvAI | WS
Publisher:
ELRA and ICCL
Pages:
8–15
URL:
https://aclanthology.org/2024.safety4convai-1.2
Cite (ACL):
Alicia Parrish, Vinodkumar Prabhakaran, Lora Aroyo, Mark Díaz, Christopher M. Homan, Greg Serapio-García, Alex S. Taylor, and Ding Wang. 2024. Diversity-Aware Annotation for Conversational AI Safety. In Proceedings of Safety4ConvAI: The Third Workshop on Safety for Conversational AI @ LREC-COLING 2024, pages 8–15, Torino, Italia. ELRA and ICCL.
Cite (Informal):
Diversity-Aware Annotation for Conversational AI Safety (Parrish et al., Safety4ConvAI-WS 2024)
PDF:
https://aclanthology.org/2024.safety4convai-1.2.pdf