Evaluating Biases in Context-Dependent Sexual and Reproductive Health Questions

Sharon Levy, Tahilin Karver, William Adler, Michelle Kaufman, Mark Dredze


Abstract
Chat-based large language models have the opportunity to empower individuals lacking high-quality healthcare access to receive personalized information across a variety of topics. However, users may ask underspecified questions that require additional context for a model to correctly answer. We study how large language model biases are exhibited through these contextual questions in the healthcare domain. To accomplish this, we curate a dataset of sexual and reproductive healthcare questions (ContextSRH) that are dependent on age, sex, and location attributes. We compare models’ outputs with and without demographic context to determine answer alignment among our contextual questions. Our experiments reveal biases in each of these attributes, where young adult female users are favored.
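As a rough illustration of the comparison described in the abstract (not the authors' released code), the Python sketch below queries a chat model for the same sexual and reproductive health question with and without a prepended demographic context sentence and checks whether the answers align. The question, the context strings, the ask_model callable, and the token-overlap agreement heuristic are all assumptions made for illustration.

```python
from typing import Callable

# Hypothetical ContextSRH-style question: the correct answer depends on
# the user's age, sex, and location.
QUESTION = "Am I old enough to get the HPV vaccine without parental consent?"

# Demographic contexts to prepend; attribute values are illustrative only.
CONTEXTS = [
    "I am a 16-year-old female living in California.",
    "I am a 45-year-old male living in Texas.",
]

def answers_align(a: str, b: str) -> bool:
    """Crude agreement heuristic: Jaccard overlap of answer tokens."""
    ta, tb = set(a.lower().split()), set(b.lower().split())
    return len(ta & tb) / max(len(ta | tb), 1) > 0.5

def compare_with_and_without_context(ask_model: Callable[[str], str]) -> None:
    """Query the model for the bare question and for each contextualized
    version, then report whether the contextual answer matches the default."""
    baseline = ask_model(QUESTION)
    for context in CONTEXTS:
        contextual = ask_model(f"{context} {QUESTION}")
        status = "aligned" if answers_align(baseline, contextual) else "diverged"
        print(f"[{status}] context: {context}")

if __name__ == "__main__":
    # Stand-in model for demonstration; replace with a real chat LLM call.
    compare_with_and_without_context(lambda prompt: "It depends on local laws.")
```

Systematic divergence for some demographic contexts but not others would indicate the kind of attribute-dependent bias the paper measures.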
Anthology ID: 2024.findings-emnlp.332
Volume: Findings of the Association for Computational Linguistics: EMNLP 2024
Month: November
Year: 2024
Address: Miami, Florida, USA
Editors: Yaser Al-Onaizan, Mohit Bansal, Yun-Nung Chen
Venue: Findings
Publisher: Association for Computational Linguistics
Pages: 5801–5812
URL: https://aclanthology.org/2024.findings-emnlp.332
Cite (ACL): Sharon Levy, Tahilin Karver, William Adler, Michelle Kaufman, and Mark Dredze. 2024. Evaluating Biases in Context-Dependent Sexual and Reproductive Health Questions. In Findings of the Association for Computational Linguistics: EMNLP 2024, pages 5801–5812, Miami, Florida, USA. Association for Computational Linguistics.
Cite (Informal): Evaluating Biases in Context-Dependent Sexual and Reproductive Health Questions (Levy et al., Findings 2024)
PDF: https://aclanthology.org/2024.findings-emnlp.332.pdf
Data: 2024.findings-emnlp.332.data.zip