Investigating Stigmatizing Language in Clinical Documentation with Open-Source Large Language Models

Rajashree Dahal; Pardis Hosseinpour; Pranithi Kamishetty; Satwik Pamulaparthy; Saeid Tizpaz-Niari; Natalie Parde

Investigating Stigmatizing Language in Clinical Documentation with Open-Source Large Language Models

Rajashree Dahal, Pardis Hosseinpour, Pranithi Kamishetty, Satwik Pamulaparthy, Saeid Tizpaz-Niari, Natalie Parde

Abstract

Clinical documentation is essential for patient care, billing, and medical research, but it is subject to entrenched bias. While manual chart reviews can identify such bias, they are labor-intensive and expert-dependent. We introduce and evaluate StigMAD, a Multi-Agent Debate framework leveraging open-source Large Language Models (LLMs) to detect stigmatizing language in clinical documentation. We investigate reasoning (multi-agent debate), self-reflection, and self-consistency within this framework. Extensive experiments on clinical notes and patient summaries demonstrate that our framework provides significant advantages over rule-based and supervised baselines. A domain-specific LLM (MedGemma) achieved its highest performance using the StigMAD reasoning framework, while a general-purpose LLM (Llama) showed superior results with the self-consistency framework. These findings suggest that open-source LLMs, steered by structured prompting and reflective reasoning, can effectively support the scalable auditing of stigmatizing language, marking a critical step toward more equitable clinical NLP systems.

Anthology ID:: 2026.bionlp-1.39
Volume:: BioNLP 2026
Month:: July
Year:: 2026
Address:: San Diego, California
Editors:: Dina Demner-Fushman, Sophia Ananiadou, Kirk Roberts, Junichi Tsujii
Venues:: BioNLP | WS
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 490–501
Language:
URL:: https://aclanthology.org/2026.bionlp-1.39/
DOI:
Bibkey:
Cite (ACL):: Rajashree Dahal, Pardis Hosseinpour, Pranithi Kamishetty, Satwik Pamulaparthy, Saeid Tizpaz-Niari, and Natalie Parde. 2026. Investigating Stigmatizing Language in Clinical Documentation with Open-Source Large Language Models. In BioNLP 2026, pages 490–501, San Diego, California. Association for Computational Linguistics.
Cite (Informal):: Investigating Stigmatizing Language in Clinical Documentation with Open-Source Large Language Models (Dahal et al., BioNLP 2026)
Copy Citation:
PDF:: https://aclanthology.org/2026.bionlp-1.39.pdf

PDF Cite Search Fix data