Simulating Social Attitudes with LLMs: Accuracy, Demographic Effects, and Refusal Behavior in the Sensitive Domain of Suicide Prevention

Cristina J. Perez; Michael P. Vasquez Jr; Philippe Giabbanelli; Patrick Y. Wu

Simulating Social Attitudes with LLMs: Accuracy, Demographic Effects, and Refusal Behavior in the Sensitive Domain of Suicide Prevention

Cristina J. Perez, Michael P. Vasquez Jr, Philippe Giabbanelli, Patrick Y. Wu

Abstract

Large language models (LLMs) are increasingly used to simulate public opinion, yet their validity in sensitive policy domains remains underexplored. We evaluate whether LLMs can reproduce attitudes toward suicide prevention policies using 32 questions drawn from seven nationally representative U.S. surveys (2023-2025). We systematically vary demographic conditioning (race/ethnicity, gender, age, education, income, party), prompt framing (direct elicitation, respondent embodiment, specialist embodiment), and model architecture (GPT-5 Nano, DeepSeek V3.2, Meta Llama 3.1 8B, Mistral Small 24B). Across 811,560 prompts, the mean absolute error—the average gap between predicted and human response distributions—is 23 percentage points. We also find that LLM responses to demographic-conditioned prompts diverge substantially from prompts without demographic information. In short, what distribution LLMs draw on when generating responses to sensitive polling questions remains unclear. Model choice matters more than framing for accuracy, whereas refusal behavior varies sharply across models and prompt designs. Our findings highlight the limitations of LLMs for social simulation in the context of sensitive topics.

Anthology ID:: 2026.nlpcss-1.12
Volume:: Proceedings of the Seventh Workshop on Natural Language Processing and Computational Social Science
Month:: July
Year:: 2026
Address:: San Diego
Editors:: Dallas Card, Anjalie Field, Katherine Keith, Julia Mendelsohn
Venues:: NLP+CSS | WS
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 176–189
Language:
URL:: https://aclanthology.org/2026.nlpcss-1.12/
DOI:
Bibkey:
Cite (ACL):: Cristina J. Perez, Michael P. Vasquez Jr, Philippe Giabbanelli, and Patrick Y. Wu. 2026. Simulating Social Attitudes with LLMs: Accuracy, Demographic Effects, and Refusal Behavior in the Sensitive Domain of Suicide Prevention. In Proceedings of the Seventh Workshop on Natural Language Processing and Computational Social Science, pages 176–189, San Diego. Association for Computational Linguistics.
Cite (Informal):: Simulating Social Attitudes with LLMs: Accuracy, Demographic Effects, and Refusal Behavior in the Sensitive Domain of Suicide Prevention (Perez et al., NLP+CSS 2026)
Copy Citation:
PDF:: https://aclanthology.org/2026.nlpcss-1.12.pdf

PDF Cite Search Fix data