Do Large Language Models Adapt to Language Variation across Socioeconomic Status?

Elisa Bassignana; Mike Zhang; Dirk Hovy; Amanda Cercas Curry

Do Large Language Models Adapt to Language Variation across Socioeconomic Status?

Elisa Bassignana, Mike Zhang, Dirk Hovy, Amanda Cercas Curry

Abstract

Humans adjust their linguistic style to the audience they are addressing. However, the extent to which LLMs adapt to different social contexts is largely unknown. As these models increasingly mediate human-to-human communication, their failure to adapt to diverse styles can perpetuate stereotypes and marginalize communities whose linguistic norms are less closely mirrored by the models, thereby reinforcing social stratification. We study the extent to which LLMs integrate into social media communication across different socioeconomic status (SES) communities. We collect a novel dataset from Reddit and YouTube, stratified by SES. We prompt four LLMs with incomplete text from that corpus and compare the LLM-generated completions to the originals along 94 sociolinguistic metrics, including syntactic, rhetorical, and lexical features. LLMs modulate their style with respect to SES to only a minor extent, often resulting in approximation or caricature, and tend to emulate the style of upper SES more effectively. Our findings (1) show how LLMs risk amplifying linguistic hierarchies and (2) call into question their validity for agent-based social simulation, survey experiments, and any research relying on language style as a social signal.

Anthology ID:: 2026.vardial-1.26
Volume:: Proceedings of the 13th Workshop on NLP for Similar Languages, Varieties and Dialects
Month:: March
Year:: 2026
Address:: Rabat, Morocco
Venues:: VarDial | WS
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 317–338
Language:
URL:: https://aclanthology.org/2026.vardial-1.26/
DOI:
Bibkey:
Cite (ACL):: Elisa Bassignana, Mike Zhang, Dirk Hovy, and Amanda Cercas Curry. 2026. Do Large Language Models Adapt to Language Variation across Socioeconomic Status?. In Proceedings of the 13th Workshop on NLP for Similar Languages, Varieties and Dialects, pages 317–338, Rabat, Morocco. Association for Computational Linguistics.
Cite (Informal):: Do Large Language Models Adapt to Language Variation across Socioeconomic Status? (Bassignana et al., VarDial 2026)
Copy Citation:
PDF:: https://aclanthology.org/2026.vardial-1.26.pdf

PDF Cite Search Fix data