@inproceedings{muller-eberstein-etal-2025-dakultur,
title = "{D}a{K}ultur: Evaluating the Cultural Awareness of Language Models for {D}anish with Native Speakers",
author = {M{\"u}ller-Eberstein, Max and
Zhang, Mike and
Bassignana, Elisa and
Trolle, Peter Brunsgaard and
van der Goot, Rob},
editor = "Prabhakaran, Vinodkumar and
Dev, Sunipa and
Benotti, Luciana and
Hershcovich, Daniel and
Cao, Yong and
Zhou, Li and
Cabello, Laura and
Adebara, Ife",
booktitle = "Proceedings of the 3rd Workshop on Cross-Cultural Considerations in NLP (C3NLP 2025)",
month = may,
year = "2025",
address = "Albuquerque, New Mexico",
publisher = "Association for Computational Linguistics",
url = "https://aclanthology.org/2025.c3nlp-1.5/",
doi = "10.18653/v1/2025.c3nlp-1.5",
pages = "50--58",
ISBN = "979-8-89176-237-4",
abstract = "Large Language Models (LLMs) have seen widespread societal adoption. However, while they are able to interact with users in languages beyond English, they have been shown to lack cultural awareness, providing anglocentric or inappropriate responses for underrepresented language communities. To investigate this gap and disentangle linguistic versus cultural proficiency, we conduct the first cultural evaluation study for the mid-resource language of Danish, in which native speakers prompt different models to solve tasks requiring cultural awareness. Our analysis of the resulting 1,038 interactions from 63 demographically diverse participants highlights open challenges to cultural adaptation: Particularly, how currently employed automatically translated data are insufficient to train or measure cultural adaptation, and how training on native-speaker data can more than double response acceptance rates. We release our study data as DaKultur - the first native Danish cultural awareness dataset."
}
<?xml version="1.0" encoding="UTF-8"?>
<modsCollection xmlns="http://www.loc.gov/mods/v3">
<mods ID="muller-eberstein-etal-2025-dakultur">
<titleInfo>
<title>DaKultur: Evaluating the Cultural Awareness of Language Models for Danish with Native Speakers</title>
</titleInfo>
<name type="personal">
<namePart type="given">Max</namePart>
<namePart type="family">Müller-Eberstein</namePart>
<role>
<roleTerm authority="marcrelator" type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Mike</namePart>
<namePart type="family">Zhang</namePart>
<role>
<roleTerm authority="marcrelator" type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Elisa</namePart>
<namePart type="family">Bassignana</namePart>
<role>
<roleTerm authority="marcrelator" type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Peter</namePart>
<namePart type="given">Brunsgaard</namePart>
<namePart type="family">Trolle</namePart>
<role>
<roleTerm authority="marcrelator" type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Rob</namePart>
<namePart type="family">van der Goot</namePart>
<role>
<roleTerm authority="marcrelator" type="text">author</roleTerm>
</role>
</name>
<originInfo>
<dateIssued>2025-05</dateIssued>
</originInfo>
<typeOfResource>text</typeOfResource>
<relatedItem type="host">
<titleInfo>
<title>Proceedings of the 3rd Workshop on Cross-Cultural Considerations in NLP (C3NLP 2025)</title>
</titleInfo>
<name type="personal">
<namePart type="given">Vinodkumar</namePart>
<namePart type="family">Prabhakaran</namePart>
<role>
<roleTerm authority="marcrelator" type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Sunipa</namePart>
<namePart type="family">Dev</namePart>
<role>
<roleTerm authority="marcrelator" type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Luciana</namePart>
<namePart type="family">Benotti</namePart>
<role>
<roleTerm authority="marcrelator" type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Daniel</namePart>
<namePart type="family">Hershcovich</namePart>
<role>
<roleTerm authority="marcrelator" type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Yong</namePart>
<namePart type="family">Cao</namePart>
<role>
<roleTerm authority="marcrelator" type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Li</namePart>
<namePart type="family">Zhou</namePart>
<role>
<roleTerm authority="marcrelator" type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Laura</namePart>
<namePart type="family">Cabello</namePart>
<role>
<roleTerm authority="marcrelator" type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Ife</namePart>
<namePart type="family">Adebara</namePart>
<role>
<roleTerm authority="marcrelator" type="text">editor</roleTerm>
</role>
</name>
<originInfo>
<publisher>Association for Computational Linguistics</publisher>
<place>
<placeTerm type="text">Albuquerque, New Mexico</placeTerm>
</place>
</originInfo>
<genre authority="marcgt">conference publication</genre>
<identifier type="isbn">979-8-89176-237-4</identifier>
</relatedItem>
<abstract>Large Language Models (LLMs) have seen widespread societal adoption. However, while they are able to interact with users in languages beyond English, they have been shown to lack cultural awareness, providing anglocentric or inappropriate responses for underrepresented language communities. To investigate this gap and disentangle linguistic versus cultural proficiency, we conduct the first cultural evaluation study for the mid-resource language of Danish, in which native speakers prompt different models to solve tasks requiring cultural awareness. Our analysis of the resulting 1,038 interactions from 63 demographically diverse participants highlights open challenges to cultural adaptation: Particularly, how currently employed automatically translated data are insufficient to train or measure cultural adaptation, and how training on native-speaker data can more than double response acceptance rates. We release our study data as DaKultur - the first native Danish cultural awareness dataset.</abstract>
<identifier type="citekey">muller-eberstein-etal-2025-dakultur</identifier>
<identifier type="doi">10.18653/v1/2025.c3nlp-1.5</identifier>
<location>
<url>https://aclanthology.org/2025.c3nlp-1.5/</url>
</location>
<part>
<date>2025-05</date>
<extent unit="page">
<start>50</start>
<end>58</end>
</extent>
</part>
</mods>
</modsCollection>
%0 Conference Proceedings
%T DaKultur: Evaluating the Cultural Awareness of Language Models for Danish with Native Speakers
%A Müller-Eberstein, Max
%A Zhang, Mike
%A Bassignana, Elisa
%A Trolle, Peter Brunsgaard
%A van der Goot, Rob
%Y Prabhakaran, Vinodkumar
%Y Dev, Sunipa
%Y Benotti, Luciana
%Y Hershcovich, Daniel
%Y Cao, Yong
%Y Zhou, Li
%Y Cabello, Laura
%Y Adebara, Ife
%S Proceedings of the 3rd Workshop on Cross-Cultural Considerations in NLP (C3NLP 2025)
%D 2025
%8 May
%I Association for Computational Linguistics
%C Albuquerque, New Mexico
%@ 979-8-89176-237-4
%F muller-eberstein-etal-2025-dakultur
%X Large Language Models (LLMs) have seen widespread societal adoption. However, while they are able to interact with users in languages beyond English, they have been shown to lack cultural awareness, providing anglocentric or inappropriate responses for underrepresented language communities. To investigate this gap and disentangle linguistic versus cultural proficiency, we conduct the first cultural evaluation study for the mid-resource language of Danish, in which native speakers prompt different models to solve tasks requiring cultural awareness. Our analysis of the resulting 1,038 interactions from 63 demographically diverse participants highlights open challenges to cultural adaptation: Particularly, how currently employed automatically translated data are insufficient to train or measure cultural adaptation, and how training on native-speaker data can more than double response acceptance rates. We release our study data as DaKultur - the first native Danish cultural awareness dataset.
%R 10.18653/v1/2025.c3nlp-1.5
%U https://aclanthology.org/2025.c3nlp-1.5/
%U https://doi.org/10.18653/v1/2025.c3nlp-1.5
%P 50-58
Markdown (Informal)
[DaKultur: Evaluating the Cultural Awareness of Language Models for Danish with Native Speakers](https://aclanthology.org/2025.c3nlp-1.5/) (Müller-Eberstein et al., C3NLP 2025)
ACL
Max Müller-Eberstein, Mike Zhang, Elisa Bassignana, Peter Brunsgaard Trolle, and Rob van der Goot. 2025. DaKultur: Evaluating the Cultural Awareness of Language Models for Danish with Native Speakers. In Proceedings of the 3rd Workshop on Cross-Cultural Considerations in NLP (C3NLP 2025), pages 50–58, Albuquerque, New Mexico. Association for Computational Linguistics.