On The Origin of Cultural Biases in Language Models: From Pre-training Data to Linguistic Phenomena

Tarek Naous; Wei Xu

doi:10.18653/v1/2025.naacl-long.326

On The Origin of Cultural Biases in Language Models: From Pre-training Data to Linguistic Phenomena

Abstract

Language Models (LMs) have been shown to exhibit a strong preference towards entities associated with Western culture when operating in non-Western languages. In this paper, we aim to uncover the origins of entity-related cultural biases in LMs by analyzing several contributing factors, including the representation of entities in pre-training data and the impact of variations in linguistic phenomena across languages. We introduce CAMeL-2, a parallel Arabic-English benchmark of 58,086 entities associated with Arab and Western cultures and 367 masked natural contexts for entities. Our evaluations using CAMeL-2 reveal reduced performance gaps between cultures by LMs when tested in English compared to Arabic. We find that LMs struggle in Arabic with entities that appear at high frequencies in pre-training, where entities can hold multiple word senses. This also extends to entities that exhibit high lexical overlap with languages that are not Arabic but use the Arabic script. Further, we show how frequency-based tokenization leads to this issue in LMs, which gets worse with larger Arabic vocabularies. We will make CAMeL-2 available at: https://github.com/tareknaous/camel2

Anthology ID:: 2025.naacl-long.326
Volume:: Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers)
Month:: April
Year:: 2025
Address:: Albuquerque, New Mexico
Editors:: Luis Chiruzzo, Alan Ritter, Lu Wang
Venue:: NAACL
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 6423–6443
Language:
URL:: https://aclanthology.org/2025.naacl-long.326/
DOI:: 10.18653/v1/2025.naacl-long.326
Bibkey:
Cite (ACL):: Tarek Naous and Wei Xu. 2025. On The Origin of Cultural Biases in Language Models: From Pre-training Data to Linguistic Phenomena. In Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), pages 6423–6443, Albuquerque, New Mexico. Association for Computational Linguistics.
Cite (Informal):: On The Origin of Cultural Biases in Language Models: From Pre-training Data to Linguistic Phenomena (Naous & Xu, NAACL 2025)
Copy Citation:
PDF:: https://aclanthology.org/2025.naacl-long.326.pdf

PDF Cite Search Fix data