Beyond Monolithic Culture: Evaluating Understandability of Online Text Across Cultural Dimensions

Saurabh Kumar Pandey; Harshit Gupta; Sougata Saha; Monojit Choudhury

Beyond Monolithic Culture: Evaluating Understandability of Online Text Across Cultural Dimensions

Saurabh Kumar Pandey, Harshit Gupta, Sougata Saha, Monojit Choudhury

Abstract

Culture shapes how people interpret language, especially in online reviews containing culture-specific items (CSIs). Yet, most existing evaluations treat culture as a monolithic construct, offering no insight into which cultural dimensions pose difficulty for readers, or how large language models (LLMs), which power AI reading assistants, perform across them. This gap limits our ability to obtain reliable, cross-cultural estimates of model performance. To address this, we analyze CSIs in English Goodreads reviews across Newmark’s cultural dimensions (e.g., material, ecology, customs, habits, social) and evaluate six LLMs of varying sizes on their ability to identify CSIs within each dimension. We find that readers struggle most with CSIs from the material, customs, and social dimensions, while models underperform on more localized ones (e.g., habits), revealing systematic cultural blind spots. To support further research on culturally representative benchmarking, we release an expert-annotated dataset of CSIs labeled by cultural dimension. Empirical analysis shows our dataset as more challenging and of higher quality than existing cultural benchmarks, enabling finer-grained evaluation of cultural understanding in models.

Anthology ID:: 2026.c3nlp-1.16
Volume:: Proceedings of the 4th Workshop on Cross-Cultural Considerations in NLP (C3NLP 2026)
Month:: July
Year:: 2026
Address:: San Diego, California, United States
Editors:: Vinodkumar Prabhakaran, Sunipa Dev, Luciana Benotti, Daniel Hershcovich, Yong Cao, Li Zhou, BOlei Ma, Ife Adebara
Venues:: C3NLP | WS
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 204–220
Language:
URL:: https://aclanthology.org/2026.c3nlp-1.16/
DOI:
Bibkey:
Cite (ACL):: Saurabh Kumar Pandey, Harshit Gupta, Sougata Saha, and Monojit Choudhury. 2026. Beyond Monolithic Culture: Evaluating Understandability of Online Text Across Cultural Dimensions. In Proceedings of the 4th Workshop on Cross-Cultural Considerations in NLP (C3NLP 2026), pages 204–220, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):: Beyond Monolithic Culture: Evaluating Understandability of Online Text Across Cultural Dimensions (Pandey et al., C3NLP 2026)
Copy Citation:
PDF:: https://aclanthology.org/2026.c3nlp-1.16.pdf

PDF Cite Search Fix data