SoS: Analysis of Surface over Semantics in Multilingual Text-To-Image Generation

Carolin Holtermann; Florian Schneider; Anne Lauscher

Abstract

Text-to-image (T2I) models are increasingly employed by users worldwide. However, prior research has pointed to the high sensitivity of T2I models to particular input languages: when faced with languages other than English (i.e., different surface forms of the same prompt), T2I models often produce culturally stereotypical depictions, prioritizing the surface over the prompt’s semantics. Yet a comprehensive analysis of this behavior, which we dub Surface-over-Semantics (SoS), is missing. We present the first analysis of T2I models’ SoS tendencies. To this end, we create a set of prompts covering 171 cultural identities, translated into 14 languages, and use it to prompt seven T2I models. To quantify SoS tendencies across models, languages, and cultures, we introduce a novel measure and analyze how the tendencies we identify manifest visually. We show that all but one model exhibit a strong surface-level tendency in at least two languages, with this effect intensifying across the layers of T2I text encoders. Moreover, these surface tendencies frequently correlate with stereotypical visual depictions.

Anthology ID: 2026.eacl-long.185
Volume: Proceedings of the 19th Conference of the European Chapter of the Association for Computational Linguistics (Volume 1: Long Papers)
Month: March
Year: 2026
Address: Rabat, Morocco
Editors: Vera Demberg, Kentaro Inui, Lluís Marquez
Venue: EACL
Publisher: Association for Computational Linguistics
Pages: 3955–3995
URL: https://aclanthology.org/2026.eacl-long.185/
Cite (ACL): Carolin Holtermann, Florian Schneider, and Anne Lauscher. 2026. SoS: Analysis of Surface over Semantics in Multilingual Text-To-Image Generation. In Proceedings of the 19th Conference of the European Chapter of the Association for Computational Linguistics (Volume 1: Long Papers), pages 3955–3995, Rabat, Morocco. Association for Computational Linguistics.
Cite (Informal): SoS: Analysis of Surface over Semantics in Multilingual Text-To-Image Generation (Holtermann et al., EACL 2026)
PDF: https://aclanthology.org/2026.eacl-long.185.pdf
Checklist: 2026.eacl-long.185.checklist.pdf
