(Dis)improved?! How Simplified Language Affects Large Language Model Performance across Languages

Miriam Anschütz; Anastasiya Damaratskaya; Chaeeun Joy Lee; Arthur Schmalz; Edoardo Mosca; Georg Groh

(Dis)improved?! How Simplified Language Affects Large Language Model Performance across Languages

Miriam Anschütz, Anastasiya Damaratskaya, Chaeeun Joy Lee, Arthur Schmalz, Edoardo Mosca, Georg Groh

Abstract

Simplified language enhances the accessibility and human understanding of texts. However, whether it also benefits large language models (LLMs) remains underexplored. This paper extensively studies whether LLM performance improves on simplified data compared to its original counterpart. Our experiments span six datasets and nine automatic simplification systems across three languages. We show that English models, including GPT-4o-mini, show a weak generalization and exhibit a significant performance drop on simplified data. This introduces an intriguing paradox: simplified data is helpful for humans but not for LLMs. At the same time, the performance in non-English languages sometimes improves, depending on the task and quality of the simplifier. Our findings offer a comprehensive view of the impact of simplified language on LLM performance and uncover severe implications for people depending on simple language.

Anthology ID:: 2025.gem-1.70
Volume:: Proceedings of the Fourth Workshop on Generation, Evaluation and Metrics (GEM²)
Month:: July
Year:: 2025
Address:: Vienna, Austria and virtual meeting
Editors:: Ofir Arviv, Miruna Clinciu, Kaustubh Dhole, Rotem Dror, Sebastian Gehrmann, Eliya Habba, Itay Itzhak, Simon Mille, Yotam Perlitz, Enrico Santus, João Sedoc, Michal Shmueli Scheuer, Gabriel Stanovsky, Oyvind Tafjord
Venues:: GEM | WS
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 847–861
Language:
URL:: https://aclanthology.org/2025.gem-1.70/
DOI:
Bibkey:
Cite (ACL):: Miriam Anschütz, Anastasiya Damaratskaya, Chaeeun Joy Lee, Arthur Schmalz, Edoardo Mosca, and Georg Groh. 2025. (Dis)improved?! How Simplified Language Affects Large Language Model Performance across Languages. In Proceedings of the Fourth Workshop on Generation, Evaluation and Metrics (GEM²), pages 847–861, Vienna, Austria and virtual meeting. Association for Computational Linguistics.
Cite (Informal):: (Dis)improved?! How Simplified Language Affects Large Language Model Performance across Languages (Anschütz et al., GEM 2025)
Copy Citation:
PDF:: https://aclanthology.org/2025.gem-1.70.pdf

PDF Cite Search Fix data