Is Human-Like Text Liked by Humans? Multilingual Human Detection and Preference Against AI

Yuxia Wang; Rui Xing; Jonibek Mansurov; Giovanni Puccetti; Zhuohan Xie; Minh Ngoc Ta; Jiahui Geng; Jinyan Su; Mervat Abassy; Saadeldine Eletter; Kareem Elozeiri; Nurkhan Laiyk; Maiya Goloburda; Tarek Mahmoud; Raj Vardhan Tomar; Alexander Aziz; Ryuto Koike; Masahiro Kaneko; Artem Shelmanov; Ekaterina Artemova; Vladislav Mikhailov; Akim Tsvigun; Alham Fikri Aji; Nizar Habash; Iryna Gurevych; Preslav Nakov

Is Human-Like Text Liked by Humans? Multilingual Human Detection and Preference Against AI

Yuxia Wang, Rui Xing, Jonibek Mansurov, Giovanni Puccetti, Zhuohan Xie, Minh Ngoc Ta, Jiahui Geng, Jinyan Su, Mervat Abassy, Saadeldine Eletter, Kareem Elozeiri, Nurkhan Laiyk, Maiya Goloburda, Tarek Mahmoud, Raj Vardhan Tomar, Alexander Aziz, Ryuto Koike, Masahiro Kaneko, Artem Shelmanov, Ekaterina Artemova, Vladislav Mikhailov, Akim Tsvigun, Alham Fikri Aji, Nizar Habash, Iryna Gurevych, Preslav Nakov

Abstract

Prior studies have shown that distinguishing text generated by Large Language Models (LLMs) from human-written one is highly challenging for humans, and often no better than random guessing. To verify the generalizability of this finding across languages and domains, we perform an extensive case study to identify the upper bound of human detection accuracy. Across 16 datasets covering 9 languages and 9 domains, 19 annotators achieved an average detection accuracy of 87.6%, thus challenging previous conclusions. We find that major gaps between human and machine text lie in concreteness, cultural nuances, and diversity. Prompting by explicitly explaining the distinctions in the prompts can partially bridge the gaps in over 50% of the cases. However, we also find that humans do not always prefer human-written text, particularly when they cannot clearly identify its source. We release our dataset, the human labels, and the annotator metadata at https://github.com/xnlp-lab/HumanEval-MGT.

Anthology ID:: 2026.acl-long.639
Volume:: Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Month:: July
Year:: 2026
Address:: San Diego, California, United States
Editors:: Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:: ACL
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 14043–14076
Language:
URL:: https://aclanthology.org/2026.acl-long.639/
DOI:
Bibkey:
Cite (ACL):: Yuxia Wang, Rui Xing, Jonibek Mansurov, Giovanni Puccetti, Zhuohan Xie, Minh Ngoc Ta, Jiahui Geng, Jinyan Su, Mervat Abassy, Saadeldine Eletter, Kareem Elozeiri, Nurkhan Laiyk, Maiya Goloburda, Tarek Mahmoud, Raj Vardhan Tomar, Alexander Aziz, Ryuto Koike, Masahiro Kaneko, Artem Shelmanov, Ekaterina Artemova, Vladislav Mikhailov, Akim Tsvigun, Alham Fikri Aji, Nizar Habash, Iryna Gurevych, and Preslav Nakov. 2026. Is Human-Like Text Liked by Humans? Multilingual Human Detection and Preference Against AI. In Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 14043–14076, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):: Is Human-Like Text Liked by Humans? Multilingual Human Detection and Preference Against AI (Wang et al., ACL 2026)
Copy Citation:
PDF:: https://aclanthology.org/2026.acl-long.639.pdf
Checklist:: 2026.acl-long.639.checklist.pdf

PDF Cite Search Checklist Fix data