Exploring the Limitations of Detecting Machine-Generated Text

Jad Doughman; Osama Mohammed Afzal; Hawau Olamide Toyin; Shady Shehata; Preslav Nakov; Zeerak Talat

Exploring the Limitations of Detecting Machine-Generated Text

Jad Doughman, Osama Mohammed Afzal, Hawau Olamide Toyin, Shady Shehata, Preslav Nakov, Zeerak Talat

Abstract

Recent improvements in the quality of the generations by large language models have spurred research into identifying machine-generated text. Such work often presents high-performing detectors. However, humans and machines can produce text in different styles and domains, yet the the performance impact of such on machine generated text detection systems remains unclear. In this paper, we audit the classification performance for detecting machine-generated text by evaluating on texts with varying writing styles. We find that classifiers are highly sensitive to stylistic changes and differences in text complexity, and in some cases degrade entirely to random classifiers. We further find that detection systems are particularly susceptible to misclassify easy-to-read texts while they have high performance for complex texts, leading to concerns about the reliability of detection systems. We recommend that future work attends to stylistic factors and reading difficulty levels of human-written and machine-generated text.

Anthology ID:: 2025.coling-main.288
Volume:: Proceedings of the 31st International Conference on Computational Linguistics
Month:: January
Year:: 2025
Address:: Abu Dhabi, UAE
Editors:: Owen Rambow, Leo Wanner, Marianna Apidianaki, Hend Al-Khalifa, Barbara Di Eugenio, Steven Schockaert
Venue:: COLING
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 4274–4281
Language:
URL:: https://aclanthology.org/2025.coling-main.288/
DOI:
Bibkey:
Cite (ACL):: Jad Doughman, Osama Mohammed Afzal, Hawau Olamide Toyin, Shady Shehata, Preslav Nakov, and Zeerak Talat. 2025. Exploring the Limitations of Detecting Machine-Generated Text. In Proceedings of the 31st International Conference on Computational Linguistics, pages 4274–4281, Abu Dhabi, UAE. Association for Computational Linguistics.
Cite (Informal):: Exploring the Limitations of Detecting Machine-Generated Text (Doughman et al., COLING 2025)
Copy Citation:
PDF:: https://aclanthology.org/2025.coling-main.288.pdf

PDF Cite Search Fix data