A Survey on LLM-Generated Text Detection: Necessity, Methods, and Future Directions

Junchao Wu; Shu Yang; Runzhe Zhan (詹润哲); Yulin Yuan (袁毓林); Lidia Sam Chao; Derek Fai Wong

doi:10.1162/coli_a_00549

A Survey on LLM-Generated Text Detection: Necessity, Methods, and Future Directions

Junchao Wu, Shu Yang, Runzhe Zhan, Yulin Yuan, Lidia Sam Chao, Derek Fai Wong

Abstract

The remarkable ability of large language models (LLMs) to comprehend, interpret, and generate complex language has rapidly integrated LLM-generated text into various aspects of daily life, where users increasingly accept it. However, the growing reliance on LLMs underscores the urgent need for effective detection mechanisms to identify LLM-generated text. Such mechanisms are critical to mitigating misuse and safeguarding domains like artistic expression and social networks from potential negative consequences. LLM-generated text detection, conceptualized as a binary classification task, seeks to determine whether an LLM produced a given text. Recent advances in this field stem from innovations in watermarking techniques, statistics-based detectors, and neural-based detectors. Human-assisted methods also play a crucial role. In this survey, we consolidate recent research breakthroughs in this field, emphasizing the urgent need to strengthen detector research. Additionally, we review existing datasets, highlighting their limitations and developmental requirements. Furthermore, we examine various LLM-generated text detection paradigms, shedding light on challenges like out-of-distribution problems, potential attacks, real-world data issues, and ineffective evaluation frameworks. Finally, we outline intriguing directions for future research in LLM-generated text detection to advance responsible artificial intelligence. This survey aims to provide a clear and comprehensive introduction for newcomers while offering seasoned researchers valuable updates in the field.1

Anthology ID:: 2025.cl-1.8
Volume:: Computational Linguistics, Volume 51, Issue 1 - March 2025
Month:: March
Year:: 2025
Address:: Cambridge, MA
Venue:: CL
SIG:
Publisher:: MIT Press
Note:
Pages:: 275–338
Language:
URL:: https://aclanthology.org/2025.cl-1.8/
DOI:: 10.1162/coli_a_00549
Bibkey:
Cite (ACL):: Junchao Wu, Shu Yang, Runzhe Zhan, Yulin Yuan, Lidia Sam Chao, and Derek Fai Wong. 2025. A Survey on LLM-Generated Text Detection: Necessity, Methods, and Future Directions. Computational Linguistics, 51(1):275–338.
Cite (Informal):: A Survey on LLM-Generated Text Detection: Necessity, Methods, and Future Directions (Wu et al., CL 2025)
Copy Citation:
PDF:: https://aclanthology.org/2025.cl-1.8.pdf

PDF Cite Search Fix data