Stylometric Approach to AI-generated Texts. An Analysis of Contemporary French-Language Literature

Adam Pawłowski; Tomasz Walkowiak

Stylometric Approach to AI-generated Texts. An Analysis of Contemporary French-Language Literature

Abstract

The article focuses on a stylometric analysis of authentic literary texts and thematically related texts generated by large language models. The texts under study represent a fairly broad cross-section of twentieth-century French literature. Five models were used to generate the texts (ChatGPT 4-o, GPT 4-o mini, DeepSeek v.3, c4ai-command-r-plus, and c4ai-command-a). The original human-written stories of approximately 20,000 characters were summarized, and new narratives were then generated on the basis of these abstracts. In terms of plot and style, they were intended to resemble the originals. The research carried out with TF-IDF of the most frequent words showed that texts generated by specific LLMs and written by humans cluster relatively well as distinct groups. The experiments also showed that the "authorial" specificity of machine-generated texts partly matches the original clustering of human-written source texts.

Anthology ID:: 2026.latechclfl-1.21
Volume:: Proceedings of the 10th Joint SIGHUM Workshop on Computational Linguistics for Cultural Heritage, Social Sciences, Humanities and Literature 2026
Month:: March
Year:: 2026
Address:: Rabat, Morocco
Editors:: Diego Alves, Yuri Bizzoni, Stefania Degaetano-Ortlieb, Anna Kazantseva, Janis Pagel, Stan Szpakowicz
Venues:: LaTeCH-CLfL | WS
SIG:: SIGHUM
Publisher:: Association for Computational Linguistics
Note:
Pages:: 221–226
Language:
URL:: https://aclanthology.org/2026.latechclfl-1.21/
DOI:
Bibkey:
Cite (ACL):: Adam Pawłowski and Tomasz Walkowiak. 2026. Stylometric Approach to AI-generated Texts. An Analysis of Contemporary French-Language Literature. In Proceedings of the 10th Joint SIGHUM Workshop on Computational Linguistics for Cultural Heritage, Social Sciences, Humanities and Literature 2026, pages 221–226, Rabat, Morocco. Association for Computational Linguistics.
Cite (Informal):: Stylometric Approach to AI-generated Texts. An Analysis of Contemporary French-Language Literature (Pawłowski & Walkowiak, LaTeCH-CLfL 2026)
Copy Citation:
PDF:: https://aclanthology.org/2026.latechclfl-1.21.pdf
Supplementarymaterial:: 2026.latechclfl-1.21.SupplementaryMaterial.txt
Supplementarymaterial:: 2026.latechclfl-1.21.SupplementaryMaterial.zip

PDF Cite Search Supplementarymaterial Supplementarymaterial Fix data