Style as Signature: Profile-Based Authorship Verification of Mihai Eminescu’s Journalistic Corpus

Ioana-Roxana Boriceanu, Liviu Dinu


Abstract
Authorship verification aims to assess whether a questioned text is stylistically compatible with an author’s known writings, a task that is particularly challenging in historical corpora with partial ground truth. We address this problem in the context of Mihai Eminescu’s journalistic corpus, a historically grounded collection comprising published articles, manuscripts, and texts of uncertain authorship. Using a profile-based framework with character n-grams and function words, we examine how stylistic compatibility behaves across different profile construction settings and temporal splits. The results show that character trigram profiles consistently accept verified texts while producing a small and stable set of rejections among disputed items, whereas function word profiles show near complete acceptance across the corpus. A qualitative analysis shows that rejected texts exhibit meaningful differences in discourse structure and communicative purpose. These findings illustrate how authorship verification can support literary scholarship through stable signals for close reading.
Anthology ID:
2026.latechclfl-1.10
Volume:
Proceedings of the 10th Joint SIGHUM Workshop on Computational Linguistics for Cultural Heritage, Social Sciences, Humanities and Literature 2026
Month:
March
Year:
2026
Address:
Rabat, Morocco
Editors:
Diego Alves, Yuri Bizzoni, Stefania Degaetano-Ortlieb, Anna Kazantseva, Janis Pagel, Stan Szpakowicz
Venues:
LaTeCH-CLfL | WS
SIG:
SIGHUM
Publisher:
Association for Computational Linguistics
Note:
Pages:
102–110
Language:
URL:
https://aclanthology.org/2026.latechclfl-1.10/
DOI:
Bibkey:
Cite (ACL):
Ioana-Roxana Boriceanu and Liviu Dinu. 2026. Style as Signature: Profile-Based Authorship Verification of Mihai Eminescu’s Journalistic Corpus. In Proceedings of the 10th Joint SIGHUM Workshop on Computational Linguistics for Cultural Heritage, Social Sciences, Humanities and Literature 2026, pages 102–110, Rabat, Morocco. Association for Computational Linguistics.
Cite (Informal):
Style as Signature: Profile-Based Authorship Verification of Mihai Eminescu’s Journalistic Corpus (Boriceanu & Dinu, LaTeCH-CLfL 2026)
Copy Citation:
PDF:
https://aclanthology.org/2026.latechclfl-1.10.pdf
Supplementarymaterial:
 2026.latechclfl-1.10.SupplementaryMaterial.txt
Supplementarymaterial:
 2026.latechclfl-1.10.SupplementaryMaterial.zip