Versus: an automatic text comparison tool for the digital humanities

Motasem Alrahabi, Tom Wainstain


Abstract
Digital humanities (DH) research has explored large-scale textual reuse (quotation, allusion, paraphrase, translation, rephrasing) for several decades. Automatic comparison, made possible by the increasing digitization of corpora, opens new perspectives in philology and intertextual studies. This article reviews the state of the art in existing methods (formal, vector-based, statistical, graph-based) and introduces an open-source tool, Versus, which combines multigranular vector alignment, interactive visualization, and critical traceability. This framework aims to provide a reproducible and accessible solution for DH researchers, with support for text comparison in multiple languages.
Anthology ID:
2025.lm4dh-1.3
Volume:
Proceedings of the First Workshop on Natural Language Processing and Language Models for Digital Humanities
Month:
September
Year:
2025
Address:
Varna, Bulgaria
Editors:
Isuri Nanomi Arachchige, Francesca Frontini, Ruslan Mitkov, Paul Rayson
Venues:
LM4DH | WS
Publisher:
INCOMA Ltd., Shoumen, Bulgaria
Pages:
32–37
URL:
https://aclanthology.org/2025.lm4dh-1.3/
Cite (ACL):
Motasem Alrahabi and Tom Wainstain. 2025. Versus: an automatic text comparison tool for the digital humanities. In Proceedings of the First Workshop on Natural Language Processing and Language Models for Digital Humanities, pages 32–37, Varna, Bulgaria. INCOMA Ltd., Shoumen, Bulgaria.
Cite (Informal):
Versus: an automatic text comparison tool for the digital humanities (Alrahabi & Wainstain, LM4DH 2025)
PDF:
https://aclanthology.org/2025.lm4dh-1.3.pdf
Optional supplementary material:
2025.lm4dh-1.3.OptionalSupplementaryMaterial.zip