The Rise and Fall of Dependency Parsing in Dante Alighieri’s Divine Comedy

Claudia Corbetta, Marco Passarotti, Giovanni Moretti


Abstract
In this paper, we conduct parsing experiments on Dante Alighieri’s Divine Comedy, an Old Italian poem composed between 1306-1321 and organized into three Cantiche —Inferno, Purgatorio, and Paradiso. We perform parsing on subsets of the poem using both a Modern Italian training set and sections of the Divine Comedy itself to evaluate under which scenarios parsers achieve higher scores. We find that employing in-domain training data supports better results, leading to an increase of approximately +17% in Unlabeled Attachment Score (UAS) and +25-30% in Labeled Attachment Score (LAS). Subsequently, we provide brief commentary on the differences in scores achieved among subsections of Cantiche, and we conduct experimental parsing on a text from the same period and style as the Divine Comedy.
Anthology ID:
2024.lt4hala-1.7
Volume:
Proceedings of the Third Workshop on Language Technologies for Historical and Ancient Languages (LT4HALA) @ LREC-COLING-2024
Month:
May
Year:
2024
Address:
Torino, Italia
Editors:
Rachele Sprugnoli, Marco Passarotti
Venues:
LT4HALA | WS
SIG:
Publisher:
ELRA and ICCL
Note:
Pages:
50–56
Language:
URL:
https://aclanthology.org/2024.lt4hala-1.7
DOI:
Bibkey:
Cite (ACL):
Claudia Corbetta, Marco Passarotti, and Giovanni Moretti. 2024. The Rise and Fall of Dependency Parsing in Dante Alighieri’s Divine Comedy. In Proceedings of the Third Workshop on Language Technologies for Historical and Ancient Languages (LT4HALA) @ LREC-COLING-2024, pages 50–56, Torino, Italia. ELRA and ICCL.
Cite (Informal):
The Rise and Fall of Dependency Parsing in Dante Alighieri’s Divine Comedy (Corbetta et al., LT4HALA-WS 2024)
Copy Citation:
PDF:
https://aclanthology.org/2024.lt4hala-1.7.pdf