From YCOE to UD: Rule-based Root Identification in Old English

Luca Brigada Villa, Martina Giarda


Abstract
In this paper we apply a set of rules to identify the root of a dependency tree, following the Universal Dependencies formalism and starting from the constituency annotation of the York-Toronto-Helsinki Parsed Corpus of Old English Prose (YCOE). This rule-based root-identification task represents the first step towards a rule-based automatic conversion of this valuable resource into the UD format. After presenting Old English and the annotated resources available for this language, we describe the different rules we applied and then we discuss the results and the errors.
Anthology ID:
2024.lt4hala-1.3
Volume:
Proceedings of the Third Workshop on Language Technologies for Historical and Ancient Languages (LT4HALA) @ LREC-COLING-2024
Month:
May
Year:
2024
Address:
Torino, Italia
Editors:
Rachele Sprugnoli, Marco Passarotti
Venues:
LT4HALA | WS
SIG:
Publisher:
ELRA and ICCL
Note:
Pages:
22–29
Language:
URL:
https://aclanthology.org/2024.lt4hala-1.3
DOI:
Bibkey:
Cite (ACL):
Luca Brigada Villa and Martina Giarda. 2024. From YCOE to UD: Rule-based Root Identification in Old English. In Proceedings of the Third Workshop on Language Technologies for Historical and Ancient Languages (LT4HALA) @ LREC-COLING-2024, pages 22–29, Torino, Italia. ELRA and ICCL.
Cite (Informal):
From YCOE to UD: Rule-based Root Identification in Old English (Brigada Villa & Giarda, LT4HALA-WS 2024)
Copy Citation:
PDF:
https://aclanthology.org/2024.lt4hala-1.3.pdf