The Index Thomisticus Treebank as Linked Data in the LiLa Knowledge Base

Francesco Mambrini, Marco Passarotti, Giovanni Moretti, Matteo Pellegrini


Abstract
Although the Universal Dependencies initiative today allows for cross-linguistically consistent annotation of morphology and syntax in treebanks for several languages, syntactically annotated corpora are not yet interoperable with many lexical resources that describe properties of the words that occur therein. In order to cope with such limitation, we propose to adopt the principles of the Linguistic Linked Open Data community, to describe and publish dependency treebanks as LLOD. In particular, this paper illustrates the approach pursued in the LiLa Knowledge Base, which enables interoperability between corpora and lexical resources for Latin, to publish as Linguistic Linked Open Data the annotation layers of two versions of a Medieval Latin treebank (the Index Thomisticus Treebank).
Anthology ID:
2022.lrec-1.428
Volume:
Proceedings of the Thirteenth Language Resources and Evaluation Conference
Month:
June
Year:
2022
Address:
Marseille, France
Editors:
Nicoletta Calzolari, Frédéric Béchet, Philippe Blache, Khalid Choukri, Christopher Cieri, Thierry Declerck, Sara Goggi, Hitoshi Isahara, Bente Maegaard, Joseph Mariani, Hélène Mazo, Jan Odijk, Stelios Piperidis
Venue:
LREC
SIG:
Publisher:
European Language Resources Association
Note:
Pages:
4022–4029
Language:
URL:
https://aclanthology.org/2022.lrec-1.428
DOI:
Bibkey:
Cite (ACL):
Francesco Mambrini, Marco Passarotti, Giovanni Moretti, and Matteo Pellegrini. 2022. The Index Thomisticus Treebank as Linked Data in the LiLa Knowledge Base. In Proceedings of the Thirteenth Language Resources and Evaluation Conference, pages 4022–4029, Marseille, France. European Language Resources Association.
Cite (Informal):
The Index Thomisticus Treebank as Linked Data in the LiLa Knowledge Base (Mambrini et al., LREC 2022)
Copy Citation:
PDF:
https://aclanthology.org/2022.lrec-1.428.pdf