A Universal Dependencies Treebank of Ancient Hebrew

Daniel Swanson, Francis Tyers


Abstract
In this paper we present the initial construction of a Universal Dependencies treebank with morphological annotations of Ancient Hebrew containing portions of the Hebrew Scriptures (1579 sentences, 27K tokens) for use in comparative study with ancient translations and for analysis of the development of Hebrew syntax. We construct this treebank by applying a rule-based parser (300 rules) to an existing morphologically-annotated corpus with minimal constituency structure and manually verifying the output and present the results of this semi-automated annotation process and some of the annotation decisions made in the process of applying the UD guidelines to a new language.
Anthology ID:
2022.lrec-1.252
Volume:
Proceedings of the Thirteenth Language Resources and Evaluation Conference
Month:
June
Year:
2022
Address:
Marseille, France
Editors:
Nicoletta Calzolari, Frédéric Béchet, Philippe Blache, Khalid Choukri, Christopher Cieri, Thierry Declerck, Sara Goggi, Hitoshi Isahara, Bente Maegaard, Joseph Mariani, Hélène Mazo, Jan Odijk, Stelios Piperidis
Venue:
LREC
SIG:
Publisher:
European Language Resources Association
Note:
Pages:
2353–2361
Language:
URL:
https://aclanthology.org/2022.lrec-1.252
DOI:
Bibkey:
Cite (ACL):
Daniel Swanson and Francis Tyers. 2022. A Universal Dependencies Treebank of Ancient Hebrew. In Proceedings of the Thirteenth Language Resources and Evaluation Conference, pages 2353–2361, Marseille, France. European Language Resources Association.
Cite (Informal):
A Universal Dependencies Treebank of Ancient Hebrew (Swanson & Tyers, LREC 2022)
Copy Citation:
PDF:
https://aclanthology.org/2022.lrec-1.252.pdf