Tamar Jalaghonia


2024

Building a Universal Dependencies Treebank for Georgian
Irina Lobzhanidze | Erekle Magradze | Svetlana Berikashvili | Anzor Gozalishvili | Tamar Jalaghonia
Proceedings of the 22nd Workshop on Treebanks and Linguistic Theories (TLT 2024)

This paper presents the design and development of the Georgian Syntactic Treebank within the Universal Dependencies (UD) framework, addressing the unique morphosyntactic challenges of Georgian, a Kartvelian language. We describe the methodology for selecting and annotating 3,013 sentences from Wiki, mapping existing tagsets to the UD scheme, and converting data into the CoNLL-U format. The paper also details the training of a UDPipe model using this preliminary treebank.
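
For readers unfamiliar with the CoNLL-U format mentioned in the abstract, the following is a minimal sketch of how an annotated sentence might be serialized into its ten tab-separated columns (ID, FORM, LEMMA, UPOS, XPOS, FEATS, HEAD, DEPREL, DEPS, MISC). The `Token` class, the `to_conllu` helper, the sentence ID, and the English toy sentence are all assumptions made for illustration; none of them come from the paper or from the Georgian treebank.

```python
from dataclasses import dataclass

# Column order defined by the CoNLL-U format.
FIELDS = ("id", "form", "lemma", "upos", "xpos",
          "feats", "head", "deprel", "deps", "misc")


@dataclass
class Token:
    """Hypothetical container for one syntactic word (not the authors' code)."""
    id: int
    form: str
    lemma: str
    upos: str
    head: int          # 0 marks the root of the sentence
    deprel: str
    xpos: str = "_"    # "_" is the CoNLL-U placeholder for an unspecified value
    feats: str = "_"
    deps: str = "_"
    misc: str = "_"


def to_conllu(sent_id: str, text: str, tokens: list[Token]) -> str:
    """Serialize one sentence: comment lines, token lines, then a blank line."""
    lines = [f"# sent_id = {sent_id}", f"# text = {text}"]
    for tok in tokens:
        lines.append("\t".join(str(getattr(tok, name)) for name in FIELDS))
    return "\n".join(lines) + "\n\n"


# Toy English example, used only to show the layout.
tokens = [
    Token(id=1, form="Dogs", lemma="dog", upos="NOUN",
          head=2, deprel="nsubj", feats="Number=Plur"),
    Token(id=2, form="bark", lemma="bark", upos="VERB",
          head=0, deprel="root"),
]
out = to_conllu("toy-1", "Dogs bark", tokens)
print(out, end="")
```

A file in this layout is the kind of input that UD tooling such as UDPipe is trained on; mapping an existing tagset to UD would amount to filling the UPOS and FEATS columns from the source annotation.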