Evaluating a Universal Dependencies Conversion Pipeline for Icelandic

Þórunn Arnardóttir, Hinrik Hafsteinsson, Atli Jasonarson, Anton Ingason, Steinþór Steingrímsson


Abstract
We describe the evaluation and development of a rule-based treebank conversion tool, UDConverter, which converts treebanks from the constituency-based PPCHE annotation scheme to the dependency-based Universal Dependencies (UD) scheme. The tool has already been used in the production of three UD treebanks, although no formal evaluation of the tool has been carried out as of yet. By manually correcting new output files from the converter and comparing them to the raw output, we measured the labeled attachment score (LAS) and unlabeled attachment score (UAS) of the converted texts. We obtain an LAS of 82.87 and a UAS of 87.91. In comparison to other tools, UDConverter currently provides the best results in automatic UD treebank creation for Icelandic.
Anthology ID:
2023.nodalida-1.69
Volume:
Proceedings of the 24th Nordic Conference on Computational Linguistics (NoDaLiDa)
Month:
May
Year:
2023
Address:
Tórshavn, Faroe Islands
Editors:
Tanel Alumäe, Mark Fishel
Venue:
NoDaLiDa
SIG:
Publisher:
University of Tartu Library
Note:
Pages:
698–704
Language:
URL:
https://aclanthology.org/2023.nodalida-1.69
DOI:
Bibkey:
Cite (ACL):
Þórunn Arnardóttir, Hinrik Hafsteinsson, Atli Jasonarson, Anton Ingason, and Steinþór Steingrímsson. 2023. Evaluating a Universal Dependencies Conversion Pipeline for Icelandic. In Proceedings of the 24th Nordic Conference on Computational Linguistics (NoDaLiDa), pages 698–704, Tórshavn, Faroe Islands. University of Tartu Library.
Cite (Informal):
Evaluating a Universal Dependencies Conversion Pipeline for Icelandic (Arnardóttir et al., NoDaLiDa 2023)
Copy Citation:
PDF:
https://aclanthology.org/2023.nodalida-1.69.pdf