Universal Dependencies for Albanian

Marsida Toska, Joakim Nivre, Daniel Zeman


Abstract
In this paper, we introduce the first Universal Dependencies (UD) treebank for standard Albanian, consisting of 60 sentences collected from the Albanian Wikipedia, annotated with lemmas, universal part-of-speech tags, morphological features and syntactic dependencies. In addition to presenting the treebank itself, we discuss a selection of linguistic constructions in Albanian whose analysis in UD is not self-evident, including core arguments and the status of indirect objects, pronominal clitics, genitive constructions, prearticulated adjectives, and modal verbs.
Anthology ID:
2020.udw-1.20
Volume:
Proceedings of the Fourth Workshop on Universal Dependencies (UDW 2020)
Month:
December
Year:
2020
Address:
Barcelona, Spain (Online)
Venues:
COLING | UDW
Publisher:
Association for Computational Linguistics
Pages:
178–188
URL:
https://aclanthology.org/2020.udw-1.20
Cite (ACL):
Marsida Toska, Joakim Nivre, and Daniel Zeman. 2020. Universal Dependencies for Albanian. In Proceedings of the Fourth Workshop on Universal Dependencies (UDW 2020), pages 178–188, Barcelona, Spain (Online). Association for Computational Linguistics.
Cite (Informal):
Universal Dependencies for Albanian (Toska et al., UDW 2020)
PDF:
https://aclanthology.org/2020.udw-1.20.pdf
Data
Universal Dependencies