Tudor Voicu
2024
Function Multiword Expressions Annotated with Discourse Relations in the Romanian Reference Treebank
Verginica Barbu Mititelu | Tudor Voicu
Proceedings of the Sixth International Conference on Computational Linguistics in Bulgaria (CLIB 2024)
We present the annotation of multiword conjunctions, a class of function words, in the Romanian Reference Treebank, a general-language corpus that covers several genres and is annotated according to the principles of Universal Dependencies. The conjunctions are labeled with discourse relations from the Penn Discourse Treebank version 3.0 inventory. The annotation was manual, with two annotators for each occurrence of a conjunction. Lexical-semantic relations such as synonymy and polysemy can be established between the senses of these conjunctions. The discourse relations are added to the CoNLL-U file in which the treebank is represented.
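As a rough illustration of how such annotations might be read back from a CoNLL-U file, the sketch below collects multiword conjunctions together with their discourse relation. The abstract does not say where in the file the relation is stored, so the sketch assumes it is in the MISC column (the tenth field) under a hypothetical key `DiscRel`, placed on the first word of the expression, with the remaining words attached through the UD `fixed` relation. The sample sentence fragment and the function names are invented for illustration and are not taken from the treebank.

```python
# Toy CoNLL-U fragment, invented for illustration (not from the treebank).
# Assumption: the discourse relation is stored in MISC under the
# hypothetical key "DiscRel" on the first word of the conjunction.
SAMPLE = "\n".join([
    "# text = în timp ce",
    "\t".join(["1", "în", "în", "ADP", "_", "_", "0", "root", "_",
               "DiscRel=Temporal.Synchronous"]),
    "\t".join(["2", "timp", "timp", "NOUN", "_", "_", "1", "fixed", "_", "_"]),
    "\t".join(["3", "ce", "ce", "PRON", "_", "_", "1", "fixed", "_", "_"]),
])


def parse_misc(field):
    """Turn a MISC field such as 'A=1|B=2' into a dict ('_' means empty)."""
    if field == "_":
        return {}
    return dict(item.split("=", 1) for item in field.split("|") if "=" in item)


def _collect(tokens, key):
    """Find annotated heads in one sentence and attach their 'fixed' dependents."""
    found = []
    for cols in tokens:
        misc = parse_misc(cols[9])
        if key in misc:
            head_id = cols[0]
            words = [cols[1]] + [
                t[1] for t in tokens if t[6] == head_id and t[7] == "fixed"
            ]
            found.append((" ".join(words), misc[key]))
    return found


def discourse_conjunctions(conllu_text, key="DiscRel"):
    """Return (expression, relation) pairs for every annotated conjunction."""
    results, tokens = [], []
    # The trailing empty string flushes the last sentence.
    for line in conllu_text.splitlines() + [""]:
        if line.startswith("#"):
            continue  # sentence-level comment
        if not line.strip():
            results.extend(_collect(tokens, key))
            tokens = []
            continue
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue  # skip multiword-token ranges and empty nodes
        tokens.append(cols)
    return results


print(discourse_conjunctions(SAMPLE))
```

If the treebank stores the relation under a different key or in a different column, only the `key` argument and the column index in `_collect` would need to change.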