Sense-Annotated Corpus for Russian

Alexander Kirillovich, Natalia Loukachevitch, Maksim Kulaev, Angelina Bolshina, Dmitry Ilvovsky


Abstract
We present a sense-annotated corpus for Russian. The resource was obtained my manually annotating texts from the OpenCorpora corpus, an open corpus for the Russian language, by senses of Russian wordnet RuWordNet. The annotation was used as a test collection for comparing unsupervised (Personalized Pagerank) and pseudo-labeling methods for Russian word sense disambiguation.
Anthology ID:
2022.clib-1.15
Volume:
Proceedings of the 5th International Conference on Computational Linguistics in Bulgaria (CLIB 2022)
Month:
September
Year:
2022
Address:
Sofia, Bulgaria
Venue:
CLIB
SIG:
Publisher:
Department of Computational Linguistics, IBL -- BAS
Note:
Pages:
130–136
Language:
URL:
https://aclanthology.org/2022.clib-1.15
DOI:
Bibkey:
Cite (ACL):
Alexander Kirillovich, Natalia Loukachevitch, Maksim Kulaev, Angelina Bolshina, and Dmitry Ilvovsky. 2022. Sense-Annotated Corpus for Russian. In Proceedings of the 5th International Conference on Computational Linguistics in Bulgaria (CLIB 2022), pages 130–136, Sofia, Bulgaria. Department of Computational Linguistics, IBL -- BAS.
Cite (Informal):
Sense-Annotated Corpus for Russian (Kirillovich et al., CLIB 2022)
Copy Citation:
PDF:
https://aclanthology.org/2022.clib-1.15.pdf