UZWORDNET: A Lexical-Semantic Database for the Uzbek Language

Alessandro Agostini, Timur Usmanov, Ulugbek Khamdamov, Nilufar Abdurakhmonova, Mukhammadsaid Mamasaidov


Abstract
The results reported in this paper aim to increase the presence of the Uzbek language in the Internet and its usability within IT applications. We describe the initial development of a “word-net” for the Uzbek language compatible to Princeton WordNet. We called it UZWORDNET. In the current version, UZWORDNET contains 28140 synsets, 64389 sense and 20683 words; its estimated accuracy is 75.98%. To the best of our knowledge, it is the largest wordnet for Uzbek existing to date, and the second wordnet developed overall.
Anthology ID:
2021.gwc-1.2
Volume:
Proceedings of the 11th Global Wordnet Conference
Month:
January
Year:
2021
Address:
University of South Africa (UNISA)
Editors:
Piek Vossen, Christiane Fellbaum
Venue:
GWC
SIG:
SIGLEX
Publisher:
Global Wordnet Association
Note:
Pages:
8–19
Language:
URL:
https://aclanthology.org/2021.gwc-1.2
DOI:
Bibkey:
Cite (ACL):
Alessandro Agostini, Timur Usmanov, Ulugbek Khamdamov, Nilufar Abdurakhmonova, and Mukhammadsaid Mamasaidov. 2021. UZWORDNET: A Lexical-Semantic Database for the Uzbek Language. In Proceedings of the 11th Global Wordnet Conference, pages 8–19, University of South Africa (UNISA). Global Wordnet Association.
Cite (Informal):
UZWORDNET: A Lexical-Semantic Database for the Uzbek Language (Agostini et al., GWC 2021)
Copy Citation:
PDF:
https://aclanthology.org/2021.gwc-1.2.pdf
Code
 LDKR-Group/UzWordnet
Data
UzWordnet