Olga Nevzorova


2017

Russian-Tatar Socio-Political Thesaurus: Methodology, Challenges, the Status of the Project
Alfiya Galieva | Olga Nevzorova | Dilyara Yakubova
Proceedings of the International Conference Recent Advances in Natural Language Processing, RANLP 2017

This paper discusses the general methodology and key practical aspects of building a new bilingual lexical resource, the Russian-Tatar Socio-Political Thesaurus. The thesaurus is being developed on the basis of the Russian RuThes thesaurus format as a hierarchy of concepts viewed as units of thought. Each concept is linked with a set of language expressions (words and collocations) that refer to it in texts (text entries). The thesaurus currently includes 6,000 concepts, and new concepts and text entries are constantly being added. The paper outlines the main challenges of translating concept names and their text entries into Tatar and describes ways of reflecting the specificity of the Tatar lexical-semantic system.
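
The structure described in the abstract (a hierarchy of concepts, each linked to text entries in two languages) can be pictured with a minimal Python sketch. This is not the RuThes format or the authors' implementation: the `Concept` class, its field names, the `build_index` helper, and the two sample concepts are hypothetical and chosen only for illustration.

```python
from __future__ import annotations

from dataclasses import dataclass, field


@dataclass
class Concept:
    """A hypothetical concept node: a unit of thought with bilingual text entries."""
    name_ru: str
    name_tt: str
    entries_ru: set[str] = field(default_factory=set)  # Russian words/collocations
    entries_tt: set[str] = field(default_factory=set)  # Tatar words/collocations
    children: list[Concept] = field(default_factory=list)  # narrower concepts

    def add_child(self, child: Concept) -> Concept:
        """Attach a narrower concept below this one and return it."""
        self.children.append(child)
        return child

    def walk(self):
        """Yield this concept and all of its descendants, depth-first."""
        yield self
        for child in self.children:
            yield from child.walk()


def build_index(root: Concept, lang: str) -> dict[str, list[Concept]]:
    """Map each lower-cased text entry in `lang` ("ru" or "tt") to its concepts."""
    index: dict[str, list[Concept]] = {}
    for concept in root.walk():
        entries = concept.entries_ru if lang == "ru" else concept.entries_tt
        for entry in entries:
            index.setdefault(entry.lower(), []).append(concept)
    return index


# Illustrative data only; these entries are not taken from the thesaurus.
root = Concept("государство", "дәүләт", {"государство"}, {"дәүләт"})
root.add_child(
    Concept("правительство", "хөкүмәт", {"правительство"}, {"хөкүмәт"})
)

index_tt = build_index(root, "tt")
```

Looking up a Tatar expression in `index_tt` returns the concepts it refers to. A list is used as the value so that one expression can point to more than one concept.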