Ontologies for Natural Language Processing: Case of Russian

Natalia Loukachevitch, Boris Dobrov


Abstract
The paper describes the RuThes family of Russian thesauri intended for natural language processing and information retrieval applications. RuThes-like thesauri include, besides RuThes, Sociopolitical thesaurus, Security Thesaurus, and Ontology on Natural Sciences and Technologies. The RuThes format is based on three approaches for developing computer resources: Princeton WordNet, information-retrieval thesauri, and formal ontologies. The published version of RuThes thesaurus (RuThes-lite 2.0) became a basis for semi-automatic generation of RuWordNet, WordNet-like thesaurus for Russian. Currently researchers can use both RuThes-lite or RuWordNet and compare them in applications. Other RuThes-like resources are being prepared to publication.
Anthology ID:
2018.clib-1.13
Volume:
Proceedings of the Third International Conference on Computational Linguistics in Bulgaria (CLIB 2018)
Month:
May
Year:
2018
Address:
Sofia, Bulgaria
Venue:
CLIB
SIG:
Publisher:
Department of Computational Linguistics, Institute for Bulgarian Language, Bulgarian Academy of Sciences
Note:
Pages:
93–103
Language:
URL:
https://aclanthology.org/2018.clib-1.13
DOI:
Bibkey:
Cite (ACL):
Natalia Loukachevitch and Boris Dobrov. 2018. Ontologies for Natural Language Processing: Case of Russian. In Proceedings of the Third International Conference on Computational Linguistics in Bulgaria (CLIB 2018), pages 93–103, Sofia, Bulgaria. Department of Computational Linguistics, Institute for Bulgarian Language, Bulgarian Academy of Sciences.
Cite (Informal):
Ontologies for Natural Language Processing: Case of Russian (Loukachevitch & Dobrov, CLIB 2018)
Copy Citation:
PDF:
https://aclanthology.org/2018.clib-1.13.pdf