Borislav Rizov


2022

Towards Dynamic Wordnet: Time Flow Hydra
Borislav Rizov | Tinko Tinchev
Proceedings of the Fifth International Conference on Computational Linguistics in Bulgaria (CLIB 2022)

Hydra is a wordnet management system in which synsets from different languages live in a common relational structure (a Kripke frame), with a user-friendly GUI for searching, editing, and aligning objects across languages. Data is retrieved by means of a modal logic query language. Despite its many merits, the system stores only the current state of the wordnet data. Wordnet editing and development raise questions about the data, its structure, and its consistency over time. The new Time Flow Hydra uses a dynamic wordnet model with embedded discrete time, in which all states of all objects are stored and can be accessed simultaneously. This makes it possible to track changes and to detect both desired and undesired results of the data evolution. For example, we can ask which objects had 2 hyponyms 10 days ago and 3 hyponyms 5 days later.
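The example query in the abstract can be illustrated with a minimal sketch. This is not Hydra's implementation or its modal query language: the class `TimeFlowStore`, its methods, the day numbering, and the sample synsets are all hypothetical, chosen only to show how keeping every state alongside a discrete timestamp makes such a question answerable.

```python
from bisect import bisect_right
from collections import defaultdict


class TimeFlowStore:
    """Toy versioned store (hypothetical): every change is kept with a discrete day number."""

    def __init__(self):
        # synset id -> chronologically ordered list of (day, frozenset of hyponym ids)
        self._history = defaultdict(list)

    def set_hyponyms(self, synset, day, hyponyms):
        """Record the full hyponym set of `synset` as of `day` without discarding older states."""
        versions = self._history[synset]
        if versions and day < versions[-1][0]:
            raise ValueError("changes must be recorded in chronological order")
        versions.append((day, frozenset(hyponyms)))

    def hyponyms_at(self, synset, day):
        """Return the hyponym set that was in effect on `day` (empty if none was recorded yet)."""
        versions = self._history.get(synset, [])
        days = [d for d, _ in versions]
        i = bisect_right(days, day)  # index just past the last version with d <= day
        return versions[i - 1][1] if i else frozenset()

    def changed_count(self, day_a, count_a, day_b, count_b):
        """Synsets with `count_a` hyponyms on `day_a` and `count_b` hyponyms on `day_b`."""
        return sorted(
            s for s in self._history
            if len(self.hyponyms_at(s, day_a)) == count_a
            and len(self.hyponyms_at(s, day_b)) == count_b
        )


store = TimeFlowStore()
store.set_hyponyms("dog", 0, {"puppy", "hound"})
store.set_hyponyms("dog", 7, {"puppy", "hound", "terrier"})
store.set_hyponyms("cat", 0, {"kitten", "tomcat"})

today = 12
# "Which objects had 2 hyponyms 10 days ago and 3 hyponyms 5 days later?"
result = store.changed_count(today - 10, 2, today - 5, 3)
print(result)
```

Storing whole snapshots per change keeps the lookup simple; a real system would more likely store the relation edges themselves with validity intervals.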

2018

Online Editor for WordNets
Borislav Rizov | Tsvetana Dimitrova
Proceedings of the Third International Conference on Computational Linguistics in Bulgaria (CLIB 2018)

The paper presents Hydra for Web, an online editor for lexical-semantic databases whose relational structure is similar to that of WordNet. It supports editing of relational data (including querying, creating, changing, and linking relational objects), simultaneous access by multiple user profiles, and visualization and editing of the language data in both single and parallel modes.

2016

Hydra for Web: A Browser for Easy Access to Wordnets
Borislav Rizov | Tsvetana Dimitrova
Proceedings of the 8th Global WordNet Conference (GWC)

This paper presents Hydra for Web, a web interface for wordnets built with modern web technologies on top of Hydra, an open source tool for wordnet development. It is a Single Page Application with a simple but powerful and convenient GUI. It offers two modes, single and parallel, for visualising the language correspondences of the wordnet synsets found by a search. Hydra for Web is available at: http://dcl.bas.bg/bulnet/.

2015

Automatic Classification of WordNet Morphosemantic Relations
Svetlozara Leseva | Ivelina Stoyanova | Maria Todorova | Tsvetana Dimitrova | Borislav Rizov | Svetla Koeva
The 5th Workshop on Balto-Slavic Natural Language Processing

2014

Coping with Derivation in the Bulgarian Wordnet
Tsvetana Dimitrova | Ekaterina Tarpomanova | Borislav Rizov
Proceedings of the Seventh Global Wordnet Conference

Hydra: A Software System for Wordnet
Borislav Rizov
Proceedings of the Seventh Global Wordnet Conference

Automatic Semantic Filtering of Morphosemantic Relations in WordNet
Svetlozara Leseva | Ivelina Stoyanova | Borislav Rizov | Maria Todorova | Ekaterina Tarpomanova
Proceedings of the First International Conference on Computational Linguistics in Bulgaria (CLIB 2014)

In this paper we present a method for automatic assignment of morphosemantic relations between derivationally related verb–noun pairs of synsets in the Bulgarian WordNet (BulNet) and for semantic filtering of those relations. The filtering process relies on the meaning of noun suffixes and the semantic compatibility of verb and noun taxonomic classes. We use the taxonomic labels assigned to all the synsets in the Princeton WordNet (PWN) – one label per synset – which denote their general semantic class. In the first iteration we employ the pairs <noun suffix : noun label> to filter out part of the relations. In the second iteration, which uses as input the output of the first one, we apply a stronger semantic filter. It makes use of the taxonomic labels of the noun-verb synset pairs observed for a given morphosemantic relation. In this way we manage to reliably filter out impossible or unlikely combinations. The results of the performed experiment may be applied to enrich BulNet with morphosemantic relations and new synsets semi-automatically, while facilitating the manual work and reducing its cost.
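The two filtering iterations described above can be sketched roughly as follows. This is an illustrative reading of the abstract, not the authors' code: the function `filter_relations`, the dictionary layout, the English suffixes, and the sample compatibility sets are all assumptions. Only the label names follow the Princeton WordNet convention (for example `noun.person`).

```python
def filter_relations(candidates, compatible_suffix_labels, observed_label_pairs):
    """Two-pass semantic filter over candidate morphosemantic relations (hypothetical sketch).

    candidates: dicts with keys suffix, noun_label, verb_label, relation.
    compatible_suffix_labels: set of (noun suffix, noun label) pairs considered possible.
    observed_label_pairs: relation -> set of (verb label, noun label) pairs seen for it.
    """
    # Pass 1: drop candidates whose noun suffix is incompatible with the noun's label.
    first_pass = [
        c for c in candidates
        if (c["suffix"], c["noun_label"]) in compatible_suffix_labels
    ]
    # Pass 2 (takes the output of pass 1): keep a candidate only if its verb/noun
    # label combination has been observed for the proposed relation.
    return [
        c for c in first_pass
        if (c["verb_label"], c["noun_label"])
        in observed_label_pairs.get(c["relation"], set())
    ]


# Illustrative data only; the paper works with Bulgarian suffixes and BulNet synsets.
candidates = [
    {"pair": ("teach", "teacher"), "suffix": "-er", "noun_label": "noun.person",
     "verb_label": "verb.communication", "relation": "agent"},
    {"pair": ("teach", "teacher"), "suffix": "-er", "noun_label": "noun.person",
     "verb_label": "verb.communication", "relation": "instrument"},
    {"pair": ("teach", "teaching"), "suffix": "-ing", "noun_label": "noun.person",
     "verb_label": "verb.communication", "relation": "agent"},
]
compatible_suffix_labels = {("-er", "noun.person"), ("-er", "noun.artifact")}
observed_label_pairs = {
    "agent": {("verb.communication", "noun.person")},
    "instrument": {("verb.contact", "noun.artifact")},
}

kept = filter_relations(candidates, compatible_suffix_labels, observed_label_pairs)
print([(c["pair"], c["relation"]) for c in kept])
```

In this toy run the third candidate is removed in the first pass (suffix and label do not match), and the second is removed in the second pass (the label pair was never observed for that relation).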

Noun-Verb Derivation in the Bulgarian and the Romanian WordNet – A Comparative Approach
Ekaterina Tarpomanova | Svetlozara Leseva | Maria Todorova | Tsvetana Dimitrova | Borislav Rizov | Verginica Barbu Mititelu | Elena Irimia
Proceedings of the First International Conference on Computational Linguistics in Bulgaria (CLIB 2014)

Romanian and Bulgarian are Balkan languages with rich derivational morphology that, if introduced into their respective wordnets, can help broaden the wordnet content and the range of possible NLP applications. In this paper we present joint work on introducing derivation into the Bulgarian and the Romanian WordNets, BulNet and RoWordNet, respectively, by identifying and subsequently labelling the derivationally and semantically related noun-verb pairs. Our research aims to provide a framework for a comparative study of derivation in the two languages and to offer training material for the automatic identification and assignment of derivational and morphosemantic relations needed in various applications.

Anaphora – Clause Annotation and Alignment Tool.
Borislav Rizov | Rositsa Dekova
Proceedings of the Demonstrations at the 14th Conference of the European Chapter of the Association for Computational Linguistics

2012

Application of Clause Alignment for Statistical Machine Translation
Svetla Koeva | Svetlozara Leseva | Ivelina Stoyanova | Rositsa Dekova | Angel Genov | Borislav Rizov | Tsvetana Dimitrova | Ekaterina Tarpomanova | Hristina Kukova
Proceedings of the Sixth Workshop on Syntax, Semantics and Structure in Statistical Translation

Bulgarian X-language Parallel Corpus
Svetla Koeva | Ivelina Stoyanova | Rositsa Dekova | Borislav Rizov | Angel Genov
Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12)

The paper presents the methodology and the outcome of the compilation and the processing of the Bulgarian X-language Parallel Corpus (Bul-X-Cor) which was integrated as part of the Bulgarian National Corpus (BulNC). We focus on building representative parallel corpora which include a diversity of domains and genres, reflect the relations between Bulgarian and other languages and are consistent in terms of compilation methodology, text representation, metadata description and annotation conventions. The approaches implemented in the construction of Bul-X-Cor include using readily available text collections on the web, manual compilation (by means of Internet browsing) and preferably automatic compilation (by means of web crawling ― general and focused). Certain levels of annotation applied to Bul-X-Cor are taken as obligatory (sentence segmentation and sentence alignment), while others depend on the availability of tools for a particular language (morpho-syntactic tagging, lemmatisation, syntactic parsing, named entity recognition, word sense disambiguation, etc.) or for a particular task (word and clause alignment). To achieve uniformity of the annotation we have either annotated raw data from scratch or transformed the already existing annotation to follow the conventions accepted for BulNC. Finally, actual uses of the corpora are presented and conclusions are drawn with respect to future work.

2008

Chooser: a Multi-Task Annotation Tool
Svetla Koeva | Borislav Rizov | Svetlozara Leseva
Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC'08)

The paper presents a tool that assists manual annotation of linguistic data, developed at the Department of Computational Linguistics, IBL-BAS. Chooser is a general-purpose modular application for corpus annotation based on the principles of commonality and reusability of the created resources, language and theory independence, extendibility and user-friendliness. These features have been achieved through a powerful abstract architecture within the Model-View-Controller paradigm that is easily tailored to task-specific requirements and readily extendable to new applications. The tool is to a considerable extent independent of data format and representation and produces outputs that are largely consistent with existing standards. The annotated data are therefore reusable in tasks requiring different levels of annotation and are accessible to external applications. The tool incorporates edit functions, pass and arrangement strategies that facilitate annotators’ work. The relevant module produces tree-structured and graph-based representations in the respective annotation modes. Another valuable feature of the application is concurrent access by multiple users and centralised storage of the lexical resources underlying annotation schemata, as well as of annotations, including frequency of selection, updates in the lexical database, etc. Chooser has been successfully applied to a number of tasks: POS tagging, word sense (WS) annotation, and syntactic annotation.

Hydra: a Modal Logic Tool for Wordnet Development, Validation and Exploration
Borislav Rizov
Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC'08)

This paper presents a multipurpose system for wordnet (WN) development, named Hydra. Hydra is an application for data editing and validation, as well as for data retrieval and synchronization between wordnets for different languages. The use of a modal language for wordnets, the representation of the wordnet as a relational database, and concurrent access are among its main advantages.