Elena Bruches
2023
Relation Extraction from Scientific Texts in Russian with Limited Training Data
Olga Tikhobaeva
|
Elena Bruches
Proceedings of the Second Workshop on Information Extraction from Scientific Publications
2022
TERMinator: A System for Scientific Texts Processing
Elena Bruches
|
Olga Tikhobaeva
|
Yana Dementyeva
|
Tatiana Batura
Proceedings of the 29th International Conference on Computational Linguistics
This paper is devoted to the extraction of entities and semantic relations between them from scientific texts, where we consider scientific terms as entities. In this paper, we present a dataset that includes annotations for two tasks and develop a system called TERMinator for the study of the influence of language models on term recognition and comparison of different approaches for relation extraction. Experiments show that language models pre-trained on the target language are not always show the best performance. Also adding some heuristic approaches may improve the overall quality of the particular task. The developed tool and the annotated corpus are publicly available at https://github.com/iis-research-team/terminator and may be useful for other researchers.
Search