Automatic Detection of the Bulgarian Evidential Renarrative

Irina Temnikova; Ruslana Margova; Stefan Minkov; Tsvetelina Stefanova; Nevena Grigorova; Silvia Gargova; Venelin Kovatchev

doi:10.47810/JCLIB.1.2025.04

Abstract

Manual and automatic verification of the trustworthiness of information is an important task. Knowing whether the author of a statement was an eyewitness to the reported event(s) is a useful clue. In linguistics, such information is expressed through “evidentiality”. Evidentials are especially important in Bulgarian, as Bulgarian journalists often use a specific type of evidential (“renarrative”) to report events that they neither directly observed nor verified. Unfortunately, there are no automatic tools to detect the Bulgarian renarrative. This article presents the first two automatic solutions for this task: a fine-tuned BERT classifier (renarrative BERT detector, BGRenBERT), which achieves 0.98 accuracy on the test split, and a rule-based renarrative detector (BGRenRules), built from regular expressions that match a parser’s output. Both solutions detect Bulgarian texts containing the most frequently encountered forms of the renarrative. Additionally, we compare the results of the two detectors with the manual annotation of subsets of two Bulgarian fake text datasets. In this comparison, BGRenRules obtains substantially higher results than BGRenBERT. The error analysis shows that the errors from BGRenRules most frequently correspond to cases in which humans also have doubts. The training dataset (BgRenData), the annotated dataset subsets, and the two detectors are publicly available on Zenodo, GitHub, and HuggingFace. We expect that these new resources will be of invaluable assistance to 1) Bulgarian-language researchers and 2) researchers of other languages with similar phenomena, especially those working on verifying information.

Anthology ID: 2025.jclib-1.4
Volume: Journal Computational Linguistics in Bulgaria
Month: July
Year: 2025
Address: Sofia, Bulgaria
Editor: Svetla Koeva
Venue: JCLIB
Publisher: Institute for Bulgarian Language, Department of Computational Linguistics, Bulgarian Academy of Sciences
Pages: 61–83
URL: https://aclanthology.org/2025.jclib-1.4/
DOI: 10.47810/JCLIB.1.2025.04
Cite (ACL): Irina Temnikova, Ruslana Margova, Stefan Minkov, Tsvetelina Stefanova, Nevena Grigorova, Silvia Gargova, and Venelin Kovatchev. 2025. Automatic Detection of the Bulgarian Evidential Renarrative. Journal Computational Linguistics in Bulgaria, 1:61–83.
Cite (Informal): Automatic Detection of the Bulgarian Evidential Renarrative (Temnikova et al., JCLIB 2025)
PDF: https://aclanthology.org/2025.jclib-1.4.pdf
