Evaluating Hallucinations in Large Language Models for Bulgarian Language

Melania Berbatova, Yoan Salambashev


Abstract
In this short paper, we introduce the task of evaluating the hallucination of large language models for the Bulgarian language. We first give definitions of what is a hallucination in large language models and what evaluation methods for measuring hallucinations exist. Next, we give an overview of the multilingual evaluation of the latest large language models, focusing on the evaluation of the performance in Bulgarian on tasks, related to hallucination. We then present a method to evaluate the level of hallucination in a given language with no reference data, and provide some initial experiments with this method in Bulgarian. Finally, we provide directions for future research on the topic.
Anthology ID:
2023.ranlp-stud.6
Volume:
Proceedings of the 8th Student Research Workshop associated with the International Conference Recent Advances in Natural Language Processing
Month:
September
Year:
2023
Address:
Varna, Bulgaria
Editors:
Momchil Hardalov, Zara Kancheva, Boris Velichkov, Ivelina Nikolova-Koleva, Milena Slavcheva
Venue:
RANLP
SIG:
Publisher:
INCOMA Ltd., Shoumen, Bulgaria
Note:
Pages:
55–63
Language:
URL:
https://aclanthology.org/2023.ranlp-stud.6
DOI:
Bibkey:
Cite (ACL):
Melania Berbatova and Yoan Salambashev. 2023. Evaluating Hallucinations in Large Language Models for Bulgarian Language. In Proceedings of the 8th Student Research Workshop associated with the International Conference Recent Advances in Natural Language Processing, pages 55–63, Varna, Bulgaria. INCOMA Ltd., Shoumen, Bulgaria.
Cite (Informal):
Evaluating Hallucinations in Large Language Models for Bulgarian Language (Berbatova & Salambashev, RANLP 2023)
Copy Citation:
PDF:
https://aclanthology.org/2023.ranlp-stud.6.pdf