Regional Variation in the Performance of ASR Models on Croatian and Serbian

Tanja Samardžić, Peter Rupnik, Nikola Ljubešić


Abstract
Regional variation was a limiting factor for automatic speech recognition (ASR) before large language models. With the new technology, speech processing becomes more general, which opens the question of how to use data in similar languages such as Croatian and Serbian. In this paper, we analyse model performance in the currently available train-test scenarios with the goal of better understanding the mutual interference of these two languages. Our findings suggest that better performing models are not very sensitive to the regional variation. Training from scratch in one of the languages can give good results on both of them, while fine-tuning large pre-trained multilingual models on smaller data sets does not give the expected results.
Anthology ID:
2026.vardial-1.20
Volume:
Proceedings of the 13th Workshop on NLP for Similar Languages, Varieties and Dialects
Month:
March
Year:
2026
Address:
Rabat, Morocco
Venues:
VarDial | WS
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
242–249
Language:
URL:
https://aclanthology.org/2026.vardial-1.20/
DOI:
Bibkey:
Cite (ACL):
Tanja Samardžić, Peter Rupnik, and Nikola Ljubešić. 2026. Regional Variation in the Performance of ASR Models on Croatian and Serbian. In Proceedings of the 13th Workshop on NLP for Similar Languages, Varieties and Dialects, pages 242–249, Rabat, Morocco. Association for Computational Linguistics.
Cite (Informal):
Regional Variation in the Performance of ASR Models on Croatian and Serbian (Samardžić et al., VarDial 2026)
Copy Citation:
PDF:
https://aclanthology.org/2026.vardial-1.20.pdf