Where Are We at with Automatic Speech Recognition for the Bambara Language?

Seydou Diallo, Yacouba Diarra, Panga Azazia Kamaté, Aboubacar Ouattara, Mamadou K. Keita, Adam Bouno Kampo


Abstract
This paper introduces the first standardized benchmark for evaluating Automatic Speech Recognition (ASR) in the Bambara language, utilizing one hour of professionally recorded Malian constitutional text. Designed as a controlled reference set under near-optimal acoustic and linguistic conditions, the benchmark was used to evaluate 37 models, ranging from Bambara-trained systems to large-scale commercial models. Our findings reveal that current ASR performance remains significantly below deployment standards; the top-performing system in terms of Word Error Rate (WER) achieved 46.76% and the best Character Error Rate (CER) of 13.00% was set by another model, while several prominent multilingual models exceeded 100% WER due to severe hallucinations. These results suggest that multilingual pre-training and model scaling alone are insufficient for underrepresented languages. Furthermore, because this dataset represents a best-case scenario of the most simplified and formal form of spoken Bambara, these figures likely establish an upper bound for performance in practical, real-world settings. We provide the benchmark and an accompanying public leaderboard to facilitate transparent evaluation and future research in Bambara speech technology.
Anthology ID:
2026.africanlp-main.26
Volume:
Proceedings of the 7th Workshop on African Natural Language Processing (AfricaNLP 2026)
Month:
March
Year:
2026
Address:
Rabat, Morocco
Editors:
Everlyn Asiko Chimoto, Constantine Lignos, Shamsuddeen Muhammad, Idris Abdulmumin, Clemencia Siro, David Ifeoluwa Adelani
Venues:
AfricaNLP | WS
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
248–255
Language:
URL:
https://aclanthology.org/2026.africanlp-main.26/
DOI:
Bibkey:
Cite (ACL):
Seydou Diallo, Yacouba Diarra, Panga Azazia Kamaté, Aboubacar Ouattara, Mamadou K. Keita, and Adam Bouno Kampo. 2026. Where Are We at with Automatic Speech Recognition for the Bambara Language?. In Proceedings of the 7th Workshop on African Natural Language Processing (AfricaNLP 2026), pages 248–255, Rabat, Morocco. Association for Computational Linguistics.
Cite (Informal):
Where Are We at with Automatic Speech Recognition for the Bambara Language? (Diallo et al., AfricaNLP 2026)
Copy Citation:
PDF:
https://aclanthology.org/2026.africanlp-main.26.pdf