Whispering in Ol Chiki: Cross-Lingual Transfer Learning for Santali Speech Recognition

Atanu Mandal; Madhusudan Ghosh; Pratick Maiti; Sudip Kumar Naskar

doi:10.18653/v1/2025.findings-ijcnlp.16

Whispering in Ol Chiki: Cross-Lingual Transfer Learning for Santali Speech Recognition

Atanu Mandal, Madhusudan Ghosh, Pratick Maiti, Sudip Kumar Naskar

Abstract

India, a country with a large population, possesses two official and twenty-two scheduled languages, making it the most linguistically diverse nation. Despite being one of the scheduled languages, Santali remains a low-resource language. Although Ol Chiki is recognized as the official script for Santali, many continue to use Bengali, Devanagari, Odia, and Roman scripts. In tribute to the upcoming centennial anniversary of the Ol Chiki script, we present an Automatic Speech Recognition for Santali in the Ol Chiki script. Our approach involves cross-lingual transfer learning by utilizing the Whisper framework pre-trained in Bengali and Hindi on the Santali language, using Ol Chiki script transcriptions. With the adoption of the Bengali pre-trained framework, we achieved a Word Error Rate (WER) score of 28.47%, whereas the adaptation of the Hindi pre-trained framework resulted in a score of 34.50% WER. These outcomes were obtained using the Whisper Small framework.

Anthology ID:: 2025.findings-ijcnlp.16
Volume:: Proceedings of the 14th International Joint Conference on Natural Language Processing and the 4th Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics
Month:: December
Year:: 2025
Address:: Mumbai, India
Editors:: Kentaro Inui, Sakriani Sakti, Haofen Wang, Derek F. Wong, Pushpak Bhattacharyya, Biplab Banerjee, Asif Ekbal, Tanmoy Chakraborty, Dhirendra Pratap Singh
Venue:: Findings
SIG:
Publisher:: The Asian Federation of Natural Language Processing and The Association for Computational Linguistics
Note:
Pages:: 269–278
Language:
URL:: https://aclanthology.org/2025.findings-ijcnlp.16/
DOI:: 10.18653/v1/2025.findings-ijcnlp.16
Bibkey:
Cite (ACL):: Atanu Mandal, Madhusudan Ghosh, Pratick Maiti, and Sudip Kumar Naskar. 2025. Whispering in Ol Chiki: Cross-Lingual Transfer Learning for Santali Speech Recognition. In Proceedings of the 14th International Joint Conference on Natural Language Processing and the 4th Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics, pages 269–278, Mumbai, India. The Asian Federation of Natural Language Processing and The Association for Computational Linguistics.
Cite (Informal):: Whispering in Ol Chiki: Cross-Lingual Transfer Learning for Santali Speech Recognition (Mandal et al., Findings 2025)
Copy Citation:
PDF:: https://aclanthology.org/2025.findings-ijcnlp.16.pdf

PDF Cite Search Fix data