Automatic Speech Recognition System for Malasar Language using Multilingual Transfer Learning

Basil K. Raju, Leena G. Pillai, Kavya Manohar, Elizabeth Sherly


Abstract
This study pioneers the development of an automatic speech recognition (ASR) system for the Malasar language, an extremely low-resource ethnic language spoken by a tribal community in the Western Ghats of South India. Malasar is primarily an oral language which does not have a native script. Therefore, Malasar is often transcribed in Tamil script, a closely related major language. This work presents the first ever effort of leveraging the capabilities of multilingual transfer learning for recognising malasar speech. We fine-tune a pre-trained multilingual transformer model with Malasar speech data. In our endeavour to fine-tune this model using a Malasar speech corpus, we could successfully bring down the WER to 48.00% from 99.08% (zero shot baseline). This work demonstrates the efficacy of multilingual transfer learning in addressing the challenges of ASR for extremely low-resource languages, contributing to the preservation of their linguistic and cultural heritage.
Anthology ID:
2023.icon-1.41
Volume:
Proceedings of the 20th International Conference on Natural Language Processing (ICON)
Month:
December
Year:
2023
Address:
Goa University, Goa, India
Editors:
Jyoti D. Pawar, Sobha Lalitha Devi
Venue:
ICON
SIG:
SIGLEX
Publisher:
NLP Association of India (NLPAI)
Note:
Pages:
472–477
Language:
URL:
https://aclanthology.org/2023.icon-1.41
DOI:
Bibkey:
Cite (ACL):
Basil K. Raju, Leena G. Pillai, Kavya Manohar, and Elizabeth Sherly. 2023. Automatic Speech Recognition System for Malasar Language using Multilingual Transfer Learning. In Proceedings of the 20th International Conference on Natural Language Processing (ICON), pages 472–477, Goa University, Goa, India. NLP Association of India (NLPAI).
Cite (Informal):
Automatic Speech Recognition System for Malasar Language using Multilingual Transfer Learning (K. Raju et al., ICON 2023)
Copy Citation:
PDF:
https://aclanthology.org/2023.icon-1.41.pdf