An Investigation of Hybrid architectures for Low Resource Multilingual Speech Recognition system in Indian context

Ganesh Mirishkar, Aditya Yadavalli, Anil Kumar Vuppala


Abstract
India is a land of linguistic diversity. Approximately 2000 languages are spoken across the country, of which 23 are officially registered. Of those, very few have Automatic Speech Recognition (ASR) capability, because building an ASR system requires thousands of hours of annotated speech data, a vast amount of text, and a lexicon that spans all the words in the language. At the same time, Indian languages are observed to share a common phonetic base. In this work, we build a multilingual speech recognition system for low-resource languages by leveraging this shared phonetic space. Deep neural architectures play a vital role in improving the performance of low-resource ASR systems. The typical strategy for training a multilingual acoustic model is to merge the various languages into a unified group. In this paper, the speech recognition system is built using six Indian languages, namely Gujarati, Hindi, Marathi, Odia, Tamil, and Telugu. Experiments were performed with various state-of-the-art acoustic modeling and language modeling techniques.
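
The pooling strategy mentioned in the abstract, merging several languages into one unified group that shares a phone inventory, can be illustrated with a minimal sketch. This is not the authors' pipeline: the function name `pool_languages`, the corpus layout, and the toy phone labels are all hypothetical and chosen only to show the idea.

```python
def pool_languages(corpora):
    """Merge per-language corpora into one training list and one shared phone set.

    corpora: dict mapping a language name to a list of
             (utterance_id, phone_sequence) pairs. Hypothetical layout.
    """
    pooled = []
    shared_phones = set()
    for lang in sorted(corpora):
        for utt_id, phones in corpora[lang]:
            # Prefix the utterance id with the language so ids stay unique.
            pooled.append((f"{lang}-{utt_id}", lang, list(phones)))
            # Phones with the same label are shared across languages.
            shared_phones.update(phones)
    return pooled, sorted(shared_phones)


# Toy data with made-up phone labels (assumption, not taken from the paper).
corpora = {
    "hindi": [("utt1", ["k", "a", "m"])],
    "telugu": [("utt1", ["k", "a", "l", "u"])],
}
pooled, phones = pool_languages(corpora)
print(len(pooled), phones)
```

In this sketch the two languages contribute seven phone tokens but only five distinct labels, which is the kind of overlap a shared phonetic space is meant to exploit when data per language is scarce.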
Anthology ID:
2021.icon-main.25
Volume:
Proceedings of the 18th International Conference on Natural Language Processing (ICON)
Month:
December
Year:
2021
Address:
National Institute of Technology Silchar, Silchar, India
Editors:
Sivaji Bandyopadhyay, Sobha Lalitha Devi, Pushpak Bhattacharyya
Venue:
ICON
Publisher:
NLP Association of India (NLPAI)
Pages:
205–212
URL:
https://aclanthology.org/2021.icon-main.25
Cite (ACL):
Ganesh Mirishkar, Aditya Yadavalli, and Anil Kumar Vuppala. 2021. An Investigation of Hybrid architectures for Low Resource Multilingual Speech Recognition system in Indian context. In Proceedings of the 18th International Conference on Natural Language Processing (ICON), pages 205–212, National Institute of Technology Silchar, Silchar, India. NLP Association of India (NLPAI).
Cite (Informal):
An Investigation of Hybrid architectures for Low Resource Multilingual Speech Recognition system in Indian context (Mirishkar et al., ICON 2021)
PDF:
https://aclanthology.org/2021.icon-main.25.pdf