Revisiting Automatic Speech Recognition for Tamil and Hindi Connected Number Recognition

Rahul Mishra; Senthil Raja Gunaseela Boopathy; Manikandan Ravikiran; Shreyas Kulkarni; Mayurakshi Mukherjee; Ananth Ganesh; Kingshuk Banerjee

Abstract

Automatic Speech Recognition (ASR) and its applications are growing in popularity, with reasonable inference results. Recent state-of-the-art approaches often employ very large models to show high accuracy for ASR as a whole, but often do not analyze performance in detail across low-resource language applications. In this preliminary work, we revisit ASR in the context of Connected Number Recognition (CNR). More specifically, we (i) present a new dataset, HCNR, collected to understand the various errors of ASR models for CNR, (ii) establish a preliminary benchmark and baseline model for CNR, and (iii) explore error mitigation strategies and their after-effects on CNR. In the process, we also compare against large-scale end-to-end ASR models for reference, to show the effectiveness of our approach.

Anthology ID: 2023.dravidianlangtech-1.15
Volume: Proceedings of the Third Workshop on Speech and Language Technologies for Dravidian Languages
Month: September
Year: 2023
Address: Varna, Bulgaria
Editors: Bharathi R. Chakravarthi, Ruba Priyadharshini, Anand Kumar M, Sajeetha Thavareesan, Elizabeth Sherly
Venues: DravidianLangTech | WS
Publisher: INCOMA Ltd., Shoumen, Bulgaria
Pages: 116–123
URL: https://aclanthology.org/2023.dravidianlangtech-1.15/
Cite (ACL): Rahul Mishra, Senthil Raja Gunaseela Boopathy, Manikandan Ravikiran, Shreyas Kulkarni, Mayurakshi Mukherjee, Ananth Ganesh, and Kingshuk Banerjee. 2023. Revisiting Automatic Speech Recognition for Tamil and Hindi Connected Number Recognition. In Proceedings of the Third Workshop on Speech and Language Technologies for Dravidian Languages, pages 116–123, Varna, Bulgaria. INCOMA Ltd., Shoumen, Bulgaria.
Cite (Informal): Revisiting Automatic Speech Recognition for Tamil and Hindi Connected Number Recognition (Mishra et al., DravidianLangTech 2023)
PDF: https://aclanthology.org/2023.dravidianlangtech-1.15.pdf
