End-to-End Speech Recognition for Endangered Languages of Nepal

Marieke Meelen, Alexander O’neill, Rolando Coto-Solano


Abstract
This paper presents three experiments to test the most effective and efficient ASR pipeline to facilitate the documentation and preservation of endangered languages, which are often extremely low-resourced. With data from two languages in Nepal —Dzardzongke and Newar— we show that model improvements are different for different masses of data, and that transfer learning as well as a range of modifications (e.g. normalising amplitude and pitch) can be effective, but that a consistently-standardised orthography as NLP input and post-training dictionary corrections improve results even more.
Anthology ID:
2024.computel-1.12
Volume:
Proceedings of the Seventh Workshop on the Use of Computational Methods in the Study of Endangered Languages
Month:
March
Year:
2024
Address:
St. Julians, Malta
Editors:
Sarah Moeller, Godfred Agyapong, Antti Arppe, Aditi Chaudhary, Shruti Rijhwani, Christopher Cox, Ryan Henke, Alexis Palmer, Daisy Rosenblum, Lane Schwartz
Venues:
ComputEL | WS
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
83–93
Language:
URL:
https://aclanthology.org/2024.computel-1.12
DOI:
Bibkey:
Cite (ACL):
Marieke Meelen, Alexander O’neill, and Rolando Coto-Solano. 2024. End-to-End Speech Recognition for Endangered Languages of Nepal. In Proceedings of the Seventh Workshop on the Use of Computational Methods in the Study of Endangered Languages, pages 83–93, St. Julians, Malta. Association for Computational Linguistics.
Cite (Informal):
End-to-End Speech Recognition for Endangered Languages of Nepal (Meelen et al., ComputEL-WS 2024)
Copy Citation:
PDF:
https://aclanthology.org/2024.computel-1.12.pdf