Liam Lonergan


2022

Automatic Speech Recognition for Irish: the ABAIR-ÉIST System
Liam Lonergan | Mengjie Qian | Harald Berthelsen | Andy Murphy | Christoph Wendler | Neasa Ní Chiaráin | Christer Gobl | Ailbhe Ní Chasaide
Proceedings of the 4th Celtic Language Technology Workshop within LREC2022

This paper describes ÉIST, an automatic speech recogniser for Irish, developed as part of the ongoing ABAIR initiative, combining (1) acoustic models, (2) pronunciation lexicons and (3) language models into a hybrid system. The current priority is a system that can deal with the multiple diverse native-speaker dialects. Consequently, (1) was built using predominantly native-speaker speech, which included earlier recordings used for synthesis development as well as more diverse recordings obtained using the MíleGlór platform. The pronunciation variation across the dialects is a particular challenge in the development of (2) and is explored by testing both Trans-dialect and Multi-dialect letter-to-sound rules. Two approaches to language modelling (3) are used in the hybrid system: a simple n-gram model and recurrent neural network lattice rescoring, the latter garnering impressive performance improvements. The system is evaluated using a test set comprising both native and non-native speakers, which allows some inferences to be made about the performance of the system on both cohorts.