Riqiang Wang
2024
Double Decoder: Improving latency for Streaming End-to-end ASR Models
Riqiang Wang | Shreekantha Nadig | Daniil Kulko | Simon Vandieken | Chia-tien Chang | Seyyed Saeed Sarfjoo | Jonas Robertson
Proceedings of the 7th International Conference on Natural Language and Speech Processing (ICNLSP 2024)
2021
Avengers, Ensemble! Benefits of ensembling in grapheme-to-phoneme prediction
Vagrant Gautam | Wang Yau Li | Zafarullah Mahmood | Fred Mailhot | Shreekantha Nadig | Riqiang Wang | Nathan Zhang
Proceedings of the 18th SIGMORPHON Workshop on Computational Research in Phonetics, Phonology, and Morphology
We describe three baseline-beating systems for the high-resource English-only sub-task of the SIGMORPHON 2021 Shared Task 1: a small ensemble that Dialpad’s speech recognition team uses internally, a well-known off-the-shelf model, and a larger ensemble model comprising these and others. We additionally discuss the challenges related to the provided data, along with the processing steps we took.