Riqiang Wang
2024
Double Decoder: Improving latency for Streaming End-to-end ASR Models
Riqiang Wang | Shreekantha Nadig | Daniil Kulko | Simon Vandieken | Chia-tien Chang | Seyyed Saeed Sarfjoo | Jonas Robertson
Proceedings of the 7th International Conference on Natural Language and Speech Processing (ICNLSP 2024)
2021
Avengers, Ensemble! Benefits of ensembling in grapheme-to-phoneme prediction
Vagrant Gautam | Wang Yau Li | Zafarullah Mahmood | Fred Mailhot | Shreekantha Nadig | Riqiang Wang | Nathan Zhang
Proceedings of the 18th SIGMORPHON Workshop on Computational Research in Phonetics, Phonology, and Morphology
We describe three baseline-beating systems for the high-resource English-only sub-task of the SIGMORPHON 2021 Shared Task 1: a small ensemble that Dialpad’s speech recognition team uses internally, a well-known off-the-shelf model, and a larger ensemble model comprising these and others. We additionally discuss the challenges related to the provided data, along with the processing steps we took.
Co-authors
- Shreekantha Nadig 2
- Vagrant Gautam 1
- Wang Yau Li 1
- Zafarullah Mahmood 1
- Frederic Mailhot 1