Improved pronunciation prediction accuracy using morphology

Dravyansh Sharma, Saumya Sahai, Neha Chaudhari, Antoine Bruguier


Abstract
Pronunciation lexicons and prediction models are a key component in several speech synthesis and recognition systems. We know that morphologically related words typically follow a fixed pattern of pronunciation which can be described by language-specific paradigms. In this work, we explore how deep recurrent neural networks can be used to automatically learn and exploit this pattern to improve the pronunciation prediction quality of words related by morphological inflection. We propose two novel approaches for supplying morphological information, using the word’s morphological class and its lemma, which are typically annotated in standard lexicons. We report improvements across a number of European languages spanning two language families and varying degrees of phonological and morphological complexity, with greater improvements for languages where the pronunciation prediction task is inherently more challenging. We also observe that combining bidirectional LSTM networks with attention mechanisms is an effective neural approach for this task across languages. Our approach seems particularly beneficial in the low-resource setting, both by itself and in conjunction with transfer learning.
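The abstract names two kinds of morphological side information, the word's morphological class and its lemma, but does not say how they are fed to the network. The following is a minimal sketch of one plausible encoding, not the paper's confirmed method: the class is prepended as a single tag token and the lemma's characters are appended after a separator. The function name `build_input_tokens`, the `<sep>` token, and the `V;PST` tag format are all assumptions made for illustration.

```python
from typing import List, Optional

SEP = "<sep>"  # hypothetical separator token between word and lemma


def build_input_tokens(
    word: str,
    morph_class: Optional[str] = None,
    lemma: Optional[str] = None,
) -> List[str]:
    """Turn a word plus optional morphological annotations into a token
    sequence that a sequence-to-sequence pronunciation model could consume.

    This is an illustrative encoding only (an assumption, not taken from
    the paper): graphemes are single-character tokens, the morphological
    class becomes one leading tag token, and the lemma is appended after
    a separator.
    """
    tokens = list(word)  # one token per grapheme
    if morph_class is not None:
        tokens = [f"<{morph_class}>"] + tokens
    if lemma is not None:
        tokens = tokens + [SEP] + list(lemma)
    return tokens


if __name__ == "__main__":
    print(build_input_tokens("walked"))
    print(build_input_tokens("walked", morph_class="V;PST"))
    print(build_input_tokens("walked", morph_class="V;PST", lemma="walk"))
```

With no annotations the function returns only the graphemes, so the same encoder could serve a baseline and the morphology-aware variants in a comparison.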
Anthology ID:
2021.sigmorphon-1.24
Volume:
Proceedings of the 18th SIGMORPHON Workshop on Computational Research in Phonetics, Phonology, and Morphology
Month:
August
Year:
2021
Address:
Online
Editors:
Garrett Nicolai, Kyle Gorman, Ryan Cotterell
Venue:
SIGMORPHON
SIG:
SIGMORPHON
Publisher:
Association for Computational Linguistics
Pages:
222–228
URL:
https://aclanthology.org/2021.sigmorphon-1.24
DOI:
10.18653/v1/2021.sigmorphon-1.24
Cite (ACL):
Dravyansh Sharma, Saumya Sahai, Neha Chaudhari, and Antoine Bruguier. 2021. Improved pronunciation prediction accuracy using morphology. In Proceedings of the 18th SIGMORPHON Workshop on Computational Research in Phonetics, Phonology, and Morphology, pages 222–228, Online. Association for Computational Linguistics.
Cite (Informal):
Improved pronunciation prediction accuracy using morphology (Sharma et al., SIGMORPHON 2021)
PDF:
https://aclanthology.org/2021.sigmorphon-1.24.pdf
Video:
https://aclanthology.org/2021.sigmorphon-1.24.mp4