An Automatic Vowel Space Generator for Language Learner Pronunciation Acquisition and Correction

Xinyuan Chao, Charbel El-Khaissi, Nicholas Kuo, Priscilla Kan John, Hanna Suominen


Abstract
Speech visualisations are known to help language learners to acquire correct pronunciation and promote a better study experience. We present a two-step approach based on two established techniques to display tongue tip movements of an acoustic speech signal on a vowel space plot. First we use Energy Entropy Ratio to extract vowels; and then we apply Linear Predictive Coding root method to estimate Formant 1 and Formant 2. We invited and collected acoustic data from one Modern Standard Arabic (MSA) lecture and four MSA students. Our proof of concept was able to reflect differences between the tongue tip movements in a native MSA speaker to those of a MSA language learner. This paper addresses principle methods for generating features that reflect bio-physiological features of speech and thus, facilitates an approach that can be generally adapted to languages other than MSA.
Anthology ID:
2020.alta-1.6
Volume:
Proceedings of the 18th Annual Workshop of the Australasian Language Technology Association
Month:
December
Year:
2020
Address:
Virtual Workshop
Editors:
Maria Kim, Daniel Beck, Meladel Mistica
Venue:
ALTA
SIG:
Publisher:
Australasian Language Technology Association
Note:
Pages:
54–64
Language:
URL:
https://aclanthology.org/2020.alta-1.6
DOI:
Bibkey:
Cite (ACL):
Xinyuan Chao, Charbel El-Khaissi, Nicholas Kuo, Priscilla Kan John, and Hanna Suominen. 2020. An Automatic Vowel Space Generator for Language Learner Pronunciation Acquisition and Correction. In Proceedings of the 18th Annual Workshop of the Australasian Language Technology Association, pages 54–64, Virtual Workshop. Australasian Language Technology Association.
Cite (Informal):
An Automatic Vowel Space Generator for Language Learner Pronunciation Acquisition and Correction (Chao et al., ALTA 2020)
Copy Citation:
PDF:
https://aclanthology.org/2020.alta-1.6.pdf