Phoneme Alignment Using the Information on Phonological Processes in Continuous Speech

Daniil Kocharov


Abstract
This study focuses on optimizing the Levenshtein algorithm to compute the optimal alignment between two phoneme transcriptions of a spoken utterance, each a sequence of phonetic symbols. The alignment is computed with the help of a confusion matrix whose costs for deleting, inserting, and substituting phonetic symbols take into account various phonological processes that occur in fluent speech, such as anticipatory assimilation, phone elision, and epenthesis. A corpus of about 30 hours of Russian read speech was used to evaluate the presented algorithms. The experimental results show a significant reduction in the misalignment rate compared with the baseline Levenshtein algorithm: the error rate fell from 1.1% to 0.28%.
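The general idea of a cost-weighted Levenshtein alignment can be illustrated with a minimal sketch. It is not the paper's implementation: the function names (`align`, `make_sub_cost`), the toy transcriptions, and every cost value below are assumptions chosen for illustration, and the paper's actual confusion matrix is not reproduced here.

```python
from typing import Callable, Dict, List, Optional, Tuple

# One aligned position: (canonical symbol, realized symbol); None marks a gap.
Pair = Tuple[Optional[str], Optional[str]]


def make_sub_cost(cheap: Dict[Tuple[str, str], float]) -> Callable[[str, str], float]:
    """Build a substitution cost: 0 for identity, a table value for listed
    pairs, 1 otherwise. The table contents are hypothetical."""
    def sub_cost(a: str, b: str) -> float:
        if a == b:
            return 0.0
        return cheap.get((a, b), 1.0)
    return sub_cost


def align(ref: List[str], hyp: List[str],
          sub_cost: Callable[[str, str], float],
          del_cost: float = 1.0, ins_cost: float = 1.0) -> Tuple[float, List[Pair]]:
    """Weighted Levenshtein alignment with backtrace."""
    n, m = len(ref), len(hyp)
    dist = [[0.0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        dist[i][0] = dist[i - 1][0] + del_cost
    for j in range(1, m + 1):
        dist[0][j] = dist[0][j - 1] + ins_cost
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            dist[i][j] = min(
                dist[i - 1][j - 1] + sub_cost(ref[i - 1], hyp[j - 1]),
                dist[i - 1][j] + del_cost,   # ref symbol has no counterpart
                dist[i][j - 1] + ins_cost,   # hyp symbol has no counterpart
            )
    # Walk back from the corner, preferring match/substitution on ties.
    pairs: List[Pair] = []
    i, j = n, m
    while i > 0 or j > 0:
        if i > 0 and j > 0 and dist[i][j] == dist[i - 1][j - 1] + sub_cost(ref[i - 1], hyp[j - 1]):
            pairs.append((ref[i - 1], hyp[j - 1]))
            i, j = i - 1, j - 1
        elif i > 0 and dist[i][j] == dist[i - 1][j] + del_cost:
            pairs.append((ref[i - 1], None))
            i -= 1
        else:
            pairs.append((None, hyp[j - 1]))
            j -= 1
    pairs.reverse()
    return dist[n][m], pairs


# Toy example (invented): canonical /v a s/ realized as [f s],
# i.e. the vowel is elided and /v/ is devoiced before /s/.
canonical = ["v", "a", "s"]
realized = ["f", "s"]

baseline_cost, baseline_pairs = align(canonical, realized, make_sub_cost({}))
informed_cost, informed_pairs = align(
    canonical, realized, make_sub_cost({("v", "f"): 0.5})  # assumed cheap devoicing
)
print(baseline_pairs)
print(informed_pairs)
```

With unit costs, this sketch pairs the vowel /a/ with [f] and treats /v/ as deleted. Once the devoicing substitution is made cheaper, it pairs /v/ with [f] and marks /a/ as elided, which is the kind of linguistically plausible alignment the abstract describes.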
Anthology ID:
L16-1308
Volume:
Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16)
Month:
May
Year:
2016
Address:
Portorož, Slovenia
Editors:
Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Sara Goggi, Marko Grobelnik, Bente Maegaard, Joseph Mariani, Helene Mazo, Asuncion Moreno, Jan Odijk, Stelios Piperidis
Venue:
LREC
Publisher:
European Language Resources Association (ELRA)
Pages:
1944–1948
URL:
https://aclanthology.org/L16-1308
Cite (ACL):
Daniil Kocharov. 2016. Phoneme Alignment Using the Information on Phonological Processes in Continuous Speech. In Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16), pages 1944–1948, Portorož, Slovenia. European Language Resources Association (ELRA).
Cite (Informal):
Phoneme Alignment Using the Information on Phonological Processes in Continuous Speech (Kocharov, LREC 2016)
PDF:
https://aclanthology.org/L16-1308.pdf