Set-up of a Unit-Selection Synthesis with a Prominent Voice

Stefan Breuer; Sven Bergmann; Ralf Dragon; Sebastian Möller

Abstract

In this paper, we describe the set-up process and an initial evaluation of a unit-selection speech synthesizer. The synthesizer is unusual in that it is intended to speak with a prominent voice. As a consequence, only very limited resources were available for setting up the unit database. These resources were extracted from an audio book, segmented with the help of an HMM-based wrapper, and then used with the non-uniform unit-selection approach implemented in the Bonn Open Synthesis System (BOSS). To adapt the database to the BOSS implementation, the label files were augmented with phrase boundaries, converted to XML, enriched with prosodic and spectral information, and then converted to a MySQL relational database structure. The BOSS system selects units on the basis of this information, adding individual unit costs to the concatenation costs given by MFCC and F0 distances. The paper discusses the problems that occurred during the database set-up, the effort invested, and the quality level that can be reached with this approach.
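
The cost combination mentioned in the abstract (individual unit costs added to concatenation costs derived from MFCC and F0 distances) can be illustrated with a small dynamic-programming search. The following is only a minimal sketch under stated assumptions, not the BOSS implementation: the `Unit` fields, the function names `concat_cost` and `select_units`, the Euclidean MFCC distance, and the weights `w_mfcc` and `w_f0` are all hypothetical choices made for illustration.

```python
import math
from dataclasses import dataclass
from typing import List, Sequence


@dataclass
class Unit:
    """A candidate unit (hypothetical structure, not the BOSS schema)."""
    name: str
    unit_cost: float              # precomputed individual unit cost
    mfcc_start: Sequence[float]   # MFCC vector at the left unit edge
    mfcc_end: Sequence[float]     # MFCC vector at the right unit edge
    f0_start: float               # F0 in Hz at the left unit edge
    f0_end: float                 # F0 in Hz at the right unit edge


def concat_cost(left: Unit, right: Unit,
                w_mfcc: float = 1.0, w_f0: float = 0.01) -> float:
    """Weighted sum of MFCC and F0 distances at the join (weights assumed)."""
    mfcc_dist = math.dist(left.mfcc_end, right.mfcc_start)
    f0_dist = abs(left.f0_end - right.f0_start)
    return w_mfcc * mfcc_dist + w_f0 * f0_dist


def select_units(candidates: List[List[Unit]]) -> List[Unit]:
    """Pick one unit per position, minimising unit plus concatenation costs."""
    if not candidates or any(not c for c in candidates):
        raise ValueError("every target position needs at least one candidate")

    # best[i][j] = (cheapest total cost ending in candidate j, backpointer)
    best = [[(u.unit_cost, -1) for u in candidates[0]]]
    for i in range(1, len(candidates)):
        row = []
        for u in candidates[i]:
            cost, back = min(
                (best[i - 1][k][0] + concat_cost(p, u), k)
                for k, p in enumerate(candidates[i - 1])
            )
            row.append((cost + u.unit_cost, back))
        best.append(row)

    # Trace back from the cheapest final candidate.
    j = min(range(len(best[-1])), key=lambda k: best[-1][k][0])
    path = []
    for i in range(len(candidates) - 1, -1, -1):
        path.append(candidates[i][j])
        j = best[i][j][1]
    path.reverse()
    return path
```

For example, given two target positions with two candidates each, `select_units([[a1, a2], [b1, b2]])` returns the pair whose summed unit and join costs are lowest, which need not be the pair with the lowest unit costs taken individually.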

Anthology ID: L06-1173
Volume: Proceedings of the Fifth International Conference on Language Resources and Evaluation (LREC’06)
Month: May
Year: 2006
Address: Genoa, Italy
Editors: Nicoletta Calzolari, Khalid Choukri, Aldo Gangemi, Bente Maegaard, Joseph Mariani, Jan Odijk, Daniel Tapias
Venue: LREC
Publisher: European Language Resources Association (ELRA)
URL: http://www.lrec-conf.org/proceedings/lrec2006/pdf/307_pdf.pdf
Cite (ACL): Stefan Breuer, Sven Bergmann, Ralf Dragon, and Sebastian Möller. 2006. Set-up of a Unit-Selection Synthesis with a Prominent Voice. In Proceedings of the Fifth International Conference on Language Resources and Evaluation (LREC’06), Genoa, Italy. European Language Resources Association (ELRA).
Cite (Informal): Set-up of a Unit-Selection Synthesis with a Prominent Voice (Breuer et al., LREC 2006)