Sense Meets Nonsense - Sense Meets Nonsense - a dual-layer Danish speech corpus for perception studies

Thomas Ulrich Christiansen, Peter Juel Henrichsen


Abstract
In this paper, we present the newly established Danish speech corpus PiTu. The corpus consists of recordings of 28 native Danish talkers (14 female and 14 male) each reproducing (i) a series of nonsense syllables, and (ii) a set of authentic natural language sentences. The speech corpus is tailored for investigating the relationship between early stages of the speech perceptual process and later stages. We present our considerations involved in preparing the experimental set-up, producing the anechoic recordings, compiling the data, and exploring the materials in linguistic research. We report on a small pilot experiment demonstrating how PiTu and similar speech corpora can be used in studies of prosody as a function of semantic content. The experiment addresses the issue of whether the governing principles of Danish prosody assignment is mainly talker-specific or mainly content-typical (under the specific experimental conditions). The corpus is available in its entirety for download at http://amtoolbox.sourceforge.net/pitu/.
Anthology ID:
L12-1155
Volume:
Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12)
Month:
May
Year:
2012
Address:
Istanbul, Turkey
Editors:
Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Mehmet Uğur Doğan, Bente Maegaard, Joseph Mariani, Asuncion Moreno, Jan Odijk, Stelios Piperidis
Venue:
LREC
SIG:
Publisher:
European Language Resources Association (ELRA)
Note:
Pages:
3356–3361
Language:
URL:
http://www.lrec-conf.org/proceedings/lrec2012/pdf/330_Paper.pdf
DOI:
Bibkey:
Cite (ACL):
Thomas Ulrich Christiansen and Peter Juel Henrichsen. 2012. Sense Meets Nonsense - Sense Meets Nonsense - a dual-layer Danish speech corpus for perception studies. In Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12), pages 3356–3361, Istanbul, Turkey. European Language Resources Association (ELRA).
Cite (Informal):
Sense Meets Nonsense - Sense Meets Nonsense - a dual-layer Danish speech corpus for perception studies (Christiansen & Henrichsen, LREC 2012)
Copy Citation:
PDF:
http://www.lrec-conf.org/proceedings/lrec2012/pdf/330_Paper.pdf