PATATRA and PATAFreq: two French databases for the documentation of within-speaker variability in speech

Cécile Fougeron, Nicolas Audibert, Cedric Gendrot, Estelle Chardenon, Louise Wohmann


Abstract
Our knowledge on speech is historically built on data comparing different speakers or data averaged across speakers. Consequently, little is known on the variability in the speech of a single individual. Experimental studies have shown that speakers adapt to the linguistic and the speaking contexts, and modify their speech according to their emotional or biological condition, etc. However, it is unclear how much speakers vary from one repetition to the next, and how comparable are recordings that are collected days, months or years apart. In this paper, we introduce two French databases which contain recordings of 9 to 11 speakers recorded over 9 to 18 sessions, allowing comparisons of speech tasks with a different delay between the repetitions: 3 repetitions within the same session, 6 to 10 repetitions on different days during a two months period, 5 to 9 repetitions on different years. Speakers are recorded on a large set of speech tasks including read and spontaneous speech as well as speech-like performance tasks. In this paper, we provide detailed descriptions of the two databases and available annotations. We conclude by an illustration on how these data can inform on within-speaker variability of speech.
Anthology ID:
2022.lrec-1.207
Volume:
Proceedings of the Thirteenth Language Resources and Evaluation Conference
Month:
June
Year:
2022
Address:
Marseille, France
Venue:
LREC
SIG:
Publisher:
European Language Resources Association
Note:
Pages:
1939–1944
Language:
URL:
https://aclanthology.org/2022.lrec-1.207
DOI:
Bibkey:
Cite (ACL):
Cécile Fougeron, Nicolas Audibert, Cedric Gendrot, Estelle Chardenon, and Louise Wohmann. 2022. PATATRA and PATAFreq: two French databases for the documentation of within-speaker variability in speech. In Proceedings of the Thirteenth Language Resources and Evaluation Conference, pages 1939–1944, Marseille, France. European Language Resources Association.
Cite (Informal):
PATATRA and PATAFreq: two French databases for the documentation of within-speaker variability in speech (Fougeron et al., LREC 2022)
Copy Citation:
PDF:
https://aclanthology.org/2022.lrec-1.207.pdf