French Learners Audio Corpus of German Speech (FLACGS)

Jane Wottawa, Martine Adda-Decker


Abstract
The French Learners Audio Corpus of German Speech (FLACGS) was created to compare the German speech production of German native speakers (GG) and French learners of German (FG) across three speech production tasks of increasing complexity: repetition, reading and picture description. Forty speakers (20 GG and 20 FG) performed each of the three tasks, yielding approximately 7 hours of speech in total. The corpus was manually transcribed and automatically aligned. Analyses that can be performed on this type of corpus include, for instance, segmental differences between the speech production of L2 learners and that of native speakers. We chose to examine the realization of the velar nasal consonant engma. In spoken French, engma does not appear in a VCV context, which leads to production difficulties in FG. With increasing speech production complexity (reading and picture description), engma is realized as engma + plosive by FG in over 50% of the cases. The results of a two-way ANOVA with unequal sample sizes on the durations of the different realizations of engma indicate that duration is a reliable factor for distinguishing between engma and engma + plosive in FG productions compared to the engma productions of GG in a VCV context. The FLACGS corpus allows the study of L2 production and perception.
Anthology ID:
L16-1512
Volume:
Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16)
Month:
May
Year:
2016
Address:
Portorož, Slovenia
Editors:
Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Sara Goggi, Marko Grobelnik, Bente Maegaard, Joseph Mariani, Helene Mazo, Asuncion Moreno, Jan Odijk, Stelios Piperidis
Venue:
LREC
Publisher:
European Language Resources Association (ELRA)
Pages:
3215–3219
URL:
https://aclanthology.org/L16-1512
Cite (ACL):
Jane Wottawa and Martine Adda-Decker. 2016. French Learners Audio Corpus of German Speech (FLACGS). In Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16), pages 3215–3219, Portorož, Slovenia. European Language Resources Association (ELRA).
Cite (Informal):
French Learners Audio Corpus of German Speech (FLACGS) (Wottawa & Adda-Decker, LREC 2016)
PDF:
https://aclanthology.org/L16-1512.pdf