Iñigo López de Letona


2006

pdf bib
Design and acquisition of a telephone spontaneous speech dialogue corpus in Spanish: DIHANA
José-Miguel Benedí | Eduardo Lleida | Amparo Varona | María-José Castro | Isabel Galiano | Raquel Justo | Iñigo López de Letona | Antonio Miguel
Proceedings of the Fifth International Conference on Language Resources and Evaluation (LREC’06)

In the framework of the DIHANA project, we present the acquisitionprocess of a spontaneous speech dialogue corpus in Spanish. Theselected application consists of information retrieval by telephone for nationwide trains. A total of 900 dialogues from 225 users were acquired using the “Wizard of Oz” technique. In this work, we present the design and planning of the dialogue scenes and the wizard strategy used for the acquisition of the corpus. Then, we also present the acquisition tools and a description of the acquisition process.