Dialogue, Speech and Images: the Companions Project Data Set

Yorick Wilks, David Benyon, Christopher Brewster, Pavel Ircing, Oli Mival


Abstract
This paper describes part of the corpus collection efforts underway in the EC funded Companions project. The Companions project is collecting substantial quantities of dialogue a large part of which focus on reminiscing about photographs. The texts are in English and Czech. We describe the context and objectives for which this dialogue corpus is being collected, the methodology being used and make observations on the resulting data. The corpora will be made available to the wider research community through the Companions Project web site.
Anthology ID:
L08-1197
Volume:
Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC'08)
Month:
May
Year:
2008
Address:
Marrakech, Morocco
Editors:
Nicoletta Calzolari, Khalid Choukri, Bente Maegaard, Joseph Mariani, Jan Odijk, Stelios Piperidis, Daniel Tapias
Venue:
LREC
SIG:
Publisher:
European Language Resources Association (ELRA)
Note:
Pages:
Language:
URL:
http://www.lrec-conf.org/proceedings/lrec2008/pdf/550_paper.pdf
DOI:
Bibkey:
Cite (ACL):
Yorick Wilks, David Benyon, Christopher Brewster, Pavel Ircing, and Oli Mival. 2008. Dialogue, Speech and Images: the Companions Project Data Set. In Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC'08), Marrakech, Morocco. European Language Resources Association (ELRA).
Cite (Informal):
Dialogue, Speech and Images: the Companions Project Data Set (Wilks et al., LREC 2008)
Copy Citation:
PDF:
http://www.lrec-conf.org/proceedings/lrec2008/pdf/550_paper.pdf