Multimodal Corpus of Multi-party Conversations in Second Language

Shota Yamasaki, Hirohisa Furukawa, Masafumi Nishida, Kristiina Jokinen, Seiichi Yamamoto


Abstract
We developed a dialogue-based tutoring system for teaching English to Japanese students and plan to transfer the current software tutoring agent into an embodied robot in the hope that the robot will enrich conversation by allowing more natural interactions in small group learning situations. To enable smooth communication between an intelligent agent and the user, the agent must have realistic models on when to take turns, when to interrupt, and how to catch the partner's attention. For developing the realistic models applicable for computer assisted language learning systems, we also need to consider the differences between the mother tongue and second language that affect communication style. We collected a multimodal corpus of multi-party conversations in English as the second language to investigate the differences in communication styles. We describe our multimodal corpus and explore features of communication style e.g. filled pauses, and non-verbal information, such as eye-gaze, which show different characteristics between the mother tongue and second language.
Anthology ID:
L12-1120
Volume:
Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12)
Month:
May
Year:
2012
Address:
Istanbul, Turkey
Editors:
Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Mehmet Uğur Doğan, Bente Maegaard, Joseph Mariani, Asuncion Moreno, Jan Odijk, Stelios Piperidis
Venue:
LREC
SIG:
Publisher:
European Language Resources Association (ELRA)
Note:
Pages:
416–421
Language:
URL:
http://www.lrec-conf.org/proceedings/lrec2012/pdf/280_Paper.pdf
DOI:
Bibkey:
Cite (ACL):
Shota Yamasaki, Hirohisa Furukawa, Masafumi Nishida, Kristiina Jokinen, and Seiichi Yamamoto. 2012. Multimodal Corpus of Multi-party Conversations in Second Language. In Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12), pages 416–421, Istanbul, Turkey. European Language Resources Association (ELRA).
Cite (Informal):
Multimodal Corpus of Multi-party Conversations in Second Language (Yamasaki et al., LREC 2012)
Copy Citation:
PDF:
http://www.lrec-conf.org/proceedings/lrec2012/pdf/280_Paper.pdf