Purely Corpus-based Automatic Conversation Authoring

Guillaume Dubuisson Duplessis, Vincent Letard, Anne-Laure Ligozat, Sophie Rosset


Abstract
This paper presents an automatic corpus-based process to author an open-domain conversational strategy usable both in chatterbot systems and as a fallback strategy for out-of-domain human utterances. Our approach is implemented on a corpus of television drama subtitles. The resulting system is used as a chatterbot to collect a corpus of 41 open-domain textual dialogues with 27 human participants. The general capabilities of the system are studied through objective measures and subjective self-reports in terms of understandability, repetition and coherence of the system responses selected in reaction to human utterances. Subjective evaluations of the collected dialogues are presented with respect to amusement, engagement and enjoyability. The main factors influencing those dimensions in our chatterbot experiment are discussed.
Anthology ID:
L16-1433
Volume:
Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16)
Month:
May
Year:
2016
Address:
Portorož, Slovenia
Editors:
Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Sara Goggi, Marko Grobelnik, Bente Maegaard, Joseph Mariani, Helene Mazo, Asuncion Moreno, Jan Odijk, Stelios Piperidis
Venue:
LREC
Publisher:
European Language Resources Association (ELRA)
Pages:
2728–2735
URL:
https://aclanthology.org/L16-1433
Cite (ACL):
Guillaume Dubuisson Duplessis, Vincent Letard, Anne-Laure Ligozat, and Sophie Rosset. 2016. Purely Corpus-based Automatic Conversation Authoring. In Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16), pages 2728–2735, Portorož, Slovenia. European Language Resources Association (ELRA).
Cite (Informal):
Purely Corpus-based Automatic Conversation Authoring (Dubuisson Duplessis et al., LREC 2016)
PDF:
https://aclanthology.org/L16-1433.pdf