Interactive Evaluation of Dialog Track at DSTC9

Shikib Mehri; Yulan Feng; Carla Gordon; Seyed Hossein Alavi; David Traum; Maxine Eskenazi

Interactive Evaluation of Dialog Track at DSTC9

Shikib Mehri, Yulan Feng, Carla Gordon, Seyed Hossein Alavi, David Traum, Maxine Eskenazi

Abstract

The ultimate goal of dialog research is to develop systems that can be effectively used in interactive settings by real users. To this end, we introduced the Interactive Evaluation of Dialog Track at the 9th Dialog System Technology Challenge. This track consisted of two sub-tasks. The first sub-task involved building knowledge-grounded response generation models. The second sub-task aimed to extend dialog models beyond static datasets by assessing them in an interactive setting with real users. Our track challenges participants to develop strong response generation models and explore strategies that extend them to back-and-forth interactions with real users. The progression from static corpora to interactive evaluation introduces unique challenges and facilitates a more thorough assessment of open-domain dialog systems. This paper provides an overview of the track, including the methodology and results. Furthermore, it provides insights into how to best evaluate open-domain dialog models.

Anthology ID:: 2022.lrec-1.616
Volume:: Proceedings of the Thirteenth Language Resources and Evaluation Conference
Month:: June
Year:: 2022
Address:: Marseille, France
Editors:: Nicoletta Calzolari, Frédéric Béchet, Philippe Blache, Khalid Choukri, Christopher Cieri, Thierry Declerck, Sara Goggi, Hitoshi Isahara, Bente Maegaard, Joseph Mariani, Hélène Mazo, Jan Odijk, Stelios Piperidis
Venue:: LREC
SIG:
Publisher:: European Language Resources Association
Note:
Pages:: 5731–5738
Language:
URL:: https://aclanthology.org/2022.lrec-1.616/
DOI:
Bibkey:
Cite (ACL):: Shikib Mehri, Yulan Feng, Carla Gordon, Seyed Hossein Alavi, David Traum, and Maxine Eskenazi. 2022. Interactive Evaluation of Dialog Track at DSTC9. In Proceedings of the Thirteenth Language Resources and Evaluation Conference, pages 5731–5738, Marseille, France. European Language Resources Association.
Cite (Informal):: Interactive Evaluation of Dialog Track at DSTC9 (Mehri et al., LREC 2022)
Copy Citation:
PDF:: https://aclanthology.org/2022.lrec-1.616.pdf

PDF Cite Search Fix data