Adapting Text-based Dialogue State Tracker for Spoken Dialogues

Jaeseok Yoon, Seunghyun Hwang, Han Ran, Jeong-Uk Bang, Kee-Eung Kim


Abstract
Although there have been remarkable advances in dialogue systems through the dialogue systems technology competition (DSTC), it remains one of the key challenges to building a robust task-oriented dialogue system with a speech interface. Most of the progress has been made for text-based dialogue systems since there are abundant datasets with written cor- pora while those with spoken dialogues are very scarce. However, as can be seen from voice assistant systems such as Siri and Alexa, it is of practical importance to transfer the success to spoken dialogues. In this paper, we describe our engineering effort in building a highly successful model that participated in the speech-aware dialogue systems technology challenge track in DSTC11. Our model consists of three major modules: (1) automatic speech recognition error correction to bridge the gap between the spoken and the text utterances, (2) text-based dialogue system (D3ST) for estimating the slots and values using slot descriptions, and (3) post-processing for recovering the error of the estimated slot value. Our experiments show that it is important to use an explicit automatic speech recognition error correction module, post-processing, and data augmentation to adapt a text-based dialogue state tracker for spoken dialogue corpora.
Anthology ID:
2023.dstc-1.10
Volume:
Proceedings of The Eleventh Dialog System Technology Challenge
Month:
September
Year:
2023
Address:
Prague, Czech Republic
Editors:
Yun-Nung Chen, Paul Crook, Michel Galley, Sarik Ghazarian, Chulaka Gunasekara, Raghav Gupta, Behnam Hedayatnia, Satwik Kottur, Seungwhan Moon, Chen Zhang
Venues:
DSTC | WS
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
81–88
Language:
URL:
https://aclanthology.org/2023.dstc-1.10
DOI:
Bibkey:
Cite (ACL):
Jaeseok Yoon, Seunghyun Hwang, Han Ran, Jeong-Uk Bang, and Kee-Eung Kim. 2023. Adapting Text-based Dialogue State Tracker for Spoken Dialogues. In Proceedings of The Eleventh Dialog System Technology Challenge, pages 81–88, Prague, Czech Republic. Association for Computational Linguistics.
Cite (Informal):
Adapting Text-based Dialogue State Tracker for Spoken Dialogues (Yoon et al., DSTC-WS 2023)
Copy Citation:
PDF:
https://aclanthology.org/2023.dstc-1.10.pdf