Revisiting the Boundary between ASR and NLU in the Age of Conversational Dialog Systems

Manaal Faruqui, Dilek Hakkani-Tür


Abstract
As more users across the world are interacting with dialog agents in their daily life, there is a need for better speech understanding that calls for renewed attention to the dynamics between research in automatic speech recognition (ASR) and natural language understanding (NLU). We briefly review these research areas and lay out the current relationship between them. In light of the observations we make in this article, we argue that (1) NLU should be cognizant of the presence of ASR models being used upstream in a dialog system’s pipeline, (2) ASR should be able to learn from errors found in NLU, (3) there is a need for end-to-end data sets that provide semantic annotations on spoken input, (4) there should be stronger collaboration between ASR and NLU research communities.
Anthology ID:
2022.cl-1.8
Volume:
Computational Linguistics, Volume 48, Issue 1 - March 2022
Month:
March
Year:
2022
Address:
Cambridge, MA
Venue:
CL
SIG:
Publisher:
MIT Press
Note:
Pages:
221–232
Language:
URL:
https://aclanthology.org/2022.cl-1.8
DOI:
10.1162/coli_a_00430
Bibkey:
Cite (ACL):
Manaal Faruqui and Dilek Hakkani-Tür. 2022. Revisiting the Boundary between ASR and NLU in the Age of Conversational Dialog Systems. Computational Linguistics, 48(1):221–232.
Cite (Informal):
Revisiting the Boundary between ASR and NLU in the Age of Conversational Dialog Systems (Faruqui & Hakkani-Tür, CL 2022)
Copy Citation:
PDF:
https://aclanthology.org/2022.cl-1.8.pdf
Data
CoQASQuADSpoken-SQuAD