Incremental processing of noisy user utterances in the spoken language understanding task

Stefan Constantin, Jan Niehues, Alex Waibel


Abstract
The state-of-the-art neural network architectures make it possible to create spoken language understanding systems with high quality and fast processing time. One major challenge for real-world applications is the high latency of these systems caused by triggered actions with high executions times. If an action can be separated into subactions, the reaction time of the systems can be improved through incremental processing of the user utterance and starting subactions while the utterance is still being uttered. In this work, we present a model-agnostic method to achieve high quality in processing incrementally produced partial utterances. Based on clean and noisy versions of the ATIS dataset, we show how to create datasets with our method to create low-latency natural language understanding components. We get improvements of up to 47.91 absolute percentage points in the metric F1-score.
Anthology ID:
D19-5535
Volume:
Proceedings of the 5th Workshop on Noisy User-generated Text (W-NUT 2019)
Month:
November
Year:
2019
Address:
Hong Kong, China
Editors:
Wei Xu, Alan Ritter, Tim Baldwin, Afshin Rahimi
Venue:
WNUT
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
265–274
Language:
URL:
https://aclanthology.org/D19-5535
DOI:
10.18653/v1/D19-5535
Bibkey:
Cite (ACL):
Stefan Constantin, Jan Niehues, and Alex Waibel. 2019. Incremental processing of noisy user utterances in the spoken language understanding task. In Proceedings of the 5th Workshop on Noisy User-generated Text (W-NUT 2019), pages 265–274, Hong Kong, China. Association for Computational Linguistics.
Cite (Informal):
Incremental processing of noisy user utterances in the spoken language understanding task (Constantin et al., WNUT 2019)
Copy Citation:
PDF:
https://aclanthology.org/D19-5535.pdf
Attachment:
 D19-5535.Attachment.zip