Incremental Speech Processing for Voice Assistant Accessibility

Angus Addlesee


Abstract
Speech production is nuanced and unique to every individual, but today’s Spoken Dialogue Systems (SDSs) are trained to use general speech patterns in order to improve performance on various evaluation metrics. However, these patterns do not apply to certain user groups, often the very people who can benefit the most from SDSs. For example, people with dementia produce more disfluent speech than the general population. The healthcare domain is now a popular setting for spoken dialogue and human-robot interaction research, and company behaviour shows a similar trend: charities promote industry voice assistants, the creators of those assistants are obtaining HIPAA compliance, and their features sometimes target vulnerable user groups. It is therefore critical to adapt SDSs to be more accessible.
Anthology ID:
2023.yrrsds-1.3
Volume:
Proceedings of the 19th Annual Meeting of the Young Researchers' Roundtable on Spoken Dialogue Systems
Month:
September
Year:
2023
Address:
Prague, Czechia
Editors:
Vojtech Hudecek, Patricia Schmidtova, Tanvi Dinkar, Javier Chiyah-Garcia, Weronika Sieinska
Venues:
YRRSDS | WS
Publisher:
Association for Computational Linguistics
Pages:
9–11
URL:
https://aclanthology.org/2023.yrrsds-1.3
Cite (ACL):
Angus Addlesee. 2023. Incremental Speech Processing for Voice Assistant Accessibility. In Proceedings of the 19th Annual Meeting of the Young Researchers' Roundtable on Spoken Dialogue Systems, pages 9–11, Prague, Czechia. Association for Computational Linguistics.
Cite (Informal):
Incremental Speech Processing for Voice Assistant Accessibility (Addlesee, YRRSDS-WS 2023)
PDF:
https://aclanthology.org/2023.yrrsds-1.3.pdf