End-to-End Speech to Intent Prediction to improve E-commerce Customer Support Voicebot in Hindi and English

Abhinav Goyal, Anupam Singh, Nikesh Garera


Abstract
Automation of on-call customer support relies heavily on accurate and efficient speech-to-intent (S2I) systems. Building such systems using multi-component pipelines can pose various challenges because they require large annotated datasets, have higher latency, and have complex deployment. These pipelines are also prone to compounding errors. To overcome these challenges, we discuss an end-to-end (E2E) S2I model for customer support voicebot task in a bilingual setting. We show how we can solve E2E intent classification by leveraging a pre-trained automatic speech recognition (ASR) model with slight modification and fine-tuning on small annotated datasets. Experimental results show that our best E2E model outperforms a conventional pipeline by a relative ~27% on the F1 score.
Anthology ID:
2022.emnlp-industry.59
Volume:
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing: Industry Track
Month:
December
Year:
2022
Address:
Abu Dhabi, UAE
Editors:
Yunyao Li, Angeliki Lazaridou
Venue:
EMNLP
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
579–586
Language:
URL:
https://aclanthology.org/2022.emnlp-industry.59
DOI:
10.18653/v1/2022.emnlp-industry.59
Bibkey:
Cite (ACL):
Abhinav Goyal, Anupam Singh, and Nikesh Garera. 2022. End-to-End Speech to Intent Prediction to improve E-commerce Customer Support Voicebot in Hindi and English. In Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing: Industry Track, pages 579–586, Abu Dhabi, UAE. Association for Computational Linguistics.
Cite (Informal):
End-to-End Speech to Intent Prediction to improve E-commerce Customer Support Voicebot in Hindi and English (Goyal et al., EMNLP 2022)
Copy Citation:
PDF:
https://aclanthology.org/2022.emnlp-industry.59.pdf