Improving Spoken Language Understanding by Wisdom of Crowds

Koichiro Yoshino, Kana Ikeuchi, Katsuhito Sudoh, Satoshi Nakamura


Abstract
Spoken language understanding (SLU), which converts user requests in natural language to machine-interpretable expressions, is becoming an essential task. The lack of training data is an important problem, especially for new system tasks, because existing SLU systems are based on statistical approaches. In this paper, we proposed to use two sources of the “wisdom of crowds,” crowdsourcing and knowledge community website, for improving the SLU system. We firstly collected paraphrasing variations for new system tasks through crowdsourcing as seed data, and then augmented them using similar questions from a knowledge community website. We investigated the effects of the proposed data augmentation method in SLU task, even with small seed data. In particular, the proposed architecture augmented more than 120,000 samples to improve SLU accuracies.
Anthology ID:
2020.coling-main.234
Volume:
Proceedings of the 28th International Conference on Computational Linguistics
Month:
December
Year:
2020
Address:
Barcelona, Spain (Online)
Editors:
Donia Scott, Nuria Bel, Chengqing Zong
Venue:
COLING
SIG:
Publisher:
International Committee on Computational Linguistics
Note:
Pages:
2606–2612
Language:
URL:
https://aclanthology.org/2020.coling-main.234
DOI:
10.18653/v1/2020.coling-main.234
Bibkey:
Cite (ACL):
Koichiro Yoshino, Kana Ikeuchi, Katsuhito Sudoh, and Satoshi Nakamura. 2020. Improving Spoken Language Understanding by Wisdom of Crowds. In Proceedings of the 28th International Conference on Computational Linguistics, pages 2606–2612, Barcelona, Spain (Online). International Committee on Computational Linguistics.
Cite (Informal):
Improving Spoken Language Understanding by Wisdom of Crowds (Yoshino et al., COLING 2020)
Copy Citation:
PDF:
https://aclanthology.org/2020.coling-main.234.pdf