Hey Siri. Ok Google. Alexa: A topic modeling of user reviews for smart speakers

Hanh Nguyen, Dirk Hovy


Abstract
User reviews provide a significant source of information for companies to understand their market and audience. In order to discover broad trends in this source, researchers have typically used topic models such as Latent Dirichlet Allocation (LDA). However, while there are metrics to choose the “best” number of topics, it is not clear whether the resulting topics can also provide in-depth, actionable product analysis. Our paper examines this issue by analyzing user reviews from the Best Buy US website for smart speakers. Using coherence scores to choose topics, we test whether the results help us to understand user interests and concerns. We find that while coherence scores are a good starting point for identifying the number of topics, the choice still requires manual adaptation based on domain knowledge to provide market insights. We show that the resulting dimensions capture brand performance and differences, and differentiate the market into two distinct groups with different properties.
Anthology ID:
D19-5510
Volume:
Proceedings of the 5th Workshop on Noisy User-generated Text (W-NUT 2019)
Month:
November
Year:
2019
Address:
Hong Kong, China
Editors:
Wei Xu, Alan Ritter, Tim Baldwin, Afshin Rahimi
Venue:
WNUT
Publisher:
Association for Computational Linguistics
Pages:
76–83
URL:
https://aclanthology.org/D19-5510
DOI:
10.18653/v1/D19-5510
Cite (ACL):
Hanh Nguyen and Dirk Hovy. 2019. Hey Siri. Ok Google. Alexa: A topic modeling of user reviews for smart speakers. In Proceedings of the 5th Workshop on Noisy User-generated Text (W-NUT 2019), pages 76–83, Hong Kong, China. Association for Computational Linguistics.
Cite (Informal):
Hey Siri. Ok Google. Alexa: A topic modeling of user reviews for smart speakers (Nguyen & Hovy, WNUT 2019)
PDF:
https://aclanthology.org/D19-5510.pdf