Product Classification in E-Commerce using Distributional Semantics

Vivek Gupta, Harish Karnick, Ashendra Bansal, Pradhuman Jhala


Abstract
Product classification is the task of automatically predicting a taxonomy path for a product in a predefined taxonomy hierarchy, given a textual product description or title. Efficient product classification requires a suitable feature-vector representation of a document (the textual description of a product) and fast algorithms for prediction. To address these challenges, we propose a new distributional semantics representation for document vector formation. We also develop a new two-level ensemble approach that uses path-wise, node-wise and depth-wise classifiers (defined with respect to the taxonomy tree) to reduce error in the final product classification task. Our experiments on data sets from a leading e-commerce platform show the effectiveness of the distributional representation and the ensemble approach, which achieve improved results on various evaluation metrics compared to earlier approaches.
Anthology ID:
C16-1052
Volume:
Proceedings of COLING 2016, the 26th International Conference on Computational Linguistics: Technical Papers
Month:
December
Year:
2016
Address:
Osaka, Japan
Editors:
Yuji Matsumoto, Rashmi Prasad
Venue:
COLING
Publisher:
The COLING 2016 Organizing Committee
Pages:
536–546
URL:
https://aclanthology.org/C16-1052
Cite (ACL):
Vivek Gupta, Harish Karnick, Ashendra Bansal, and Pradhuman Jhala. 2016. Product Classification in E-Commerce using Distributional Semantics. In Proceedings of COLING 2016, the 26th International Conference on Computational Linguistics: Technical Papers, pages 536–546, Osaka, Japan. The COLING 2016 Organizing Committee.
Cite (Informal):
Product Classification in E-Commerce using Distributional Semantics (Gupta et al., COLING 2016)
PDF:
https://aclanthology.org/C16-1052.pdf