Word Embeddings and Convolutional Neural Network for Arabic Sentiment Classification

Abdelghani Dahou, Shengwu Xiong, Junwei Zhou, Mohamed Houcine Haddoud, Pengfei Duan


Abstract
With the development and the advancement of social networks, forums, blogs and online sales, a growing number of Arabs are expressing their opinions on the web. In this paper, a scheme of Arabic sentiment classification, which evaluates and detects the sentiment polarity from Arabic reviews and Arabic social media, is studied. We investigated in several architectures to build a quality neural word embeddings using a 3.4 billion words corpus from a collected 10 billion words web-crawled corpus. Moreover, a convolutional neural network trained on top of pre-trained Arabic word embeddings is used for sentiment classification to evaluate the quality of these word embeddings. The simulation results show that the proposed scheme outperforms the existed methods on 4 out of 5 balanced and unbalanced datasets.
Anthology ID:
C16-1228
Volume:
Proceedings of COLING 2016, the 26th International Conference on Computational Linguistics: Technical Papers
Month:
December
Year:
2016
Address:
Osaka, Japan
Editors:
Yuji Matsumoto, Rashmi Prasad
Venue:
COLING
SIG:
Publisher:
The COLING 2016 Organizing Committee
Note:
Pages:
2418–2427
Language:
URL:
https://aclanthology.org/C16-1228
DOI:
Bibkey:
Cite (ACL):
Abdelghani Dahou, Shengwu Xiong, Junwei Zhou, Mohamed Houcine Haddoud, and Pengfei Duan. 2016. Word Embeddings and Convolutional Neural Network for Arabic Sentiment Classification. In Proceedings of COLING 2016, the 26th International Conference on Computational Linguistics: Technical Papers, pages 2418–2427, Osaka, Japan. The COLING 2016 Organizing Committee.
Cite (Informal):
Word Embeddings and Convolutional Neural Network for Arabic Sentiment Classification (Dahou et al., COLING 2016)
Copy Citation:
PDF:
https://aclanthology.org/C16-1228.pdf
Code
 dahouabdelghani/arabic_word_embeddings_CNN
Data
ASTDLABR