Assessing Arabic Weblog Credibility via Deep Co-learning

Chadi Helwe, Shady Elbassuoni, Ayman Al Zaatari, Wassim El-Hajj


Abstract
Assessing the credibility of online content has attracted considerable attention recently. We focus on one type of online content, namely weblogs, or blogs for short. Recent work has attempted to assess the credibility of blogs automatically, typically via machine learning. However, for Arabic blogs there are hardly any datasets available that can be used to train robust machine learning models for this difficult task. To overcome the lack of sufficient training data, we propose deep co-learning, a semi-supervised end-to-end deep learning approach to assess the credibility of Arabic blogs. In deep co-learning, multiple weak deep neural network classifiers are trained on a small labeled dataset, each using a different view of the data. Each classifier is then used to classify unlabeled data, and its predictions are used to train the other classifiers in a semi-supervised fashion. We evaluate our deep co-learning approach on an Arabic blog dataset, and we report significant improvements in performance compared to many baselines, including fully supervised deep learning models as well as ensemble models.
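The co-learning loop described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the paper trains deep neural networks, whereas this sketch swaps in a tiny nearest-centroid classifier per view so that it runs with the standard library alone. The names `CentroidClassifier` and `co_train`, the toy data, and the rule of passing on only the most confident predictions each round are all assumptions made for illustration.

```python
class CentroidClassifier:
    """Hypothetical stand-in for a weak neural classifier on one view."""

    def fit(self, X, y):
        # One centroid per class: the mean feature vector of that class.
        self.centroids = {}
        for label in set(y):
            rows = [x for x, t in zip(X, y) if t == label]
            self.centroids[label] = [sum(col) / len(rows) for col in zip(*rows)]
        return self

    def predict_with_confidence(self, x):
        # Squared distance to every centroid; the nearest one wins.
        dists = {c: sum((a - b) ** 2 for a, b in zip(x, mu))
                 for c, mu in self.centroids.items()}
        ranked = sorted(dists, key=dists.get)
        # Confidence is the margin between the two nearest centroids.
        margin = dists[ranked[1]] - dists[ranked[0]] if len(ranked) > 1 else 0.0
        return ranked[0], margin


def co_train(labeled_views, labels, unlabeled_views, rounds=5, per_round=2):
    """Assumed co-training loop: each view's classifier pseudo-labels
    unlabeled examples for the classifiers of the other views."""
    n_views = len(labeled_views)
    # Every classifier keeps its own training set, seeded with the labeled data.
    train_X = [list(labeled_views[v]) for v in range(n_views)]
    train_y = [list(labels) for _ in range(n_views)]
    pool = list(range(len(unlabeled_views[0])))  # indices still unlabeled
    models = [CentroidClassifier() for _ in range(n_views)]

    for _ in range(rounds):
        for v in range(n_views):
            models[v].fit(train_X[v], train_y[v])
        if not pool:
            break
        for v in range(n_views):
            scored = [(models[v].predict_with_confidence(unlabeled_views[v][i]), i)
                      for i in pool]
            scored.sort(key=lambda s: s[0][1], reverse=True)
            # The most confident predictions become training data for the others.
            for (label, _), i in scored[:per_round]:
                for other in range(n_views):
                    if other != v:
                        train_X[other].append(unlabeled_views[other][i])
                        train_y[other].append(label)
                pool.remove(i)

    # Final refit so the last batch of pseudo-labels is used.
    for v in range(n_views):
        models[v].fit(train_X[v], train_y[v])
    return models


# Toy data (invented): two views of the same examples, two classes.
labeled_a = [[0.0, 0.2], [0.4, 0.0], [5.0, 4.8], [4.6, 5.2]]
labeled_b = [[0.1], [0.3], [4.9], [5.3]]
labels = [0, 0, 1, 1]
unlabeled_a = [[0.5, 0.5], [4.5, 4.5], [1.0, 0.8], [4.0, 4.2], [0.2, 0.9], [5.5, 5.1]]
unlabeled_b = [[0.6], [4.4], [1.1], [3.9], [0.0], [5.6]]

models = co_train([labeled_a, labeled_b], labels, [unlabeled_a, unlabeled_b])
```

In this sketch each classifier never trains on its own pseudo-labels, only on those produced by the other view, which is the property the abstract attributes to deep co-learning. How the paper selects which predictions to pass on is not stated in the abstract, so the margin-based selection here should be read as one plausible choice.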
Anthology ID:
W19-4614
Volume:
Proceedings of the Fourth Arabic Natural Language Processing Workshop
Month:
August
Year:
2019
Address:
Florence, Italy
Editors:
Wassim El-Hajj, Lamia Hadrich Belguith, Fethi Bougares, Walid Magdy, Imed Zitouni, Nadi Tomeh, Mahmoud El-Haj, Wajdi Zaghouani
Venue:
WANLP
Publisher:
Association for Computational Linguistics
Pages:
130–136
URL:
https://aclanthology.org/W19-4614
DOI:
10.18653/v1/W19-4614
Cite (ACL):
Chadi Helwe, Shady Elbassuoni, Ayman Al Zaatari, and Wassim El-Hajj. 2019. Assessing Arabic Weblog Credibility via Deep Co-learning. In Proceedings of the Fourth Arabic Natural Language Processing Workshop, pages 130–136, Florence, Italy. Association for Computational Linguistics.
Cite (Informal):
Assessing Arabic Weblog Credibility via Deep Co-learning (Helwe et al., WANLP 2019)
PDF:
https://aclanthology.org/W19-4614.pdf