Deep Learning for Social Media Health Text Classification

Santosh Tokala, Vaibhav Gambhir, Animesh Mukherjee


Abstract
This paper describes the systems developed for 1st and 2nd tasks of the 3rd Social Media Mining for Health Applications Shared Task at EMNLP 2018. The first task focuses on automatic detection of posts mentioning a drug name or dietary supplement, a binary classification. The second task is about distinguishing the tweets that present personal medication intake, possible medication intake and non-intake. We performed extensive experiments with various classifiers like Logistic Regression, Random Forest, SVMs, Gradient Boosted Decision Trees (GBDT) and deep learning architectures such as Long Short-Term Memory Networks (LSTM), jointed Convolutional Neural Networks (CNN) and LSTM architecture, and attention based LSTM architecture both at word and character level. We have also explored using various pre-trained embeddings like Global Vectors for Word Representation (GloVe), Word2Vec and task-specific embeddings learned using CNN-LSTM and LSTMs.
Anthology ID:
W18-5917
Volume:
Proceedings of the 2018 EMNLP Workshop SMM4H: The 3rd Social Media Mining for Health Applications Workshop & Shared Task
Month:
October
Year:
2018
Address:
Brussels, Belgium
Editors:
Graciela Gonzalez-Hernandez, Davy Weissenbacher, Abeed Sarker, Michael Paul
Venue:
EMNLP
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
61–64
Language:
URL:
https://aclanthology.org/W18-5917
DOI:
10.18653/v1/W18-5917
Bibkey:
Cite (ACL):
Santosh Tokala, Vaibhav Gambhir, and Animesh Mukherjee. 2018. Deep Learning for Social Media Health Text Classification. In Proceedings of the 2018 EMNLP Workshop SMM4H: The 3rd Social Media Mining for Health Applications Workshop & Shared Task, pages 61–64, Brussels, Belgium. Association for Computational Linguistics.
Cite (Informal):
Deep Learning for Social Media Health Text Classification (Tokala et al., EMNLP 2018)
Copy Citation:
PDF:
https://aclanthology.org/W18-5917.pdf