Deep Learning Techniques for Humor Detection in Hindi-English Code-Mixed Tweets

Sushmitha Reddy Sane, Suraj Tripathi, Koushik Reddy Sane, Radhika Mamidi


Abstract
We propose bilingual word embeddings based on word2vec and fastText models (CBOW and Skip-gram) to address the problem of Humor detection in Hindi-English code-mixed tweets in combination with deep learning architectures. We focus on deep learning approaches which are not widely used on code-mixed data and analyzed their performance by experimenting with three different neural network models. We propose convolution neural network (CNN) and bidirectional long-short term memory (biLSTM) (with and without Attention) models which take the generated bilingual embeddings as input. We make use of Twitter data to create bilingual word embeddings. All our proposed architectures outperform the state-of-the-art results, and Attention-based bidirectional LSTM model achieved an accuracy of 73.6% which is an increment of more than 4% compared to the current state-of-the-art results.
Anthology ID:
W19-1307
Volume:
Proceedings of the Tenth Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis
Month:
June
Year:
2019
Address:
Minneapolis, USA
Venue:
WASSA
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
57–61
Language:
URL:
https://aclanthology.org/W19-1307
DOI:
10.18653/v1/W19-1307
Bibkey:
Cite (ACL):
Sushmitha Reddy Sane, Suraj Tripathi, Koushik Reddy Sane, and Radhika Mamidi. 2019. Deep Learning Techniques for Humor Detection in Hindi-English Code-Mixed Tweets. In Proceedings of the Tenth Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis, pages 57–61, Minneapolis, USA. Association for Computational Linguistics.
Cite (Informal):
Deep Learning Techniques for Humor Detection in Hindi-English Code-Mixed Tweets (Sane et al., WASSA 2019)
Copy Citation:
PDF:
https://aclanthology.org/W19-1307.pdf