Peperomia at SemEval-2018 Task 2: Vector Similarity Based Approach for Emoji Prediction

Jing Chen, Dechuan Yang, Xilian Li, Wei Chen, Tengjiao Wang


Abstract
This paper describes our participation in SemEval 2018 Task 2: Multilingual Emoji Prediction, in which participants are asked to predict a tweet’s most associated emoji from 20 emojis. Instead of regarding it as a 20-class classification problem we regard it as a text similarity problem. We propose a vector similarity based approach for this task. First the distributed representation (tweet vector) for each tweet is generated, then the similarity between this tweet vector and each emoji’s embedding is evaluated. The most similar emoji is chosen as the predicted label. Experimental results show that our approach performs comparably with the classification approach and shows its advantage in classifying emojis with similar semantic meaning.
Anthology ID:
S18-1067
Volume:
Proceedings of the 12th International Workshop on Semantic Evaluation
Month:
June
Year:
2018
Address:
New Orleans, Louisiana
Editors:
Marianna Apidianaki, Saif M. Mohammad, Jonathan May, Ekaterina Shutova, Steven Bethard, Marine Carpuat
Venue:
SemEval
SIG:
SIGLEX
Publisher:
Association for Computational Linguistics
Note:
Pages:
428–432
Language:
URL:
https://aclanthology.org/S18-1067
DOI:
10.18653/v1/S18-1067
Bibkey:
Cite (ACL):
Jing Chen, Dechuan Yang, Xilian Li, Wei Chen, and Tengjiao Wang. 2018. Peperomia at SemEval-2018 Task 2: Vector Similarity Based Approach for Emoji Prediction. In Proceedings of the 12th International Workshop on Semantic Evaluation, pages 428–432, New Orleans, Louisiana. Association for Computational Linguistics.
Cite (Informal):
Peperomia at SemEval-2018 Task 2: Vector Similarity Based Approach for Emoji Prediction (Chen et al., SemEval 2018)
Copy Citation:
PDF:
https://aclanthology.org/S18-1067.pdf