Sentiment Analysis for Low Resource Languages: A Study on Informal Indonesian Tweets

Tuan Anh Le, David Moeljadi, Yasuhide Miura, Tomoko Ohkuma


Abstract
This paper describes our attempt to build a sentiment analysis system for Indonesian tweets. With this system, we can study and identify sentiments and opinions in a text or document computationally. We used four thousand manually labeled tweets collected in February and March 2016 to build the model. Because of the variety of content in tweets, we analyze tweets into eight groups in total, including pos(itive), neg(ative), and neu(tral). Finally, we obtained 73.2% accuracy with Long Short Term Memory (LSTM) without normalizer.
Anthology ID:
W16-5415
Volume:
Proceedings of the 12th Workshop on Asian Language Resources (ALR12)
Month:
December
Year:
2016
Address:
Osaka, Japan
Editors:
Koiti Hasida, Kam-Fai Wong, Nicoletta Calzorari, Key-Sun Choi
Venue:
ALR
SIG:
Publisher:
The COLING 2016 Organizing Committee
Note:
Pages:
123–131
Language:
URL:
https://aclanthology.org/W16-5415
DOI:
Bibkey:
Cite (ACL):
Tuan Anh Le, David Moeljadi, Yasuhide Miura, and Tomoko Ohkuma. 2016. Sentiment Analysis for Low Resource Languages: A Study on Informal Indonesian Tweets. In Proceedings of the 12th Workshop on Asian Language Resources (ALR12), pages 123–131, Osaka, Japan. The COLING 2016 Organizing Committee.
Cite (Informal):
Sentiment Analysis for Low Resource Languages: A Study on Informal Indonesian Tweets (Le et al., ALR 2016)
Copy Citation:
PDF:
https://aclanthology.org/W16-5415.pdf