Creating a Dataset for Multilingual Fine-grained Emotion-detection Using Gamification-based Annotation

Emily Öhman, Kaisla Kajava, Jörg Tiedemann, Timo Honkela


Abstract
This paper introduces a gamified framework for fine-grained sentiment analysis and emotion detection. We present a flexible tool, Sentimentator, that can be used for efficient annotation based on crowd sourcing and a self-perpetuating gold standard. We also present a novel dataset with multi-dimensional annotations of emotions and sentiments in movie subtitles that enables research on sentiment preservation across languages and the creation of robust multilingual emotion detection tools. The tools and datasets are public and open-source and can easily be extended and applied for various purposes.
Anthology ID:
W18-6205
Volume:
Proceedings of the 9th Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis
Month:
October
Year:
2018
Address:
Brussels, Belgium
Editors:
Alexandra Balahur, Saif M. Mohammad, Veronique Hoste, Roman Klinger
Venue:
WASSA
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
24–30
Language:
URL:
https://aclanthology.org/W18-6205/
DOI:
10.18653/v1/W18-6205
Bibkey:
Cite (ACL):
Emily Öhman, Kaisla Kajava, Jörg Tiedemann, and Timo Honkela. 2018. Creating a Dataset for Multilingual Fine-grained Emotion-detection Using Gamification-based Annotation. In Proceedings of the 9th Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis, pages 24–30, Brussels, Belgium. Association for Computational Linguistics.
Cite (Informal):
Creating a Dataset for Multilingual Fine-grained Emotion-detection Using Gamification-based Annotation (Öhman et al., WASSA 2018)
Copy Citation:
PDF:
https://aclanthology.org/W18-6205.pdf