Cross-Lingual Sentiment Analysis Without (Good) Translation

Mohamed Abdalla, Graeme Hirst


Abstract
Current approaches to cross-lingual sentiment analysis try to leverage the wealth of labeled English data using bilingual lexicons, bilingual vector space embeddings, or machine translation systems. Here we show that it is possible to use a single linear transformation, with as few as 2000 word pairs, to capture fine-grained sentiment relationships between words in a cross-lingual setting. We apply these cross-lingual sentiment models to a diverse set of tasks to demonstrate their functionality in a non-English context. By effectively leveraging English sentiment knowledge without the need for accurate translation, we can analyze and extract features from other languages with scarce data at a very low cost, thus making sentiment and related analyses for many languages inexpensive.
Anthology ID:
I17-1051
Volume:
Proceedings of the Eighth International Joint Conference on Natural Language Processing (Volume 1: Long Papers)
Month:
November
Year:
2017
Address:
Taipei, Taiwan
Editors:
Greg Kondrak, Taro Watanabe
Venue:
IJCNLP
SIG:
Publisher:
Asian Federation of Natural Language Processing
Note:
Pages:
506–515
Language:
URL:
https://aclanthology.org/I17-1051
DOI:
Bibkey:
Cite (ACL):
Mohamed Abdalla and Graeme Hirst. 2017. Cross-Lingual Sentiment Analysis Without (Good) Translation. In Proceedings of the Eighth International Joint Conference on Natural Language Processing (Volume 1: Long Papers), pages 506–515, Taipei, Taiwan. Asian Federation of Natural Language Processing.
Cite (Informal):
Cross-Lingual Sentiment Analysis Without (Good) Translation (Abdalla & Hirst, IJCNLP 2017)
Copy Citation:
PDF:
https://aclanthology.org/I17-1051.pdf