Cross-Cultural Transfer Learning for Text Classification

Dor Ringel, Gal Lavee, Ido Guy, Kira Radinsky


Abstract
Large training datasets are required to achieve competitive performance in most natural language tasks. The acquisition process for these datasets is labor intensive, expensive, and time consuming. This process is also prone to human errors. In this work, we show that cross-cultural differences can be harnessed for natural language text classification. We present a transfer-learning framework that leverages widely-available unaligned bilingual corpora for classification tasks, using no task-specific data. Our empirical evaluation on two tasks – formality classification and sarcasm detection – shows that the cross-cultural difference between German and American English, as manifested in product review text, can be applied to achieve good performance for formality classification, while the difference between Japanese and American English can be applied to achieve good performance for sarcasm detection – both without any task-specific labeled data.
Anthology ID:
D19-1400
Volume:
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)
Month:
November
Year:
2019
Address:
Hong Kong, China
Editors:
Kentaro Inui, Jing Jiang, Vincent Ng, Xiaojun Wan
Venues:
EMNLP | IJCNLP
SIG:
SIGDAT
Publisher:
Association for Computational Linguistics
Note:
Pages:
3873–3883
Language:
URL:
https://aclanthology.org/D19-1400
DOI:
10.18653/v1/D19-1400
Bibkey:
Cite (ACL):
Dor Ringel, Gal Lavee, Ido Guy, and Kira Radinsky. 2019. Cross-Cultural Transfer Learning for Text Classification. In Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP), pages 3873–3883, Hong Kong, China. Association for Computational Linguistics.
Cite (Informal):
Cross-Cultural Transfer Learning for Text Classification (Ringel et al., EMNLP-IJCNLP 2019)
Copy Citation:
PDF:
https://aclanthology.org/D19-1400.pdf