An Examination of Cross-Cultural Similarities and Differences from Social Media Data with respect to Language Use

Mohammad Fazleh Elahi, Paola Monachesi


Abstract
We present a methodology for analyzing cross-cultural similarities and differences using language as a medium, love as domain, social media as a data source and 'Terms' and 'Topics' as cultural features. We discuss the techniques necessary for the creation of the social data corpus from which emotion terms have been extracted using NLP techniques. Topics of love discussion were then extracted from the corpus by means of Latent Dirichlet Allocation (LDA). Finally, on the basis of these features, a cross-cultural comparison was carried out. For the purpose of cross-cultural analysis, the experimental focus was on comparing data from a culture from the East (India) with a culture from the West (United States of America). Similarities and differences between these cultures have been analyzed with respect to the usage of emotions, their intensities and the topics used during love discussion in social media.
Anthology ID:
L12-1561
Volume:
Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12)
Month:
May
Year:
2012
Address:
Istanbul, Turkey
Editors:
Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Mehmet Uğur Doğan, Bente Maegaard, Joseph Mariani, Asuncion Moreno, Jan Odijk, Stelios Piperidis
Venue:
LREC
SIG:
Publisher:
European Language Resources Association (ELRA)
Note:
Pages:
4080–4086
Language:
URL:
http://www.lrec-conf.org/proceedings/lrec2012/pdf/942_Paper.pdf
DOI:
Bibkey:
Cite (ACL):
Mohammad Fazleh Elahi and Paola Monachesi. 2012. An Examination of Cross-Cultural Similarities and Differences from Social Media Data with respect to Language Use. In Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12), pages 4080–4086, Istanbul, Turkey. European Language Resources Association (ELRA).
Cite (Informal):
An Examination of Cross-Cultural Similarities and Differences from Social Media Data with respect to Language Use (Elahi & Monachesi, LREC 2012)
Copy Citation:
PDF:
http://www.lrec-conf.org/proceedings/lrec2012/pdf/942_Paper.pdf