SWEAT: Scoring Polarization of Topics across Different Corpora

Federico Bianchi, Marco Marelli, Paolo Nicoli, Matteo Palmonari


Abstract
Understanding differences of viewpoints across corpora is a fundamental task for computational social sciences. In this paper, we propose the Sliced Word Embedding Association Test (SWEAT), a novel statistical measure to compute the relative polarization of a topical wordset across two distributional representations. To this end, SWEAT uses two additional wordsets, deemed to have opposite valence, to represent two different poles. We validate our approach and illustrate a case study to show the usefulness of the introduced measure.
Anthology ID:
2021.emnlp-main.788
Volume:
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing
Month:
November
Year:
2021
Address:
Online and Punta Cana, Dominican Republic
Editors:
Marie-Francine Moens, Xuanjing Huang, Lucia Specia, Scott Wen-tau Yih
Venue:
EMNLP
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
10065–10072
Language:
URL:
https://aclanthology.org/2021.emnlp-main.788
DOI:
10.18653/v1/2021.emnlp-main.788
Bibkey:
Cite (ACL):
Federico Bianchi, Marco Marelli, Paolo Nicoli, and Matteo Palmonari. 2021. SWEAT: Scoring Polarization of Topics across Different Corpora. In Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, pages 10065–10072, Online and Punta Cana, Dominican Republic. Association for Computational Linguistics.
Cite (Informal):
SWEAT: Scoring Polarization of Topics across Different Corpora (Bianchi et al., EMNLP 2021)
Copy Citation:
PDF:
https://aclanthology.org/2021.emnlp-main.788.pdf
Video:
 https://aclanthology.org/2021.emnlp-main.788.mp4
Code
 vinid/sweat