Stanceosaurus 2.0 - Classifying Stance Towards Russian and Spanish Misinformation

Anton Lavrouk; Ian Ligon; Jonathan Zheng; Tarek Naous; Wei Xu; Alan Ritter

doi:10.18653/v1/2024.wnut-1.4

Stanceosaurus 2.0 - Classifying Stance Towards Russian and Spanish Misinformation

Anton Lavrouk, Ian Ligon, Jonathan Zheng, Tarek Naous, Wei Xu, Alan Ritter

Abstract

The Stanceosaurus corpus (Zheng et al., 2022) was designed to provide high-quality, annotated, 5-way stance data extracted from Twitter, suitable for analyzing cross-cultural and cross-lingual misinformation. In the Stanceosaurus 2.0 iteration, we extend this framework to encompass Russian and Spanish. The former is of current significance due to prevalent misinformation amid escalating tensions with the West and the violent incursion into Ukraine. The latter, meanwhile, represents an enormous community that has been largely overlooked on major social media platforms. By incorporating an additional 3,874 Spanish and Russian tweets over 41 misinformation claims, our objective is to support research focused on these issues. To demonstrate the value of this data, we employed zero-shot cross-lingual transfer on multilingual BERT, yielding results on par with the initial Stanceosaurus study with a macro F1 score of 43 for both languages. This underlines the viability of stance classification as an effective tool for identifying multicultural misinformation.

Anthology ID:: 2024.wnut-1.4
Volume:: Proceedings of the Ninth Workshop on Noisy and User-generated Text (W-NUT 2024)
Month:: March
Year:: 2024
Address:: San Ġiljan, Malta
Editors:: Rob van der Goot, JinYeong Bak, Max Müller-Eberstein, Wei Xu, Alan Ritter, Tim Baldwin
Venues:: WNUT | WS
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 31–43
Language:
URL:: https://aclanthology.org/2024.wnut-1.4/
DOI:: 10.18653/v1/2024.wnut-1.4
Bibkey:
Cite (ACL):: Anton Lavrouk, Ian Ligon, Jonathan Zheng, Tarek Naous, Wei Xu, and Alan Ritter. 2024. Stanceosaurus 2.0 - Classifying Stance Towards Russian and Spanish Misinformation. In Proceedings of the Ninth Workshop on Noisy and User-generated Text (W-NUT 2024), pages 31–43, San Ġiljan, Malta. Association for Computational Linguistics.
Cite (Informal):: Stanceosaurus 2.0 - Classifying Stance Towards Russian and Spanish Misinformation (Lavrouk et al., WNUT 2024)
Copy Citation:
PDF:: https://aclanthology.org/2024.wnut-1.4.pdf
Video:: https://aclanthology.org/2024.wnut-1.4.mp4

PDF Cite Search Video Fix data