Debating Europe: A Multilingual Multi-Target Stance Classification Dataset of Online Debates

Valentin Barriere, Alexandra Balahur, Brian Ravenet


Abstract
We present a new dataset of online debates in English, annotated with stance. The dataset was scraped from the “Debating Europe” platform, where users exchange opinions over different subjects related to the European Union. The dataset is composed of 2600 comments pertaining to 18 debates related to the “European Green Deal”, in a conversational setting. After presenting the dataset and the annotated sub-part, we pre-train a model for a multilingual stance classification over the X-stance dataset before fine-tuning it over our dataset, and vice-versa. The fine-tuned models are shown to improve stance classification performance on each of the datasets, even though they have different languages, topics and targets. Subsequently, we propose to enhance the performances over “Debating Europe” with an interaction-aware model, taking advantage of the online debate structure of the platform. We also propose a semi-supervised self-training method to take advantage of the imbalanced and unlabeled data from the whole website, leading to a final improvement of accuracy by 3.4% over a Vanilla XLM-R model.
Anthology ID:
2022.politicalnlp-1.3
Volume:
Proceedings of the LREC 2022 workshop on Natural Language Processing for Political Sciences
Month:
June
Year:
2022
Address:
Marseille, France
Editors:
Haithem Afli, Mehwish Alam, Houda Bouamor, Cristina Blasi Casagran, Colleen Boland, Sahar Ghannay
Venue:
PoliticalNLP
SIG:
Publisher:
European Language Resources Association
Note:
Pages:
16–21
Language:
URL:
https://aclanthology.org/2022.politicalnlp-1.3
DOI:
Bibkey:
Cite (ACL):
Valentin Barriere, Alexandra Balahur, and Brian Ravenet. 2022. Debating Europe: A Multilingual Multi-Target Stance Classification Dataset of Online Debates. In Proceedings of the LREC 2022 workshop on Natural Language Processing for Political Sciences, pages 16–21, Marseille, France. European Language Resources Association.
Cite (Informal):
Debating Europe: A Multilingual Multi-Target Stance Classification Dataset of Online Debates (Barriere et al., PoliticalNLP 2022)
Copy Citation:
PDF:
https://aclanthology.org/2022.politicalnlp-1.3.pdf
Data
x-stance