UTMN at SemEval-2020 Task 11: A Kitchen Solution to Automatic Propaganda Detection

Elena Mikhalkova, Nadezhda Ganzherli, Anna Glazkova, Yuliya Bidulya


Abstract
The article describes a fast solution to propaganda detection at SemEval-2020 Task 11, based on feature adjustment. We use per-token vectorization of features and a simple Logistic Regression classifier to quickly test different hypotheses about our data. We come up with what seems to us the best solution, however, we are unable to align it with the result of the metric suggested by the organizers of the task. We test how our system handles class and feature imbalance by varying the number of samples of two classes (Propaganda and None) in the training set, the size of a context window in which a token is vectorized and combination of vectorization means. The result of our system at SemEval2020 Task 11 is F-score=0.37.
Anthology ID:
2020.semeval-1.245
Volume:
Proceedings of the Fourteenth Workshop on Semantic Evaluation
Month:
December
Year:
2020
Address:
Barcelona (online)
Editors:
Aurelie Herbelot, Xiaodan Zhu, Alexis Palmer, Nathan Schneider, Jonathan May, Ekaterina Shutova
Venue:
SemEval
SIG:
SIGLEX
Publisher:
International Committee for Computational Linguistics
Note:
Pages:
1858–1864
Language:
URL:
https://aclanthology.org/2020.semeval-1.245
DOI:
10.18653/v1/2020.semeval-1.245
Bibkey:
Cite (ACL):
Elena Mikhalkova, Nadezhda Ganzherli, Anna Glazkova, and Yuliya Bidulya. 2020. UTMN at SemEval-2020 Task 11: A Kitchen Solution to Automatic Propaganda Detection. In Proceedings of the Fourteenth Workshop on Semantic Evaluation, pages 1858–1864, Barcelona (online). International Committee for Computational Linguistics.
Cite (Informal):
UTMN at SemEval-2020 Task 11: A Kitchen Solution to Automatic Propaganda Detection (Mikhalkova et al., SemEval 2020)
Copy Citation:
PDF:
https://aclanthology.org/2020.semeval-1.245.pdf
Code
 evrog/Propaganda-Detection-at-SemEval-2020