Aschern at SemEval-2020 Task 11: It Takes Three to Tango: RoBERTa, CRF, and Transfer Learning

Anton Chernyavskiy, Dmitry Ilvovsky, Preslav Nakov


Abstract
We describe our system for SemEval-2020 Task 11 on Detection of Propaganda Techniques in News Articles. We developed ensemble models using RoBERTa-based neural architectures, additional CRF layers, transfer learning between the two subtasks, and advanced post-processing to handle the multi-label nature of the task, the consistency between nested spans, repetitions, and labels from similar spans in training. We achieved sizable improvements over baseline fine-tuned RoBERTa models, and the official evaluation ranked our system 3rd (almost tied with the 2nd) out of 36 teams on the span identification subtask with an F1 score of 0.491, and 2nd (almost tied with the 1st) out of 31 teams on the technique classification subtask with an F1 score of 0.62.
Anthology ID:
2020.semeval-1.191
Volume:
Proceedings of the Fourteenth Workshop on Semantic Evaluation
Month:
December
Year:
2020
Address:
Barcelona (online)
Venue:
SemEval
SIGs:
SIGLEX | SIGSEM
Publisher:
International Committee for Computational Linguistics
Pages:
1462–1468
URL:
https://aclanthology.org/2020.semeval-1.191
DOI:
10.18653/v1/2020.semeval-1.191
Cite (ACL):
Anton Chernyavskiy, Dmitry Ilvovsky, and Preslav Nakov. 2020. Aschern at SemEval-2020 Task 11: It Takes Three to Tango: RoBERTa, CRF, and Transfer Learning. In Proceedings of the Fourteenth Workshop on Semantic Evaluation, pages 1462–1468, Barcelona (online). International Committee for Computational Linguistics.
Cite (Informal):
Aschern at SemEval-2020 Task 11: It Takes Three to Tango: RoBERTa, CRF, and Transfer Learning (Chernyavskiy et al., SemEval 2020)
PDF:
https://aclanthology.org/2020.semeval-1.191.pdf
Code:
aschern/semeval2020_task11