KAFK at SemEval-2020 Task 12: Checkpoint Ensemble of Transformers for Hate Speech Classification

Kaushik Amar Das, Arup Baruah, Ferdous Ahmed Barbhuiya, Kuntal Dey


Abstract
This paper presents the approach of Team KAFK for the English edition of SemEval-2020 Task 12. We use checkpoint ensembling to create ensembles of BERT-based transformers and show that it can improve the performance of classification systems. We explore attention mask dropout to mitigate for the poor constructs of social media texts. Our classifiers scored macro-f1 of 0.909, 0.551 and 0.616 for subtasks A, B and C respectively. The code is publicly released online.
Anthology ID:
2020.semeval-1.267
Volume:
Proceedings of the Fourteenth Workshop on Semantic Evaluation
Month:
December
Year:
2020
Address:
Barcelona (online)
Editors:
Aurelie Herbelot, Xiaodan Zhu, Alexis Palmer, Nathan Schneider, Jonathan May, Ekaterina Shutova
Venue:
SemEval
SIG:
SIGLEX
Publisher:
International Committee for Computational Linguistics
Note:
Pages:
2023–2029
Language:
URL:
https://aclanthology.org/2020.semeval-1.267
DOI:
10.18653/v1/2020.semeval-1.267
Bibkey:
Cite (ACL):
Kaushik Amar Das, Arup Baruah, Ferdous Ahmed Barbhuiya, and Kuntal Dey. 2020. KAFK at SemEval-2020 Task 12: Checkpoint Ensemble of Transformers for Hate Speech Classification. In Proceedings of the Fourteenth Workshop on Semantic Evaluation, pages 2023–2029, Barcelona (online). International Committee for Computational Linguistics.
Cite (Informal):
KAFK at SemEval-2020 Task 12: Checkpoint Ensemble of Transformers for Hate Speech Classification (Das et al., SemEval 2020)
Copy Citation:
PDF:
https://aclanthology.org/2020.semeval-1.267.pdf
Code
 cozek/OffensEval2020-code