ALEM at CASE 2021 Task 1: Multilingual Text Classification on News Articles

Alaeddin Gürel, Emre Emin


Abstract
We participated CASE shared task in ACL-IJCNLP 2021. This paper is a summary of our experiments and ideas about this shared task. For each subtask we shared our approach, successful and failed methods and our thoughts about them. We submit our results once for every subtask, except for subtask3, in task submission system and present scores based on our validation set formed from given training samples in this paper. Techniques and models we mentioned includes BERT, Multilingual BERT, oversampling, undersampling, data augmentation and their implications with each other. Most of the experiments we came up with were not completed, as time did not permit, but we share them here as we plan to do them as suggested in the future work part of document.
Anthology ID:
2021.case-1.19
Volume:
Proceedings of the 4th Workshop on Challenges and Applications of Automated Extraction of Socio-political Events from Text (CASE 2021)
Month:
August
Year:
2021
Address:
Online
Venues:
ACL | CASE | IJCNLP
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
147–151
Language:
URL:
https://aclanthology.org/2021.case-1.19
DOI:
10.18653/v1/2021.case-1.19
Bibkey:
Cite (ACL):
Alaeddin Gürel and Emre Emin. 2021. ALEM at CASE 2021 Task 1: Multilingual Text Classification on News Articles. In Proceedings of the 4th Workshop on Challenges and Applications of Automated Extraction of Socio-political Events from Text (CASE 2021), pages 147–151, Online. Association for Computational Linguistics.
Cite (Informal):
ALEM at CASE 2021 Task 1: Multilingual Text Classification on News Articles (Gürel & Emin, CASE 2021)
Copy Citation:
PDF:
https://aclanthology.org/2021.case-1.19.pdf