BanglaNLP at BLP-2023 Task 1: Benchmarking different Transformer Models for Violence Inciting Text Detection in Bangla

Saumajit Saha, Albert Nanda


Abstract
This paper presents the system that we have developed while solving this shared task on violence inciting text detection in Bangla. We explain both the traditional and the recent approaches that we have used to make our models learn. Our proposed system helps to classify if the given text contains any threat. We studied the impact of data augmentation when there is a limited dataset available. Our quantitative results show that finetuning a multilingual-e5-base model performed the best in our task compared to other transformer-based architectures. We obtained a macro F1 of 68.11% in the test set and our performance in this shared task is ranked at 23 in the leaderboard.
Anthology ID:
2023.banglalp-1.17
Volume:
Proceedings of the First Workshop on Bangla Language Processing (BLP-2023)
Month:
December
Year:
2023
Address:
Singapore
Editors:
Firoj Alam, Sudipta Kar, Shammur Absar Chowdhury, Farig Sadeque, Ruhul Amin
Venue:
BanglaLP
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
163–167
Language:
URL:
https://aclanthology.org/2023.banglalp-1.17
DOI:
10.18653/v1/2023.banglalp-1.17
Bibkey:
Cite (ACL):
Saumajit Saha and Albert Nanda. 2023. BanglaNLP at BLP-2023 Task 1: Benchmarking different Transformer Models for Violence Inciting Text Detection in Bangla. In Proceedings of the First Workshop on Bangla Language Processing (BLP-2023), pages 163–167, Singapore. Association for Computational Linguistics.
Cite (Informal):
BanglaNLP at BLP-2023 Task 1: Benchmarking different Transformer Models for Violence Inciting Text Detection in Bangla (Saha & Nanda, BanglaLP 2023)
Copy Citation:
PDF:
https://aclanthology.org/2023.banglalp-1.17.pdf