DataBees at SemEval-2026 Task 9: Detecting Multilingual, Multicultural and Multievent Online Polarization

Tanisha Sriram; Sathvika Shankar; Sowmya Anand; Rajalakshmi Sivanaiah; Angel Deborah S; Mirnalinee Thankanadar

DataBees at SemEval-2026 Task 9: Detecting Multilingual, Multicultural and Multievent Online Polarization

Tanisha Sriram, Sathvika Shankar, Sowmya Anand, Rajalakshmi Sivanaiah, Angel Deborah S, Mirnalinee Thankanadar

Abstract

This paper describes our submission toSemEval-2026 Task 9, Subtask 1: Multilingual Text Classification Challenge — Polarization Detection. Our focus is on how classicaland transformer-based models compare whenapplied to multilingual polarization detection.We aim to understand where each type tendsto do well and where it breaks down, particularly once you move from high-resource tolow-resource settings. Our experimental setupevaluates classical machine learning models(TFIDF with Naive Bayes, Logistic Regression, and Linear SVM) alongside languagespecific transformer models across multiplelanguages. For Arabic, Bengali, German, Italian, and Spanish, we leveraged both multilingual and monolingual pre-trained transformers such as mBERT, XLM-R, AraBERTv2,BanglaBERT, and BETO. We compare individual classical and transformer-based modelsto identify which modeling choices work bestfor each language. Our results varied substantially across languages. We achieved our bestleaderboard rankings in Bengali (6th out of 48teams) and Italian (6th out of 43 teams), whileperformance was lower in Arabic (33rd out of44), German (41st out of 44), and Spanish (46thout of 48). The study highlights the value ofcomparing classical and transformer-based approaches for multilingual polarization detectionand identifies language-specific challenges forfuture improvement.

Anthology ID:: 2026.semeval-1.268
Volume:: Proceedings of the 20th International Workshop on Semantic Evaluation (2026)
Month:: July
Year:: 2026
Address:: San Diego, California, USA
Editors:: Ekaterina Kochmar, Debanjan Ghosh, Kai North, Mamoru Komachi
Venues:: SemEval | WS
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 2120–2125
Language:
URL:: https://aclanthology.org/2026.semeval-1.268/
DOI:
Bibkey:
Cite (ACL):: Tanisha Sriram, Sathvika Shankar, Sowmya Anand, Rajalakshmi Sivanaiah, Angel Deborah S, and Mirnalinee Thankanadar. 2026. DataBees at SemEval-2026 Task 9: Detecting Multilingual, Multicultural and Multievent Online Polarization. In Proceedings of the 20th International Workshop on Semantic Evaluation (2026), pages 2120–2125, San Diego, California, USA. Association for Computational Linguistics.
Cite (Informal):: DataBees at SemEval-2026 Task 9: Detecting Multilingual, Multicultural and Multievent Online Polarization (Sriram et al., SemEval 2026)
Copy Citation:
PDF:: https://aclanthology.org/2026.semeval-1.268.pdf

PDF Cite Search Fix data