LINUS@EEUCA 2026: Fine-grained Toxicity Detection in Gaming Chat using Multilingual Transformers

Prajwal Ghimire; Aashish Mahato; Sunil Regmi

LINUS@EEUCA 2026: Fine-grained Toxicity Detection in Gaming Chat using Multilingual Transformers

Prajwal Ghimire, Aashish Mahato, Sunil Regmi

Abstract

The detection of toxic behavior in online gaming communities is crucial for maintaining safe digital spaces, yet remains challenging due to subtle context-dependent and intent-driven language. The GameTox dataset consists of around 53K World of Tanks chat utterances annotated across six categories: Non-toxic, Insults and Flaming, Other Offensive Texts, Hate and Harassment, Threats, and Extremism (CITATION). Our best performing approach, across multiple transformer-based architecture experimentations, is based on the multilingual BERT variant mmBERT-base fine-tuned with class-weighted cross-entropy loss. The best mmBERT-base model achieved a Macro F1 of 0.5882 during validation and an official test Macro F1 of 0.5104 on the shared task leaderboard. An internal held-out evaluation on a development split yielded 0.4282, which we analyze to understand distributional sensitivity to gaming slang and class imbalance. The code is available at: https://github.com/sunilRegmi-ai/eeuca-toxicity-detection.

Anthology ID:: 2026.eeuca-1.24
Volume:: Proceedings of the 9th Workshop on Event Extraction and Understanding: Challenges and Applications (EEUCA 2026)
Month:: July
Year:: 2026
Address:: San Diego, California, USA
Editors:: Ali Hürriyetoğlu, Surendrabikram Thapa, Hristo Tanev
Venues:: EEUCA | WS
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 216–222
Language:
URL:: https://aclanthology.org/2026.eeuca-1.24/
DOI:
Bibkey:
Cite (ACL):: Prajwal Ghimire, Aashish Mahato, and Sunil Regmi. 2026. LINUS@EEUCA 2026: Fine-grained Toxicity Detection in Gaming Chat using Multilingual Transformers. In Proceedings of the 9th Workshop on Event Extraction and Understanding: Challenges and Applications (EEUCA 2026), pages 216–222, San Diego, California, USA. Association for Computational Linguistics.
Cite (Informal):: LINUS@EEUCA 2026: Fine-grained Toxicity Detection in Gaming Chat using Multilingual Transformers (Ghimire et al., EEUCA 2026)
Copy Citation:
PDF:: https://aclanthology.org/2026.eeuca-1.24.pdf

PDF Cite Search Fix data