JRC at ClimateActivism 2024: Lexicon-based Detection of Hate Speech

Hristo Tanev


Abstract
In this paper we describe the participation of the JRC team in the Sub-task A: “Hate Speech Detection” in the Shared task on Hate Speech and Stance Detection during Climate Activism at the CASE 2024 workshop. Our system is purely lexicon (keyword) based and does not use any statistical classifier. The system ranked 18 out of 22 participants with F1 of 0.83, only one point below a system, based on LLM. Our system also obtained one the highest achieved precision scores among all participating algo- rithms.
Anthology ID:
2024.case-1.11
Volume:
Proceedings of the 7th Workshop on Challenges and Applications of Automated Extraction of Socio-political Events from Text (CASE 2024)
Month:
March
Year:
2024
Address:
St. Julians, Malta
Editors:
Ali Hürriyetoğlu, Hristo Tanev, Surendrabikram Thapa, Gökçe Uludoğan
Venues:
CASE | WS
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
85–88
Language:
URL:
https://aclanthology.org/2024.case-1.11
DOI:
Bibkey:
Cite (ACL):
Hristo Tanev. 2024. JRC at ClimateActivism 2024: Lexicon-based Detection of Hate Speech. In Proceedings of the 7th Workshop on Challenges and Applications of Automated Extraction of Socio-political Events from Text (CASE 2024), pages 85–88, St. Julians, Malta. Association for Computational Linguistics.
Cite (Informal):
JRC at ClimateActivism 2024: Lexicon-based Detection of Hate Speech (Tanev, CASE-WS 2024)
Copy Citation:
PDF:
https://aclanthology.org/2024.case-1.11.pdf
Supplementary material:
 2024.case-1.11.SupplementaryMaterial.txt