Large Language Models as Detectors or Instigators of Hate Speech in Low-resource Ethiopian Languages

Nuhu Ibrahim, Felicity Mulford, Riza Batista-Navarro


Abstract
We introduce a multilingual benchmark for evaluating large language models (LLMs) on hate speech detection and generation in three low-resource Ethiopian languages (Afaan Oromo, Amharic and Tigrigna) and in English, covering both monolingual and code-mixed text. Using a balanced, expert-annotated dataset, we assess five state-of-the-art LLM families on both tasks. Our results show that while LLMs perform well on English detection, their performance on the low-resource languages is significantly weaker, indicating that increasing model size alone does not ensure multilingual robustness. More critically, we find that all models, both closed and open-source, can be prompted to generate profiled hate speech with minimal resistance. These findings underscore the dual risk of exclusion and exploitation: LLMs fail to protect low-resource communities while enabling scalable harm against them. We make our evaluation framework available to facilitate future research on multilingual model safety and ethical robustness.
Anthology ID:
2025.winlp-main.31
Volume:
Proceedings of the 9th Widening NLP Workshop
Month:
November
Year:
2025
Address:
Suzhou, China
Editors:
Chen Zhang, Emily Allaway, Hua Shen, Lesly Miculicich, Yinqiao Li, Meryem M'hamdi, Peerat Limkonchotiwat, Richard He Bai, Santosh T.y.s.s., Sophia Simeng Han, Surendrabikram Thapa, Wiem Ben Rim
Venues:
WiNLP | WS
Publisher:
Association for Computational Linguistics
Pages:
197–209
URL:
https://aclanthology.org/2025.winlp-main.31/
Cite (ACL):
Nuhu Ibrahim, Felicity Mulford, and Riza Batista-Navarro. 2025. Large Language Models as Detectors or Instigators of Hate Speech in Low-resource Ethiopian Languages. In Proceedings of the 9th Widening NLP Workshop, pages 197–209, Suzhou, China. Association for Computational Linguistics.
Cite (Informal):
Large Language Models as Detectors or Instigators of Hate Speech in Low-resource Ethiopian Languages (Ibrahim et al., WiNLP 2025)
PDF:
https://aclanthology.org/2025.winlp-main.31.pdf