Large Language Models as Detectors or Instigators of Hate Speech in Low-resource Ethiopian Languages

Nuhu Ibrahim; Felicity Mulford; Riza Theresa Batista-Navarro

doi:10.18653/v1/2025.winlp-main.31

Large Language Models as Detectors or Instigators of Hate Speech in Low-resource Ethiopian Languages

Nuhu Ibrahim, Felicity Mulford, Riza Batista-Navarro

Abstract

We introduce a multilingual benchmark for evaluating large language models (LLMs) on hate speech detection and generation in low-resource Ethiopian languages: Afaan Oromo, Amharic and Tigrigna, and English (both monolingual and code-mixed). Using a balanced and expert-annotated dataset, we assess five state-of-the-art LLM families across both tasks. Our results show that while LLMs perform well on English detection, their performance on low-resource languages is significantly weaker, revealing that increasing model size alone does not ensure multilingual robustness. More critically, we find that all models, including closed and open-source variants, can be prompted to generate profiled hate speech with minimal resistance. These findings underscore the dual risk of exclusion and exploitation: LLMs fail to protect low-resource communities while enabling scalable harm against them. We make our evaluation framework available to facilitate future research on multilingual model safety and ethical robustness.

Anthology ID:: 2025.winlp-main.31
Volume:: Proceedings of the 9th Widening NLP Workshop
Month:: November
Year:: 2025
Address:: Suzhou, China
Editors:: Chen Zhang, Emily Allaway, Hua Shen, Lesly Miculicich, Yinqiao Li, Meryem M'hamdi, Peerat Limkonchotiwat, Richard He Bai, Santosh T.y.s.s., Sophia Simeng Han, Surendrabikram Thapa, Wiem Ben Rim
Venues:: WiNLP | WS
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 197–209
Language:
URL:: https://aclanthology.org/2025.winlp-main.31/
DOI:: 10.18653/v1/2025.winlp-main.31
Bibkey:
Cite (ACL):: Nuhu Ibrahim, Felicity Mulford, and Riza Batista-Navarro. 2025. Large Language Models as Detectors or Instigators of Hate Speech in Low-resource Ethiopian Languages. In Proceedings of the 9th Widening NLP Workshop, pages 197–209, Suzhou, China. Association for Computational Linguistics.
Cite (Informal):: Large Language Models as Detectors or Instigators of Hate Speech in Low-resource Ethiopian Languages (Ibrahim et al., WiNLP 2025)
Copy Citation:
PDF:: https://aclanthology.org/2025.winlp-main.31.pdf

PDF Cite Search Fix data