Think Like a Person Before Responding: A Multi-Faceted Evaluation of Persona-Guided LLMs for Countering Hate Speech.

Mikel Ngueajio; Flor Miriam Plaza-del-Arco; Yi-Ling Chung; Danda Rawat; Amanda Cercas Curry

Think Like a Person Before Responding: A Multi-Faceted Evaluation of Persona-Guided LLMs for Countering Hate Speech.

Mikel Ngueajio, Flor Miriam Plaza-del-Arco, Yi-Ling Chung, Danda Rawat, Amanda Cercas Curry

Abstract

Automated counter-narratives (CN) offer a promising strategy for mitigating online hate speech, yet concerns about their affective tone, accessibility, and ethical risks remain. We propose a framework for evaluating Large Language Model (LLM)-generated CNs across four dimensions: persona framing, verbosity and readability, affective tone, and ethical robustness. Using GPT-4o-Mini, Cohere’s CommandR-7B, and Meta’s LLaMA 3.1-70B, we assess three prompting strategies on the MT-Conan and HatEval datasets.Our findings reveal that LLM-generated CNs are often verbose and adapted for people with college-level literacy, limiting their accessibility. While emotionally guided prompts yield more empathetic and readable responses, there remain concerns surrounding safety and effectiveness.

Anthology ID:: 2025.woah-1.10
Volume:: Proceedings of the The 9th Workshop on Online Abuse and Harms (WOAH)
Month:: August
Year:: 2025
Address:: Vienna, Austria
Editors:: Agostina Calabrese, Christine de Kock, Debora Nozza, Flor Miriam Plaza-del-Arco, Zeerak Talat, Francielle Vargas
Venues:: WOAH | WS
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 104–123
Language:
URL:: https://aclanthology.org/2025.woah-1.10/
DOI:
Bibkey:
Cite (ACL):: Mikel Ngueajio, Flor Miriam Plaza-del-Arco, Yi-Ling Chung, Danda Rawat, and Amanda Cercas Curry. 2025. Think Like a Person Before Responding: A Multi-Faceted Evaluation of Persona-Guided LLMs for Countering Hate Speech.. In Proceedings of the The 9th Workshop on Online Abuse and Harms (WOAH), pages 104–123, Vienna, Austria. Association for Computational Linguistics.
Cite (Informal):: Think Like a Person Before Responding: A Multi-Faceted Evaluation of Persona-Guided LLMs for Countering Hate Speech. (Ngueajio et al., WOAH 2025)
Copy Citation:
PDF:: https://aclanthology.org/2025.woah-1.10.pdf
Supplementarymaterial:: 2025.woah-1.10.SupplementaryMaterial.zip

PDF Cite Search Supplementarymaterial Fix data