Can Large Language Models Differentiate Harmful from Argumentative Essays? Steps Toward Ethical Essay Scoring

Hongjin Kim; Jeonghyun Kang; Harksoo Kim

Can Large Language Models Differentiate Harmful from Argumentative Essays? Steps Toward Ethical Essay Scoring

Hongjin Kim, Jeonghyun Kang, Harksoo Kim

Abstract

This study addresses critical gaps in Automatic Essay Scoring (AES) systems and Large Language Models (LLMs) with regard to their ability to effectively identify and score harmful essays. Despite advancements in AES technology, current models often overlook ethically and morally problematic elements within essays, erroneously assigning high scores to essays that may propagate harmful opinions. In this study, we introduce the Harmful Essay Detection (HED) benchmark, which includes essays integrating sensitive topics such as racism and gender bias, to test the efficacy of various LLMs in recognizing and scoring harmful content. Our findings reveal that: (1) LLMs require further enhancement to accurately distinguish between harmful and argumentative essays, and (2) both current AES models and LLMs fail to consider the ethical dimensions of content during scoring. The study underscores the need for developing more robust AES systems that are sensitive to the ethical implications of the content they are scoring.

Anthology ID:: 2025.coling-main.541
Volume:: Proceedings of the 31st International Conference on Computational Linguistics
Month:: January
Year:: 2025
Address:: Abu Dhabi, UAE
Editors:: Owen Rambow, Leo Wanner, Marianna Apidianaki, Hend Al-Khalifa, Barbara Di Eugenio, Steven Schockaert
Venue:: COLING
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 8121–8147
Language:
URL:: https://aclanthology.org/2025.coling-main.541/
DOI:
Bibkey:
Cite (ACL):: Hongjin Kim, Jeonghyun Kang, and Harksoo Kim. 2025. Can Large Language Models Differentiate Harmful from Argumentative Essays? Steps Toward Ethical Essay Scoring. In Proceedings of the 31st International Conference on Computational Linguistics, pages 8121–8147, Abu Dhabi, UAE. Association for Computational Linguistics.
Cite (Informal):: Can Large Language Models Differentiate Harmful from Argumentative Essays? Steps Toward Ethical Essay Scoring (Kim et al., COLING 2025)
Copy Citation:
PDF:: https://aclanthology.org/2025.coling-main.541.pdf

PDF Cite Search Fix data