Investigating Adversarial Robustness in LLM-based AES

Renjith Ravindran; Ikkyu Choi

Investigating Adversarial Robustness in LLM-based AES

Abstract

Automated Essay Scoring (AES) is one of the most widely studied applications of Natural Language Processing (NLP) in education and educational measurement. Recent advances with pre-trained Transformer-based large language models (LLMs) have shifted AES from feature-based modeling to leveraging contextualized language representations. These models provide rich semantic representations that substantially improve scoring accuracy and human–machine agreement compared to systems relying on handcrafted features. However, their robustness towards adversarially crafted inputs remains poorly understood. In this study, we define adversarial input as any modification of the essay text designed to fool an automated scoring system into assigning an inflated score. We evaluate a fine-tuned DeBERTa-based AES model on such inputs and show that it is highly susceptible to a simple text duplication attack, highlighting the need to consider adversarial robustness alongside accuracy in the development of AES systems.

Anthology ID:: 2025.aimecon-sessions.10
Volume:: Proceedings of the Artificial Intelligence in Measurement and Education Conference (AIME-Con): Coordinated Session Papers
Month:: October
Year:: 2025
Address:: Wyndham Grand Pittsburgh, Downtown, Pittsburgh, Pennsylvania, United States
Editors:: Joshua Wilson, Christopher Ormerod, Magdalen Beiting Parrish
Venue:: AIME-Con
SIG:
Publisher:: National Council on Measurement in Education (NCME)
Note:
Pages:: 86–91
Language:
URL:: https://aclanthology.org/2025.aimecon-sessions.10/
DOI:
Bibkey:
Cite (ACL):: Renjith Ravindran and Ikkyu Choi. 2025. Investigating Adversarial Robustness in LLM-based AES. In Proceedings of the Artificial Intelligence in Measurement and Education Conference (AIME-Con): Coordinated Session Papers, pages 86–91, Wyndham Grand Pittsburgh, Downtown, Pittsburgh, Pennsylvania, United States. National Council on Measurement in Education (NCME).
Cite (Informal):: Investigating Adversarial Robustness in LLM-based AES (Ravindran & Choi, AIME-Con 2025)
Copy Citation:
PDF:: https://aclanthology.org/2025.aimecon-sessions.10.pdf

PDF Cite Search Fix data