BibTeX
@inproceedings{amorim-etal-2018-automated,
    title = "Automated Essay Scoring in the Presence of Biased Ratings",
    author = "Amorim, Evelin and
      Can{\c{c}}ado, Marcia and
      Veloso, Adriano",
    editor = "Walker, Marilyn and
      Ji, Heng and
      Stent, Amanda",
    booktitle = "Proceedings of the 2018 Conference of the North {A}merican Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long Papers)",
    month = jun,
    year = "2018",
    address = "New Orleans, Louisiana",
    publisher = "Association for Computational Linguistics",
    url = "https://aclanthology.org/N18-1021",
    doi = "10.18653/v1/N18-1021",
    pages = "229--237",
    abstract = "Studies in Social Sciences have revealed that when people evaluate someone else, their evaluations often reflect their biases. As a result, rater bias may introduce highly subjective factors that make their evaluations inaccurate. This may affect automated essay scoring models in many ways, as these models are typically designed to model (potentially biased) essay raters. While there is sizeable literature on rater effects in general settings, it remains unknown how rater bias affects automated essay scoring. To this end, we present a new annotated corpus containing essays and their respective scores. Different from existing corpora, our corpus also contains comments provided by the raters in order to ground their scores. We present features to quantify rater bias based on their comments, and we found that rater bias plays an important role in automated essay scoring. We investigated the extent to which rater bias affects models based on hand-crafted features. Finally, we propose to rectify the training set by removing essays associated with potentially biased scores while learning the scoring model.",
}
<?xml version="1.0" encoding="UTF-8"?>
<modsCollection xmlns="http://www.loc.gov/mods/v3">
<mods ID="amorim-etal-2018-automated">
<titleInfo>
<title>Automated Essay Scoring in the Presence of Biased Ratings</title>
</titleInfo>
<name type="personal">
<namePart type="given">Evelin</namePart>
<namePart type="family">Amorim</namePart>
<role>
<roleTerm authority="marcrelator" type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Marcia</namePart>
<namePart type="family">Cançado</namePart>
<role>
<roleTerm authority="marcrelator" type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Adriano</namePart>
<namePart type="family">Veloso</namePart>
<role>
<roleTerm authority="marcrelator" type="text">author</roleTerm>
</role>
</name>
<originInfo>
<dateIssued>2018-06</dateIssued>
</originInfo>
<typeOfResource>text</typeOfResource>
<relatedItem type="host">
<titleInfo>
<title>Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long Papers)</title>
</titleInfo>
<name type="personal">
<namePart type="given">Marilyn</namePart>
<namePart type="family">Walker</namePart>
<role>
<roleTerm authority="marcrelator" type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Heng</namePart>
<namePart type="family">Ji</namePart>
<role>
<roleTerm authority="marcrelator" type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Amanda</namePart>
<namePart type="family">Stent</namePart>
<role>
<roleTerm authority="marcrelator" type="text">editor</roleTerm>
</role>
</name>
<originInfo>
<publisher>Association for Computational Linguistics</publisher>
<place>
<placeTerm type="text">New Orleans, Louisiana</placeTerm>
</place>
</originInfo>
<genre authority="marcgt">conference publication</genre>
</relatedItem>
<abstract>Studies in Social Sciences have revealed that when people evaluate someone else, their evaluations often reflect their biases. As a result, rater bias may introduce highly subjective factors that make their evaluations inaccurate. This may affect automated essay scoring models in many ways, as these models are typically designed to model (potentially biased) essay raters. While there is sizeable literature on rater effects in general settings, it remains unknown how rater bias affects automated essay scoring. To this end, we present a new annotated corpus containing essays and their respective scores. Different from existing corpora, our corpus also contains comments provided by the raters in order to ground their scores. We present features to quantify rater bias based on their comments, and we found that rater bias plays an important role in automated essay scoring. We investigated the extent to which rater bias affects models based on hand-crafted features. Finally, we propose to rectify the training set by removing essays associated with potentially biased scores while learning the scoring model.</abstract>
<identifier type="citekey">amorim-etal-2018-automated</identifier>
<identifier type="doi">10.18653/v1/N18-1021</identifier>
<location>
<url>https://aclanthology.org/N18-1021</url>
</location>
<part>
<date>2018-06</date>
<extent unit="page">
<start>229</start>
<end>237</end>
</extent>
</part>
</mods>
</modsCollection>
Endnote
%0 Conference Proceedings
%T Automated Essay Scoring in the Presence of Biased Ratings
%A Amorim, Evelin
%A Cançado, Marcia
%A Veloso, Adriano
%Y Walker, Marilyn
%Y Ji, Heng
%Y Stent, Amanda
%S Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long Papers)
%D 2018
%8 June
%I Association for Computational Linguistics
%C New Orleans, Louisiana
%F amorim-etal-2018-automated
%X Studies in Social Sciences have revealed that when people evaluate someone else, their evaluations often reflect their biases. As a result, rater bias may introduce highly subjective factors that make their evaluations inaccurate. This may affect automated essay scoring models in many ways, as these models are typically designed to model (potentially biased) essay raters. While there is sizeable literature on rater effects in general settings, it remains unknown how rater bias affects automated essay scoring. To this end, we present a new annotated corpus containing essays and their respective scores. Different from existing corpora, our corpus also contains comments provided by the raters in order to ground their scores. We present features to quantify rater bias based on their comments, and we found that rater bias plays an important role in automated essay scoring. We investigated the extent to which rater bias affects models based on hand-crafted features. Finally, we propose to rectify the training set by removing essays associated with potentially biased scores while learning the scoring model.
%R 10.18653/v1/N18-1021
%U https://aclanthology.org/N18-1021
%U https://doi.org/10.18653/v1/N18-1021
%P 229-237
Markdown (Informal)
[Automated Essay Scoring in the Presence of Biased Ratings](https://aclanthology.org/N18-1021) (Amorim et al., NAACL 2018)
ACL
Evelin Amorim, Marcia Cançado, and Adriano Veloso. 2018. Automated Essay Scoring in the Presence of Biased Ratings. In Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long Papers), pages 229–237, New Orleans, Louisiana. Association for Computational Linguistics.
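
The abstract describes rectifying the training set by removing essays whose scores appear biased before learning the scoring model. The snippet below is a minimal, hypothetical sketch of that general idea, not the authors' implementation: it assumes a per-essay bias estimate already derived from rater comments, an illustrative cutoff of 0.8, and scikit-learn's ridge regression as a stand-in scoring model over hand-crafted features. All names and numbers are assumptions for illustration.

```python
# Minimal sketch (not the paper's code) of training-set rectification:
# drop essays flagged as potentially biased, then fit a scoring model
# on hand-crafted essay features.
import numpy as np
from sklearn.linear_model import Ridge


def rectify_and_train(features, scores, bias_scores, threshold=0.8):
    """Remove essays whose rater-comment bias estimate exceeds `threshold`
    (hypothetical cutoff), then train a ridge regressor as the scoring model."""
    features = np.asarray(features, dtype=float)
    scores = np.asarray(scores, dtype=float)
    bias_scores = np.asarray(bias_scores, dtype=float)

    keep = bias_scores < threshold           # essays with plausibly unbiased scores
    model = Ridge(alpha=1.0)
    model.fit(features[keep], scores[keep])  # learn only from the rectified set
    return model


# Toy usage with made-up numbers: 4 essays, 3 hand-crafted features each.
X = [[0.1, 0.5, 0.3], [0.7, 0.2, 0.9], [0.4, 0.4, 0.4], [0.9, 0.1, 0.6]]
y = [2.0, 4.0, 3.0, 5.0]        # rater scores
bias = [0.1, 0.9, 0.2, 0.3]     # hypothetical bias estimates from rater comments
model = rectify_and_train(X, y, bias)
print(model.predict(np.asarray([[0.5, 0.5, 0.5]])))
```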