Predictive Modeling of Human Developers’ Evaluative Judgment of Generated Code as a Decision Process

Sergey Kovalchuk; Yanyu Li; Dmitriy Fedrushkov

doi:10.18653/v1/2025.hcinlp-1.10

Predictive Modeling of Human Developers’ Evaluative Judgment of Generated Code as a Decision Process

Sergey Kovalchuk, Yanyu Li, Dmitriy Fedrushkov

Abstract

The paper presents early results in the development of an approach to predictive modeling of human developer perceiving of code generated in question-answering scenarios with Large Language Model (LLM) applications. The study is focused on building a model for the description and prediction of human implicit behavior during evaluative judgment of generated code through evaluation of its consistency, correctness, and usefulness as subjective perceiving characteristics. We used Markov Decision Process (MDP) as a basic framework to describe the human developer and his/her perceiving. We consider two approaches (regression-based and classification-based) to identify MDP parameters so it can be used as an “artificial” developer for human-centered code evaluation. An experimental evaluation of the proposed approach was performed with survey data previously collected for several code generation LLMs in a question-answering scenario. The results show overall good performance of the proposed model in acceptance rate prediction (accuracy 0.82) and give promising perspectives for further development and application.

Anthology ID:: 2025.hcinlp-1.10
Volume:: Proceedings of the Fourth Workshop on Bridging Human-Computer Interaction and Natural Language Processing (HCI+NLP)
Month:: November
Year:: 2025
Address:: Suzhou, China
Editors:: Su Lin Blodgett, Amanda Cercas Curry, Sunipa Dev, Siyan Li, Michael Madaio, Jack Wang, Sherry Tongshuang Wu, Ziang Xiao, Diyi Yang
Venues:: HCINLP | WS
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 120–128
Language:
URL:: https://aclanthology.org/2025.hcinlp-1.10/
DOI:: 10.18653/v1/2025.hcinlp-1.10
Bibkey:
Cite (ACL):: Sergey Kovalchuk, Yanyu Li, and Dmitriy Fedrushkov. 2025. Predictive Modeling of Human Developers’ Evaluative Judgment of Generated Code as a Decision Process. In Proceedings of the Fourth Workshop on Bridging Human-Computer Interaction and Natural Language Processing (HCI+NLP), pages 120–128, Suzhou, China. Association for Computational Linguistics.
Cite (Informal):: Predictive Modeling of Human Developers’ Evaluative Judgment of Generated Code as a Decision Process (Kovalchuk et al., HCINLP 2025)
Copy Citation:
PDF:: https://aclanthology.org/2025.hcinlp-1.10.pdf

PDF Cite Search Fix data