Investigating Value-Reasoning Reliability in Small Large Language Models

Xia Du; Shuhan Sun; Pengyuan Liu (刘鹏远); Dong Yu (于东)

doi:10.18653/v1/2025.emnlp-main.395

Abstract

Although small Large Language Models (sLLMs) have been widely deployed in practical applications, little attention has been paid to their value-reasoning abilities, particularly in terms of reasoning reliability. To address this gap, we propose a systematic evaluation framework for assessing the Value-Reasoning Reliability of sLLMs. We define Value-Reasoning Reliability as comprising: (1) output consistency under identical prompts, (2) output robustness under semantically equivalent prompts, (3) stability of value reasoning in the face of attacks, and (4) consistency of value reasoning in open-ended value expression tasks. Our framework includes three core tasks: the Repetition Consistency task, the Interaction Stability task, and the Open-ended Expression Consistency task. We further incorporate self-reported confidence scores to evaluate the model’s value-reasoning reliability from two perspectives: the model’s self-awareness of its values, and its value-based decision-making. Our findings show that models vary significantly in their stability when responding to value-related questions. Moreover, we observe considerable output randomness, which is not always correlated with the self-reported confidence or expressed value preferences. This suggests that current models lack a reliable internal mechanism for stable value reasoning when addressing value-sensitive queries.
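
The first two components of the definition above (consistency under identical prompts, robustness under semantically equivalent prompts) can be illustrated with a minimal sketch. This is not the paper's metric: the abstract does not specify how consistency is scored, so the majority-agreement measure, the function names `repetition_consistency` and `paraphrase_robustness`, and the case-insensitive answer matching are all assumptions made for illustration.

```python
from collections import Counter
from typing import Sequence


def _normalize(answer: str) -> str:
    # Assumption: answers are short labels compared case-insensitively.
    return answer.strip().lower()


def repetition_consistency(answers: Sequence[str]) -> float:
    """Share of repeated runs that agree with the most frequent answer.

    Hypothetical measure for 'output consistency under identical prompts'.
    """
    if not answers:
        raise ValueError("answers must be non-empty")
    counts = Counter(_normalize(a) for a in answers)
    return counts.most_common(1)[0][1] / len(answers)


def paraphrase_robustness(answers_by_prompt: Sequence[Sequence[str]]) -> float:
    """Share of paraphrased prompts whose majority answer matches the most
    frequent majority answer across all paraphrases.

    Hypothetical measure for 'robustness under semantically equivalent prompts'.
    """
    if not answers_by_prompt or any(len(a) == 0 for a in answers_by_prompt):
        raise ValueError("every paraphrase needs at least one answer")
    majorities = [
        Counter(_normalize(a) for a in answers).most_common(1)[0][0]
        for answers in answers_by_prompt
    ]
    return Counter(majorities).most_common(1)[0][1] / len(majorities)


# Four runs of the same prompt; three of them agree after normalization.
runs = ["Agree", "agree", "Disagree", "Agree "]
print(repetition_consistency(runs))  # 0.75
```

A score of 1.0 from either function means every run (or every paraphrase) produced the same answer; lower values indicate the kind of output randomness the abstract reports.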

Anthology ID: 2025.emnlp-main.395
Volume: Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing
Month: November
Year: 2025
Address: Suzhou, China
Editors: Christos Christodoulopoulos, Tanmoy Chakraborty, Carolyn Rose, Violet Peng
Venue: EMNLP
Publisher: Association for Computational Linguistics
Pages: 7746–7786
URL: https://aclanthology.org/2025.emnlp-main.395/
DOI: 10.18653/v1/2025.emnlp-main.395
Cite (ACL): Xia Du, Shuhan Sun, Pengyuan Liu, and Dong Yu. 2025. Investigating Value-Reasoning Reliability in Small Large Language Models. In Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, pages 7746–7786, Suzhou, China. Association for Computational Linguistics.
Cite (Informal): Investigating Value-Reasoning Reliability in Small Large Language Models (Du et al., EMNLP 2025)
PDF: https://aclanthology.org/2025.emnlp-main.395.pdf
Checklist: 2025.emnlp-main.395.checklist.pdf
