Do Language Models Think Consistently? A Study of Value Preferences Across Varying Response Lengths

Inderjeet Jayakumar Nair; Lu Wang

Do Language Models Think Consistently? A Study of Value Preferences Across Varying Response Lengths

Abstract

Evaluations of LLMs’ ethical risks and value inclinations often rely on short-form surveys and psychometric tests, yet real-world use involves long-form, open-ended responses—leaving value-related risks and preferences in practical settings largely underexplored. In this work, we ask: Do value preferences inferred from short-form tests align with those expressed in long-form outputs? To address this question, we compare value preferences elicited from short-form reactions and long-form responses, varying the number of arguments in the latter to capture users’ differing verbosity preferences. Analyzing five LLMs (llama3-8b, gemma2-9b, mistral-7b, qwen2-7b, and olmo-7b), we find (1) a weak correlation between value preferences inferred from short-form and long-form responses across varying argument counts, and (2) similarly weak correlation between preferences derived from any two distinct long-form generation settings. (3 Alignment yields only modest gains in the consistency of value expression. Further, we examine how long-form generation attributes relate to value preferences, finding that argument specificity negatively correlates with preference strength, while representation across scenarios shows a positive correlation. Our findings underscore the need for more robust methods to ensure consistent value expression across diverse applications.

Anthology ID:: 2026.findings-acl.514
Volume:: Findings of the Association for Computational Linguistics: ACL 2026
Month:: July
Year:: 2026
Address:: San Diego, California, United States
Editors:: Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:: Findings
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 10584–10613
Language:
URL:: https://aclanthology.org/2026.findings-acl.514/
DOI:
Bibkey:
Cite (ACL):: Inderjeet Jayakumar Nair and Lu Wang. 2026. Do Language Models Think Consistently? A Study of Value Preferences Across Varying Response Lengths. In Findings of the Association for Computational Linguistics: ACL 2026, pages 10584–10613, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):: Do Language Models Think Consistently? A Study of Value Preferences Across Varying Response Lengths (Nair & Wang, Findings 2026)
Copy Citation:
PDF:: https://aclanthology.org/2026.findings-acl.514.pdf
Checklist:: 2026.findings-acl.514.checklist.pdf

PDF Cite Search Checklist Fix data