User-Assistant Bias in LLMs

Xu Pan; Jingxuan Fan; Zidi Xiong; Ely Hahami; Jorin Overwiening; Ziqian Xie

User-Assistant Bias in LLMs

Xu Pan, Jingxuan Fan, Zidi Xiong, Ely Hahami, Jorin Overwiening, Ziqian Xie

Abstract

Modern large language models (LLMs) are typically trained and deployed using structured role tags (e.g. system, user, assistant, tool) that explicitly mark the source of each piece of context. While these tags are essential for instruction following and controllability, asymmetries in the training data associated with different role tags can potentially introduce inductive biases. In this paper, we study this phenomenon by formalizing user–assistant bias, defined as the tendency of an LLM to preferentially rely on information from either the user or assistant role when they provide incompatible information about the same entity in the context history. We introduce a task-agnostic benchmark UserAssist and evaluate such bias in 52 frontier models. We observe that most of the instruction-tuned models exhibit strong user bias, whereas base and reasoning models are close to neutral. Using controlled fine-tuning experiments, we isolate which post-training recipes drive the observed user–assistant bias. We find that human-preference alignment amplifies user bias, while reasoning fine-tuning reduces it. Finally, we show that user–assistant bias can be bidirectionally controlled via direct preference optimization (DPO) on UserAssist-train, and that the resulting bias reliably generalizes to two realistic multi-turn debate datasets spanning philosophical opinions and natural argumentative exchanges on factual/policy topics. These results reveal an underexplored consequence of role-tagged training and provide a principled framework to diagnose and control tag-induced biases in modern LLMs.

Anthology ID:: 2026.findings-acl.449
Volume:: Findings of the Association for Computational Linguistics: ACL 2026
Month:: July
Year:: 2026
Address:: San Diego, California, United States
Editors:: Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:: Findings
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 9218–9241
Language:
URL:: https://aclanthology.org/2026.findings-acl.449/
DOI:
Bibkey:
Cite (ACL):: Xu Pan, Jingxuan Fan, Zidi Xiong, Ely Hahami, Jorin Overwiening, and Ziqian Xie. 2026. User-Assistant Bias in LLMs. In Findings of the Association for Computational Linguistics: ACL 2026, pages 9218–9241, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):: User-Assistant Bias in LLMs (Pan et al., Findings 2026)
Copy Citation:
PDF:: https://aclanthology.org/2026.findings-acl.449.pdf
Checklist:: 2026.findings-acl.449.checklist.pdf

PDF Cite Search Checklist Fix data