LLMs can be easily Confused by Instructional Distractions

Yerin Hwang; Yongil Kim; Jahyun Koo; Taegwan Kang; Hyunkyung Bae; Kyomin Jung

doi:10.18653/v1/2025.acl-long.957

LLMs can be easily Confused by Instructional Distractions

Yerin Hwang, Yongil Kim, Jahyun Koo, Taegwan Kang, Hyunkyung Bae, Kyomin Jung

Abstract

Despite the fact that large language models (LLMs) show exceptional skill in instruction following tasks, this strength can turn into a vulnerability when the models are required to disregard certain instructions. Instruction following tasks typically involve a clear task description and input text containing the target data to be processed. However, when the input itself resembles an instruction, confusion may arise, even if there is explicit prompting to distinguish between the task instruction and the input. We refer to this phenomenon as instructional distraction. In this paper, we introduce a novel benchmark, named **DIM-Bench**, specifically designed to assess LLMs’ performance under instructional distraction. The benchmark categorizes real-world instances of instructional distraction and evaluates LLMs across four instruction tasks: proofreading, rewriting, translation, and style transfer—alongside five input tasks: reasoning, code generation, mathematical reasoning, bias detection, and question answering. Our experimental results reveal that even the most advanced LLMs are susceptible to instructional distraction, often failing to accurately follow user intent in such cases.

Anthology ID:: 2025.acl-long.957
Erratum e1:: 2025.acl-long.957e1
Volume:: Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Month:: July
Year:: 2025
Address:: Vienna, Austria
Editors:: Wanxiang Che, Joyce Nabende, Ekaterina Shutova, Mohammad Taher Pilehvar
Venue:: ACL
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 19483–19496
Language:
URL:: https://aclanthology.org/2025.acl-long.957/
DOI:: 10.18653/v1/2025.acl-long.957
Bibkey:
Cite (ACL):: Yerin Hwang, Yongil Kim, Jahyun Koo, Taegwan Kang, Hyunkyung Bae, and Kyomin Jung. 2025. LLMs can be easily Confused by Instructional Distractions. In Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 19483–19496, Vienna, Austria. Association for Computational Linguistics.
Cite (Informal):: LLMs can be easily Confused by Instructional Distractions (Hwang et al., ACL 2025)
Copy Citation:
PDF:: https://aclanthology.org/2025.acl-long.957.pdf

PDF Cite Search Fix data