Resonant Context Anchoring: Decoupling Attention Routing and Signal Gain at Inference Time

Mingkuan Zhao; Yide Gao; Wentao Hu; Suquan Chen; Tianchen Huang; Zhenhua An; Zetao Chang; Xiayu Sun; Yuheng Min

Resonant Context Anchoring: Decoupling Attention Routing and Signal Gain at Inference Time

Mingkuan Zhao, Yide Gao, Wentao Hu, Suquan Chen, Tianchen Huang, Zhenhua An, Zetao Chang, Xiayu Sun, Yuheng Min

Abstract

Large Language Models (LLMs) frequently exhibit “contextual disregard” when faced with input evidence that conflicts with their internal parametric memory, leading to persistent factual hallucinations. Existing mitigation strategies primarily rely on suppressing specific neuron activations or employing computationally expensive contrastive decoding mechanisms, which often result in increased perplexity or significantly elevated inference latency. To address these limitations, we propose Resonant Context Anchoring (RCA), a lightweight inference-time intervention method grounded in the perspective of residual stream signal dynamics. RCA aims to resolve the signal attenuation of external evidence during its propagation through deep networks. The core mechanism involves the orthogonal decoupling of routing logic and information magnitude within the self-attention module. By utilizing raw pre-softmax attention scores as an instantaneous metric of semantic alignment, we construct a dynamic gain field via non-linear rectification to selectively amplify the norms of value vectors corresponding to context tokens, without altering the attention probability distribution. This mechanism effectively elevates the signal-to-noise ratio (SNR) of input evidence within the residual stream mixture, thereby robustly anchoring the generation trajectory to the truthful context during inference. Extensive experiments on the Llama-3 model series demonstrate that RCA significantly improves contextual faithfulness across multiple factual consistency and strong knowledge-conflict tasks, effectively suppressing parametric hallucinations. Furthermore, results confirm that as a training-free and computationally negligible plug-and-play module, RCA achieves a Pareto improvement in faithfulness and fluency while maintaining the model’s general language understanding capabilities. Our code is available at https://anonymous.4open.science/r/RCA-Implementation-D8B5

Anthology ID:: 2026.findings-acl.1824
Volume:: Findings of the Association for Computational Linguistics: ACL 2026
Month:: July
Year:: 2026
Address:: San Diego, California, United States
Editors:: Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:: Findings
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 36600–36611
Language:
URL:: https://aclanthology.org/2026.findings-acl.1824/
DOI:
Bibkey:
Cite (ACL):: Mingkuan Zhao, Yide Gao, Wentao Hu, Suquan Chen, Tianchen Huang, Zhenhua An, Zetao Chang, Xiayu Sun, and Yuheng Min. 2026. Resonant Context Anchoring: Decoupling Attention Routing and Signal Gain at Inference Time. In Findings of the Association for Computational Linguistics: ACL 2026, pages 36600–36611, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):: Resonant Context Anchoring: Decoupling Attention Routing and Signal Gain at Inference Time (Zhao et al., Findings 2026)
Copy Citation:
PDF:: https://aclanthology.org/2026.findings-acl.1824.pdf
Checklist:: 2026.findings-acl.1824.checklist.pdf

PDF Cite Search Checklist Fix data