SCOPE: Boosting LLM Efficiency with Scoped Position Encoding

Qingguo Qi; Hongyang Chen; Zhao Li

SCOPE: Boosting LLM Efficiency with Scoped Position Encoding

Abstract

Positional encodings are fundamental to Transformers, yet explicit methods like RoPE can degrade under length extrapolation and may incur extra arithmetic and memory-access overhead. In this paper, we propose Scoped Position Encoding (ScoPE), a novel framework that reimagines structured sparsity as an intrinsic position encoding mechanism. Instead of relying on explicit arithmetic signals, ScoPE assigns exponentially scaled look-back scopes to attention heads. We theoretically demonstrate that this simple topological constraint transforms multi-head attention into a hierarchical processor, yielding an order awareness horizon that grows exponentially with depth up to the sequence length. Consequently, ScoPE is parameter-free and avoids relying on fragile positional arithmetic. Empirically, it significantly enhances efficiency by masking the majority of attention computations, offering a theoretical 8x reduction in attention FLOPs at long contexts. Extensive evaluations on LLaMA-3-8B architectures reveal that ScoPE achieves superior native length extrapolation and robust retrieval fidelity compared to RoPE, all while substantially reducing training and inference latency. The code is available at https://github.com/oncemoe/ScoPE.

Anthology ID:: 2026.acl-long.1650
Volume:: Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Month:: July
Year:: 2026
Address:: San Diego, California, United States
Editors:: Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:: ACL
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 35653–35673
Language:
URL:: https://aclanthology.org/2026.acl-long.1650/
DOI:
Bibkey:
Cite (ACL):: Qingguo Qi, Hongyang Chen, and Zhao Li. 2026. SCOPE: Boosting LLM Efficiency with Scoped Position Encoding. In Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 35653–35673, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):: SCOPE: Boosting LLM Efficiency with Scoped Position Encoding (Qi et al., ACL 2026)
Copy Citation:
PDF:: https://aclanthology.org/2026.acl-long.1650.pdf
Checklist:: 2026.acl-long.1650.checklist.pdf

PDF Cite Search Checklist Fix data