Efficient Long-Range Transformers: You Need to Attend More, but Not Necessarily at Every Layer Qingru Zhang author Dhananjay Ram author Cole Hawkins author Sheng Zha author Tuo Zhao author 2023-12 text Findings of the Association for Computational Linguistics: EMNLP 2023 Houda Bouamor editor Juan Pino editor Kalika Bali editor Association for Computational Linguistics Singapore conference publication zhang-etal-2023-efficient-long 10.18653/v1/2023.findings-emnlp.183 https://aclanthology.org/2023.findings-emnlp.183/ 2023-12 2775 2786