Efficient Sparse Attention needs Adaptive Token Release Chaoran Zhang author Lixin Zou author Dan Luo author Xiangyang Luo author Zihao Li author Min Tang author Chenliang Li author 2024-08 text Findings of the Association for Computational Linguistics: ACL 2024 Lun-Wei Ku editor Andre Martins editor Vivek Srikumar editor Association for Computational Linguistics Bangkok, Thailand conference publication zhang-etal-2024-efficient 10.18653/v1/2024.findings-acl.837 https://aclanthology.org/2024.findings-acl.837/ 2024-08 14081 14094