Efficient Sparse Attention needs Adaptive Token Release

Efficient Sparse Attention needs Adaptive Token Release Chaoran Zhang author Lixin Zou author Dan Luo author Xiangyang Luo author Zihao Li author Min Tang author Chenliang Li author 2024-08 text Findings of the Association for Computational Linguistics: ACL 2024 Lun-Wei Ku editor Andre Martins editor Vivek Srikumar editor Association for Computational Linguistics Bangkok, Thailand conference publication zhang-etal-2024-efficient 10.18653/v1/2024.findings-acl.837 https://aclanthology.org/2024.findings-acl.837/ 2024-08 14081 14094