Title: Length Generalization of Causal Transformers without Position Encoding
Authors: Jie Wang, Tao Ji, Yuanbin Wu, Hang Yan, Tao Gui, Qi Zhang, Xuanjing Huang, Xiaoling Wang
Editors: Lun-Wei Ku, Andre Martins, Vivek Srikumar
Venue: Findings of the Association for Computational Linguistics: ACL 2024
Publisher: Association for Computational Linguistics
Location: Bangkok, Thailand
Date: 2024-08
Type: conference publication
Pages: 14024-14040
Anthology ID: wang-etal-2024-length
DOI: 10.18653/v1/2024.findings-acl.834
URL: https://aclanthology.org/2024.findings-acl.834/