Title: Symmetric Dot-Product Attention for Efficient Training of BERT Language Models
Authors: Martin Courtois, Malte Ostendorff, Leonhard Hennig, Georg Rehm
Published in: Findings of the Association for Computational Linguistics: ACL 2024
Editors: Lun-Wei Ku, Andre Martins, Vivek Srikumar
Publisher: Association for Computational Linguistics
Place: Bangkok, Thailand
Date: 2024-08
Type: conference publication
Pages: 8002-8011
ID: courtois-etal-2024-symmetric
DOI: 10.18653/v1/2024.findings-acl.476
URL: https://aclanthology.org/2024.findings-acl.476/