Probabilistic Transformer: A Probabilistic Dependency Model for Contextual Word Representation Haoyi Wu author Kewei Tu author 2023-07 text Findings of the Association for Computational Linguistics: ACL 2023 Anna Rogers editor Jordan Boyd-Graber editor Naoaki Okazaki editor Association for Computational Linguistics Toronto, Canada conference publication wu-tu-2023-probabilistic 10.18653/v1/2023.findings-acl.482 https://aclanthology.org/2023.findings-acl.482/ 2023-07 7613 7636