Title: A Multiscale Visualization of Attention in the Transformer Model
Author: Jesse Vig
Date: 2019-07
Type: text
Genre: conference publication
Published in: Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics: System Demonstrations
Editors: Marta R Costa-jussà, Enrique Alfonseca
Publisher: Association for Computational Linguistics
Location: Florence, Italy
Pages: 37-42
Citation key: vig-2019-multiscale
DOI: 10.18653/v1/P19-3007
URL: https://aclanthology.org/P19-3007/