Accelerating Neural Transformer via an Average Attention Network Biao Zhang author Deyi Xiong author Jinsong Su author 2018-07 text Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) Iryna Gurevych editor Yusuke Miyao editor Association for Computational Linguistics Melbourne, Australia conference publication zhang-etal-2018-accelerating 10.18653/v1/P18-1166 https://aclanthology.org/P18-1166/ 2018-07 1789 1798