Junyang Lin, Xu Sun, Xuancheng Ren, Muyu Li, and Qi Su. 2018. Learning When to Concentrate or Divert Attention: Self-Adaptive Attention Temperature for Neural Machine Translation. In Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, edited by Ellen Riloff, David Chiang, Julia Hockenmaier, and Jun’ichi Tsujii, pages 2985-2990, Brussels, Belgium, October-November 2018. Association for Computational Linguistics. Conference publication. Citation key: lin-etal-2018-learning. DOI: 10.18653/v1/D18-1331. URL: https://aclanthology.org/D18-1331/