Understanding the Difficulty of Training Transformers Liyuan Liu author Xiaodong Liu author Jianfeng Gao author Weizhu Chen author Jiawei Han author 2020-11 text Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP) Bonnie Webber editor Trevor Cohn editor Yulan He editor Yang Liu editor Association for Computational Linguistics Online conference publication liu-etal-2020-understanding 10.18653/v1/2020.emnlp-main.463 https://aclanthology.org/2020.emnlp-main.463/ 2020-11 5747 5763