Autoregressive Knowledge Distillation through Imitation Learning Alexander Lin author Jeremy Wohlwend author Howard Chen author Tao Lei author 2020-11 text Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP) Bonnie Webber editor Trevor Cohn editor Yulan He editor Yang Liu editor Association for Computational Linguistics Online conference publication lin-etal-2020-autoregressive 10.18653/v1/2020.emnlp-main.494 https://aclanthology.org/2020.emnlp-main.494/ 2020-11 6121 6133