Why Skip If You Can Combine: A Simple Knowledge Distillation Technique for Intermediate Layers Yimeng Wu author Peyman Passban author Mehdi Rezagholizadeh author Qun Liu author 2020-11 text Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP) Bonnie Webber editor Trevor Cohn editor Yulan He editor Yang Liu editor Association for Computational Linguistics Online conference publication wu-etal-2020-skip 10.18653/v1/2020.emnlp-main.74 https://aclanthology.org/2020.emnlp-main.74/ 2020-11 1016 1021