Understanding and Improving Knowledge Distillation for Quantization Aware Training of Large Transformer Encoders Minsoo Kim author Sihwa Lee author Suk-Jin Hong author Du-Seong Chang author Jungwook Choi author 2022-12 text Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing Yoav Goldberg editor Zornitsa Kozareva editor Yue Zhang editor Association for Computational Linguistics Abu Dhabi, United Arab Emirates conference publication kim-etal-2022-understanding 10.18653/v1/2022.emnlp-main.450 https://aclanthology.org/2022.emnlp-main.450/ 2022-12 6713 6725