Title: A Comparative Analysis of Task-Agnostic Distillation Methods for Compressing Transformer Language Models
Authors: Takuma Udagawa, Aashka Trivedi, Michele Merler, Bishwaranjan Bhattacharjee
Date: 2023-12
Resource type: text
Genre: conference publication
Published in: Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing: Industry Track
Editors: Mingxuan Wang, Imed Zitouni
Publisher: Association for Computational Linguistics
Place: Singapore
Pages: 20-31
Identifier: udagawa-etal-2023-comparative
DOI: 10.18653/v1/2023.emnlp-industry.3
URL: https://aclanthology.org/2023.emnlp-industry.3/