Title: Compressing Large-Scale Transformer-Based Models: A Case Study on BERT
Authors: Prakhar Ganesh, Yao Chen, Xin Lou, Mohammad Ali Khan, Yin Yang, Hassan Sajjad, Preslav Nakov, Deming Chen, Marianne Winslett
Year: 2021
Type: journal article
Journal: Transactions of the Association for Computational Linguistics
Publisher: MIT Press, Cambridge, MA
Volume: 9
Pages: 1061-1080
Citation key: ganesh-etal-2021-compressing
DOI: 10.1162/tacl_a_00413
URL: https://aclanthology.org/2021.tacl-1.63/