Self-Distillation Bridges Distribution Gap in Language Model Fine-Tuning Zhaorui Yang author Tianyu Pang author Haozhe Feng author Han Wang author Wei Chen author Minfeng Zhu author Qian Liu author 2024-08 text Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) Lun-Wei Ku editor Andre Martins editor Vivek Srikumar editor Association for Computational Linguistics Bangkok, Thailand conference publication yang-etal-2024-self 10.18653/v1/2024.acl-long.58 https://aclanthology.org/2024.acl-long.58/ 2024-08 1028 1043