Align-to-Distill: Trainable Attention Alignment for Knowledge Distillation in Neural Machine Translation Heegon Jin author Seonil Son author Jemin Park author Youngseok Kim author Hyungjong Noh author Yeonsoo Lee author 2024-05 text Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024) Nicoletta Calzolari editor Min-Yen Kan editor Veronique Hoste editor Alessandro Lenci editor Sakriani Sakti editor Nianwen Xue editor ELRA and ICCL Torino, Italia conference publication jin-etal-2024-align https://aclanthology.org/2024.lrec-main.64/ 2024-05 722 732