G-DIG: Towards Gradient-based DIverse and hiGh-quality Instruction Data Selection for Machine Translation Xingyuan Pan author Luyang Huang author Liyan Kang author Zhicheng Liu author Yu Lu author Shanbo Cheng author 2024-08 text Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) Lun-Wei Ku editor Andre Martins editor Vivek Srikumar editor Association for Computational Linguistics Bangkok, Thailand conference publication pan-etal-2024-g 10.18653/v1/2024.acl-long.821 https://aclanthology.org/2024.acl-long.821/ 2024-08 15395 15406