VisLingInstruct: Elevating Zero-Shot Learning in Multi-Modal Language Models with Autonomous Instruction Optimization

Authors: Dongsheng Zhu, Daniel Tang, Weidong Han, Jinghui Lu, Yukun Zhao, Guoliang Xing, Junfeng Wang, Dawei Yin
Published: 2024-06
In: Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers)
Editors: Kevin Duh, Helena Gomez, Steven Bethard
Publisher: Association for Computational Linguistics
Location: Mexico City, Mexico
Type: conference publication
Pages: 2122-2135
Citation key: zhu-etal-2024-vislinginstruct
DOI: 10.18653/v1/2024.naacl-long.117
URL: https://aclanthology.org/2024.naacl-long.117/