Title: LLaVA-RE: Binary Image-Text Relevancy Evaluation with Multimodal Large Language Model
Authors: Tao Sun, Oliver Liu, JinJin Li, Lan Ma
Date: 2025-01
In: Proceedings of the First Workshop of Evaluation of Multi-Modal Generation
Editors: Wei Emma Zhang, Xiang Dai, Desmond Elliot, Byron Fang, Mongyuan Sim, Haojie Zhuang, Weitong Chen
Publisher: Association for Computational Linguistics
Location: Abu Dhabi, UAE
Type: conference publication
Pages: 40-51
ID: sun-etal-2025-llava
URL: https://aclanthology.org/2025.evalmg-1.4/