CODIS: Benchmarking Context-dependent Visual Comprehension for Multimodal Large Language Models

Authors: Fuwen Luo, Chi Chen, Zihao Wan, Zhaolu Kang, Qidong Yan, Yingjie Li, Xiaolong Wang, Siyu Wang, Ziyue Wang, Xiaoyue Mi, Peng Li, Ning Ma, Maosong Sun, Yang Liu
Published: 2024-08
Venue: Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Editors: Lun-Wei Ku, Andre Martins, Vivek Srikumar
Publisher: Association for Computational Linguistics
Location: Bangkok, Thailand
Type: conference publication
Identifier: luo-etal-2024-codis
DOI: 10.18653/v1/2024.acl-long.573
URL: https://aclanthology.org/2024.acl-long.573/
Pages: 10639-10659