Title: Multimodal ArXiv: A Dataset for Improving Scientific Comprehension of Large Vision-Language Models
Authors: Lei Li, Yuqi Wang, Runxin Xu, Peiyi Wang, Xiachong Feng, Lingpeng Kong, Qi Liu
Date: 2024-08
Type: text (conference publication)
Published in: Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Editors: Lun-Wei Ku, Andre Martins, Vivek Srikumar
Publisher: Association for Computational Linguistics
Location: Bangkok, Thailand
Pages: 14369-14387
Citation key: li-etal-2024-multimodal-arxiv
DOI: 10.18653/v1/2024.acl-long.775
URL: https://aclanthology.org/2024.acl-long.775/