Title: Localization vs. Semantics: Visual Representations in Unimodal and Multimodal Models
Authors: Zhuowan Li, Cihang Xie, Benjamin Van Durme, Alan Yuille
Date: 2024-03
Resource type: text
Published in: Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics (Volume 1: Long Papers)
Editors: Yvette Graham, Matthew Purver
Publisher: Association for Computational Linguistics
Location: St. Julian’s, Malta
Genre: conference publication
Identifier: li-etal-2024-localization
DOI: 10.18653/v1/2024.eacl-long.146
URL: https://aclanthology.org/2024.eacl-long.146/
Pages: 2378-2390