WISMIR3: A Multi-Modal Dataset to Challenge Text-Image Retrieval Approaches Florian Schneider author Chris Biemann author 2024-08 text Proceedings of the 3rd Workshop on Advances in Language and Vision Research (ALVR) Jing Gu editor Tsu-Jui (Ray) Fu editor Drew Hudson editor Asli Celikyilmaz editor William Wang editor Association for Computational Linguistics Bangkok, Thailand conference publication schneider-biemann-2024-wismir3 10.18653/v1/2024.alvr-1.1 https://aclanthology.org/2024.alvr-1.1/ 2024-08 1 6