Text-to-Multimodal Retrieval with Bimodal Input Fusion in Shared Cross-Modal Transformer Pranav Arora author Selen Pehlivan author Jorma Laaksonen author 2024-05 text Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024) Nicoletta Calzolari editor Min-Yen Kan editor Veronique Hoste editor Alessandro Lenci editor Sakriani Sakti editor Nianwen Xue editor ELRA and ICCL Torino, Italia conference publication arora-etal-2024-text https://aclanthology.org/2024.lrec-main.1374/ 2024-05 15823 15834