Multi-modal Semantic Understanding with Contrastive Cross-modal Feature Alignment Ming Zhang author Ke Chang author Yunfang Wu author 2024-05 text Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024) Nicoletta Calzolari editor Min-Yen Kan editor Veronique Hoste editor Alessandro Lenci editor Sakriani Sakti editor Nianwen Xue editor ELRA and ICCL Torino, Italia conference publication zhang-etal-2024-multi https://aclanthology.org/2024.lrec-main.1042/ 2024-05 11934 11943