If I feel smart, I will do the right thing: Combining Complementary Multimodal Information in Visual Language Models

Authors: Yuyu Bai, Sandro Pezzelle
Published in: Proceedings of the First Workshop of Evaluation of Multi-Modal Generation
Editors: Wei Emma Zhang, Xiang Dai, Desmond Elliot, Byron Fang, Mongyuan Sim, Haojie Zhuang, Weitong Chen
Publisher: Association for Computational Linguistics
Location: Abu Dhabi, UAE
Date: 2025-01
Pages: 24-39
Type: conference publication
Citation key: bai-pezzelle-2025-feel
URL: https://aclanthology.org/2025.evalmg-1.3/