ManagerTower: Aggregating the Insights of Uni-Modal Experts for Vision-Language Representation Learning Xiao Xu author Bei Li author Chenfei Wu author Shao-Yen Tseng author Anahita Bhiwandiwalla author Shachar Rosenman author Vasudev Lal author Wanxiang Che author Nan Duan author 2023-07 text Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) Anna Rogers editor Jordan Boyd-Graber editor Naoaki Okazaki editor Association for Computational Linguistics Toronto, Canada conference publication xu-etal-2023-managertower 10.18653/v1/2023.acl-long.811 https://aclanthology.org/2023.acl-long.811/ 2023-07 14507 14525