Does my multimodal model learn cross-modal interactions? It’s harder to tell than you might think! Jack Hessel author Lillian Lee author 2020-11 text Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP) Bonnie Webber editor Trevor Cohn editor Yulan He editor Yang Liu editor Association for Computational Linguistics Online conference publication hessel-lee-2020-multimodal 10.18653/v1/2020.emnlp-main.62 https://aclanthology.org/2020.emnlp-main.62/ 2020-11 861 877