Cross-modal Attention Congruence Regularization for Vision-Language Relation Alignment Rohan Pandey author Rulin Shao author Paul Pu Liang author Ruslan Salakhutdinov author Louis-Philippe Morency author 2023-07 text Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) Anna Rogers editor Jordan Boyd-Graber editor Naoaki Okazaki editor Association for Computational Linguistics Toronto, Canada conference publication pandey-etal-2023-cross 10.18653/v1/2023.acl-long.298 https://aclanthology.org/2023.acl-long.298/ 2023-07 5444 5455