Title: VideoCLIP: Contrastive Pre-training for Zero-shot Video-Text Understanding
Authors: Hu Xu, Gargi Ghosh, Po-Yao Huang, Dmytro Okhonko, Armen Aghajanyan, Florian Metze, Luke Zettlemoyer, Christoph Feichtenhofer
Date: 2021-11
Type: conference publication
Published in: Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing
Editors: Marie-Francine Moens, Xuanjing Huang, Lucia Specia, Scott Wen-tau Yih
Publisher: Association for Computational Linguistics
Location: Online and Punta Cana, Dominican Republic
Pages: 6787-6800
Citation key: xu-etal-2021-videoclip
DOI: 10.18653/v1/2021.emnlp-main.544
URL: https://aclanthology.org/2021.emnlp-main.544/