HERO: Hierarchical Encoder for Video+Language Omni-representation Pre-training
Authors: Linjie Li, Yen-Chun Chen, Yu Cheng, Zhe Gan, Licheng Yu, Jingjing Liu
In: Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)
Editors: Bonnie Webber, Trevor Cohn, Yulan He, Yang Liu
Publisher: Association for Computational Linguistics
Location: Online
Date: 2020-11
Type: conference publication
Pages: 2046-2065
Citation key: li-etal-2020-hero
DOI: 10.18653/v1/2020.emnlp-main.161
URL: https://aclanthology.org/2020.emnlp-main.161/