LiteVL: Efficient Video-Language Learning with Enhanced Spatial-Temporal Modeling Dongsheng Chen author Chaofan Tao author Lu Hou author Lifeng Shang author Xin Jiang author Qun Liu author 2022-12 text Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing Yoav Goldberg editor Zornitsa Kozareva editor Yue Zhang editor Association for Computational Linguistics Abu Dhabi, United Arab Emirates conference publication chen-etal-2022-litevl 10.18653/v1/2022.emnlp-main.545 https://aclanthology.org/2022.emnlp-main.545/ 2022-12 7985 7997