TESTA: Temporal-Spatial Token Aggregation for Long-form Video-Language Understanding Shuhuai Ren author Sishuo Chen author Shicheng Li author Xu Sun author Lu Hou author 2023-12 text Findings of the Association for Computational Linguistics: EMNLP 2023 Houda Bouamor editor Juan Pino editor Kalika Bali editor Association for Computational Linguistics Singapore conference publication ren-etal-2023-testa 10.18653/v1/2023.findings-emnlp.66 https://aclanthology.org/2023.findings-emnlp.66/ 2023-12 932 947