%0 Conference Proceedings
%T VideoINSTA: Zero-shot Long Video Understanding via Informative Spatial-Temporal Reasoning with LLMs
%A Liao, Ruotong
%A Erler, Max
%A Wang, Huiyu
%A Zhai, Guangyao
%A Zhang, Gengyuan
%A Ma, Yunpu
%A Tresp, Volker
%Y Al-Onaizan, Yaser
%Y Bansal, Mohit
%Y Chen, Yun-Nung
%S Findings of the Association for Computational Linguistics: EMNLP 2024
%D 2024
%8 November
%I Association for Computational Linguistics
%C Miami, Florida, USA
%F liao-etal-2024-videoinsta
%R 10.18653/v1/2024.findings-emnlp.384
%U https://aclanthology.org/2024.findings-emnlp.384/
%U https://doi.org/10.18653/v1/2024.findings-emnlp.384
%P 6577-6602