Title: VideoCoT: A Video Chain-of-Thought Dataset with Active Annotation Tool
Authors: Yan Wang, Yawen Zeng, Jingsheng Zheng, Xiaofen Xing, Jin Xu, Xiangmin Xu
Date: 2024-08
Venue: Proceedings of the 3rd Workshop on Advances in Language and Vision Research (ALVR)
Editors: Jing Gu, Tsu-Jui (Ray) Fu, Drew Hudson, Asli Celikyilmaz, William Wang
Publisher: Association for Computational Linguistics
Location: Bangkok, Thailand
Type: conference publication
Pages: 92-101
Identifier: wang-etal-2024-videocot
DOI: 10.18653/v1/2024.alvr-1.8
URL: https://aclanthology.org/2024.alvr-1.8/