VSTAR: A Video-grounded Dialogue Dataset for Situated Semantic Understanding with Scene and Topic Transitions Yuxuan Wang author Zilong Zheng author Xueliang Zhao author Jinpeng Li author Yueqian Wang author Dongyan Zhao author 2023-07 text Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) Anna Rogers editor Jordan Boyd-Graber editor Naoaki Okazaki editor Association for Computational Linguistics Toronto, Canada conference publication wang-etal-2023-vstar 10.18653/v1/2023.acl-long.276 https://aclanthology.org/2023.acl-long.276/ 2023-07 5036 5048