Causal and Temporal Inference in Visual Question Generation by Utilizing Pre-trained Models Zhanghao Hu author Frank Keller author 2024-08 text Proceedings of the 3rd Workshop on Advances in Language and Vision Research (ALVR) Jing Gu editor Tsu-Jui (Ray) Fu editor Drew Hudson editor Asli Celikyilmaz editor William Wang editor Association for Computational Linguistics Bangkok, Thailand conference publication hu-keller-2024-causal 10.18653/v1/2024.alvr-1.12 https://aclanthology.org/2024.alvr-1.12/ 2024-08 138 154