Human Attention in Visual Question Answering: Do Humans and Deep Networks look at the same regions? Abhishek Das author Harsh Agrawal author Larry Zitnick author Devi Parikh author Dhruv Batra author 2016-11 text Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing Jian Su editor Kevin Duh editor Xavier Carreras editor Association for Computational Linguistics Austin, Texas conference publication das-etal-2016-human 10.18653/v1/D16-1092 https://aclanthology.org/D16-1092/ 2016-11 932 937