Generating Image Descriptions via Sequential Cross-Modal Alignment Guided by Human Gaze Ece Takmaz author Sandro Pezzelle author Lisa Beinborn author Raquel Fernández author 2020-11 text Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP) Bonnie Webber editor Trevor Cohn editor Yulan He editor Yang Liu editor Association for Computational Linguistics Online conference publication takmaz-etal-2020-generating 10.18653/v1/2020.emnlp-main.377 https://aclanthology.org/2020.emnlp-main.377/ 2020-11 4664 4677