X-LXMERT: Paint, Caption and Answer Questions with Multi-Modal Transformers Jaemin Cho author Jiasen Lu author Dustin Schwenk author Hannaneh Hajishirzi author Aniruddha Kembhavi author 2020-11 text Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP) Bonnie Webber editor Trevor Cohn editor Yulan He editor Yang Liu editor Association for Computational Linguistics Online conference publication cho-etal-2020-x 10.18653/v1/2020.emnlp-main.707 https://aclanthology.org/2020.emnlp-main.707/ 2020-11 8785 8805