Title: End-to-End Unsupervised Vision-and-Language Pre-training with Referring Expression Matching
Authors: Chi Chen, Peng Li, Maosong Sun, Yang Liu
Date: 2022-12
Type: text (conference publication)
Venue: Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing
Editors: Yoav Goldberg, Zornitsa Kozareva, Yue Zhang
Publisher: Association for Computational Linguistics
Location: Abu Dhabi, United Arab Emirates
ID: chen-etal-2022-end
DOI: 10.18653/v1/2022.emnlp-main.742
URL: https://aclanthology.org/2022.emnlp-main.742/
Pages: 10799-10810