mPLUG: Effective and Efficient Vision-Language Learning by Cross-modal Skip-connections Chenliang Li author Haiyang Xu author Junfeng Tian author Wei Wang author Ming Yan author Bin Bi author Jiabo Ye author He Chen author Guohai Xu author Zheng Cao author Ji Zhang author Songfang Huang author Fei Huang author Jingren Zhou author Luo Si author 2022-12 text Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing Yoav Goldberg editor Zornitsa Kozareva editor Yue Zhang editor Association for Computational Linguistics Abu Dhabi, United Arab Emirates conference publication li-etal-2022-mplug 10.18653/v1/2022.emnlp-main.488 https://aclanthology.org/2022.emnlp-main.488/ 2022-12 7241 7259