XDoc: Unified Pre-training for Cross-Format Document Understanding Jingye Chen author Tengchao Lv author Lei Cui author Cha Zhang author Furu Wei author 2022-12 text Findings of the Association for Computational Linguistics: EMNLP 2022 Yoav Goldberg editor Zornitsa Kozareva editor Yue Zhang editor Association for Computational Linguistics Abu Dhabi, United Arab Emirates conference publication chen-etal-2022-xdoc 10.18653/v1/2022.findings-emnlp.71 https://aclanthology.org/2022.findings-emnlp.71/ 2022-12 1006 1016