Title: DocSplit: Simple Contrastive Pretraining for Large Document Embeddings
Authors: Yujie Wang, Mike Izbicki
Date: 2023-12
Type: text (conference publication)
Published in: Findings of the Association for Computational Linguistics: EMNLP 2023
Editors: Houda Bouamor, Juan Pino, Kalika Bali
Publisher: Association for Computational Linguistics
Place: Singapore
Identifier: wang-izbicki-2023-docsplit
DOI: 10.18653/v1/2023.findings-emnlp.945
URL: https://aclanthology.org/2023.findings-emnlp.945/
Pages: 14190-14196