Data Similarity is Not Enough to Explain Language Model Performance Gregory Yauney author Emily Reif author David Mimno author 2023-12 text Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing Houda Bouamor editor Juan Pino editor Kalika Bali editor Association for Computational Linguistics Singapore conference publication yauney-etal-2023-data 10.18653/v1/2023.emnlp-main.695 https://aclanthology.org/2023.emnlp-main.695/ 2023-12 11295 11304