M2D2: A Massively Multi-Domain Language Modeling Dataset Machel Reid author Victor Zhong author Suchin Gururangan author Luke Zettlemoyer author 2022-12 text Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing Yoav Goldberg editor Zornitsa Kozareva editor Yue Zhang editor Association for Computational Linguistics Abu Dhabi, United Arab Emirates conference publication reid-etal-2022-m2d2 10.18653/v1/2022.emnlp-main.63 https://aclanthology.org/2022.emnlp-main.63/ 2022-12 964 975