MASSIVE: A 1M-Example Multilingual Natural Language Understanding Dataset with 51 Typologically-Diverse Languages Jack FitzGerald author Christopher Hench author Charith Peris author Scott Mackie author Kay Rottmann author Ana Sanchez author Aaron Nash author Liam Urbach author Vishesh Kakarala author Richa Singh author Swetha Ranganath author Laurie Crist author Misha Britan author Wouter Leeuwis author Gokhan Tur author Prem Natarajan author 2023-07 text Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) Anna Rogers editor Jordan Boyd-Graber editor Naoaki Okazaki editor Association for Computational Linguistics Toronto, Canada conference publication fitzgerald-etal-2023-massive 10.18653/v1/2023.acl-long.235 https://aclanthology.org/2023.acl-long.235/ 2023-07 4277 4302