The Belebele Benchmark: a Parallel Reading Comprehension Dataset in 122 Language Variants Lucas Bandarkar author Davis Liang author Benjamin Muller author Mikel Artetxe author Satya Narayan Shukla author Donald Husa author Naman Goyal author Abhinandan Krishnan author Luke Zettlemoyer author Madian Khabsa author 2024-08 text Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) Lun-Wei Ku editor Andre Martins editor Vivek Srikumar editor Association for Computational Linguistics Bangkok, Thailand conference publication bandarkar-etal-2024-belebele 10.18653/v1/2024.acl-long.44 https://aclanthology.org/2024.acl-long.44/ 2024-08 749 775