David Ifeoluwa Adelani, Hannah Liu, Xiaoyu Shen, Nikita Vassilyev, Jesujoba O Alabi, Yanke Mao, Haonan Gao, and En-Shiun Annie Lee. 2024. SIB-200: A Simple, Inclusive, and Big Evaluation Dataset for Topic Classification in 200+ Languages and Dialects. In Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics (Volume 1: Long Papers), edited by Yvette Graham and Matthew Purver, pages 226-245, St. Julian’s, Malta, March 2024. Association for Computational Linguistics. Conference publication. Anthology ID: adelani-etal-2024-sib. DOI: 10.18653/v1/2024.eacl-long.14. URL: https://aclanthology.org/2024.eacl-long.14/