Fatoumata Kabore
2022
MasakhaNER 2.0: Africa-centric Transfer Learning for Named Entity Recognition
David Ifeoluwa Adelani | Graham Neubig | Sebastian Ruder | Shruti Rijhwani | Michael Beukman | Chester Palen-Michel | Constantine Lignos | Jesujoba O. Alabi | Shamsuddeen H. Muhammad | Peter Nabende | Cheikh M. Bamba Dione | Andiswa Bukula | Rooweither Mabuya | Bonaventure F. P. Dossou | Blessing Sibanda | Happy Buzaaba | Jonathan Mukiibi | Godson Kalipe | Derguene Mbaye | Amelia Taylor | Fatoumata Kabore | Chris Chinenye Emezue | Anuoluwapo Aremu | Perez Ogayo | Catherine Gitau | Edwin Munkoh-Buabeng | Victoire Memdjokam Koagne | Allahsera Auguste Tapo | Tebogo Macucwa | Vukosi Marivate | Elvis Mboning | Tajuddeen Gwadabe | Tosin Adewumi | Orevaoghene Ahia | Joyce Nakatumba-Nabende | Neo L. Mokono | Ignatius Ezeani | Chiamaka Chukwuneke | Mofetoluwa Adeyemi | Gilles Q. Hacheme | Idris Abdulmumin | Odunayo Ogundepo | Oreen Yousuf | Tatiana Moteu Ngoli | Dietrich Klakow
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing
David Ifeoluwa Adelani | Graham Neubig | Sebastian Ruder | Shruti Rijhwani | Michael Beukman | Chester Palen-Michel | Constantine Lignos | Jesujoba O. Alabi | Shamsuddeen H. Muhammad | Peter Nabende | Cheikh M. Bamba Dione | Andiswa Bukula | Rooweither Mabuya | Bonaventure F. P. Dossou | Blessing Sibanda | Happy Buzaaba | Jonathan Mukiibi | Godson Kalipe | Derguene Mbaye | Amelia Taylor | Fatoumata Kabore | Chris Chinenye Emezue | Anuoluwapo Aremu | Perez Ogayo | Catherine Gitau | Edwin Munkoh-Buabeng | Victoire Memdjokam Koagne | Allahsera Auguste Tapo | Tebogo Macucwa | Vukosi Marivate | Elvis Mboning | Tajuddeen Gwadabe | Tosin Adewumi | Orevaoghene Ahia | Joyce Nakatumba-Nabende | Neo L. Mokono | Ignatius Ezeani | Chiamaka Chukwuneke | Mofetoluwa Adeyemi | Gilles Q. Hacheme | Idris Abdulmumin | Odunayo Ogundepo | Oreen Yousuf | Tatiana Moteu Ngoli | Dietrich Klakow
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing
African languages are spoken by over a billion people, but they are under-represented in NLP research and development. Multiple challenges exist, including the limited availability of annotated training and evaluation datasets as well as the lack of understanding of which settings, languages, and recently proposed methods like cross-lingual transfer will be effective. In this paper, we aim to move towards solutions for these challenges, focusing on the task of named entity recognition (NER). We present the creation of the largest to-date human-annotated NER dataset for 20 African languages. We study the behaviour of state-of-the-art cross-lingual transfer methods in an Africa-centric setting, empirically demonstrating that the choice of source transfer language significantly affects performance. While much previous work defaults to using English as the source language, our results show that choosing the best transfer language improves zero-shot F1 scores by an average of 14% over 20 languages as compared to using English.
Search
Fix author
Co-authors
- Idris Abdulmumin 1
- David Ifeoluwa Adelani 1
- Tosin Adewumi 1
- Mofetoluwa Adeyemi 1
- Orevaoghene Ahia 1
- Jesujoba Alabi 1
- Anuoluwapo Aremu 1
- Michael Beukman 1
- Andiswa Bukula 1
- Happy Buzaaba 1
- Chiamaka Chukwuneke 1
- Cheikh M. Bamba Dione 1
- Bonaventure F. P. Dossou 1
- Chris Chinenye Emezue 1
- Ignatius Ezeani 1
- Catherine Gitau 1
- Tajuddeen Gwadabe 1
- Gilles Q. Hacheme 1
- Godson Kalipe 1
- Dietrich Klakow 1
- Constantine Lignos 1
- Rooweither Mabuya 1
- Tebogo Macucwa 1
- Vukosi Marivate 1
- Derguene Mbaye 1
- Elvis Mboning 1
- Victoire Memdjokam Koagne 1
- Neo L. Mokono 1
- Tatiana Moteu Ngoli 1
- Shamsuddeen Hassan Muhammad 1
- Jonathan Mukiibi 1
- Edwin Munkoh-Buabeng 1
- Peter Nabende 1
- Joyce Nakatumba-Nabende 1
- Graham Neubig 1
- Perez Ogayo 1
- Odunayo Ogundepo 1
- Chester Palen-Michel 1
- Shruti Rijhwani 1
- Sebastian Ruder 1
- Blessing Kudzaishe Sibanda 1
- Allahsera Auguste Tapo 1
- Amelia Taylor 1
- Oreen Yousuf 1