Tshephisho Joseph Sefara
2021
Practical Approach on Implementation of WordNets for South African Languages
Tshephisho Joseph Sefara | Tumisho Billson Mokgonyane | Vukosi Marivate
Proceedings of the 11th Global Wordnet Conference
This paper proposes the implementation of WordNets for five South African languages, namely Sepedi, Setswana, Tshivenda, isiZulu and isiXhosa, to be added to the Open Multilingual WordNet (OMW) in the Natural Language Toolkit (NLTK). The African WordNets are converted from Princeton WordNet (PWN) 2.0 to 3.0 to match the synsets in PWN 3.0. After conversion, there were 7157, 11972, 1288, 6380, and 9460 lemmas for Sepedi, Setswana, Tshivenda, isiZulu and isiXhosa, respectively. Setswana, isiXhosa and Sepedi each contain more lemmas than 8 languages in OMW, and isiZulu contains more lemmas than 7 languages in OMW. A library has been published for continuous development of African WordNets in OMW using NLTK.
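The abstract does not spell out how synsets are carried from PWN 2.0 to PWN 3.0, so the following is only a minimal sketch of one way such a step could look, assuming a lookup table from old synset identifiers to new ones is already available. The function names `remap_synsets` and `count_lemmas`, the identifiers, and the lemmas are all hypothetical placeholders for illustration, not real PWN offsets or data from the paper.

```python
from collections import defaultdict


def remap_synsets(entries, offset_map):
    """Re-key (synset_id, lemma) pairs using an old-to-new id mapping.

    entries: iterable of (synset_id, lemma) pairs keyed by the old ids.
    offset_map: dict from old synset id to new synset id (assumed given).
    Returns (dict of new synset id -> set of lemmas, list of unmapped old ids).
    """
    remapped = defaultdict(set)
    unmapped = []
    for synset_id, lemma in entries:
        new_id = offset_map.get(synset_id)
        if new_id is None:
            # No counterpart in the target version: report it instead of guessing.
            unmapped.append(synset_id)
            continue
        remapped[new_id].add(lemma)
    return dict(remapped), unmapped


def count_lemmas(remapped):
    """Count distinct lemmas across all remapped synsets."""
    return len({lemma for lemmas in remapped.values() for lemma in lemmas})


# Placeholder data only: these ids and lemmas are invented for the example.
entries = [
    ("old-0001-n", "lemma_a"),
    ("old-0001-n", "lemma_b"),
    ("old-0002-v", "lemma_a"),
    ("old-0003-n", "lemma_c"),
]
offset_map = {"old-0001-n": "new-0101-n", "old-0002-v": "new-0202-v"}

remapped, unmapped = remap_synsets(entries, offset_map)
print(count_lemmas(remapped), unmapped)
```

Entries whose synset has no counterpart in the mapping are collected separately rather than dropped silently, so the lemma count after conversion can be checked against what was lost.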
2019
Yorùbá Gender Recognition from Speech using Attention-based BiLSTM
Ibukunola Abosede Modupe | Tshephisho Joseph Sefara | Ojo Sunday
Proceedings of the First International Workshop on NLP Solutions for Under Resourced Languages (NSURL 2019) co-located with ICNLSP 2019 - Short Papers