Shabdocchar: Konkani WordNet Enrichment with Audio Feature

Sunayana R. Gawde, Shrikrishna R. Parab, Jayram Ulhas Gawas, Shilpa Neenad Desai, Jyoti Pawar


Abstract
Konkani WordNet, also called Konkani Shabdamalem, was created as part of the Indradhanush WordNet Project Consortium between August 2010 and October 2013. Currently, the Konkani WordNet includes about 32,370 synsets and 37,719 unique words. There is a need to enhance the Konkani WordNet both quantitatively and qualitatively. In this paper, we present a game-based crowdsourcing approach that we adopted to add an audio feature to the Konkani WordNet. The approach has increased the number of users who use, and are exposed to, the Konkani WordNet's capabilities for aiding the Konkani language teaching-learning process and for creating resources to initiate further research. Our work has resulted in an audio corpus of 37,719 unique words, which we have named ‘Shabdocchar’, created within a short time span of four months and covering five dialects of Konkani. We are confident that Shabdocchar will prove to be a very useful resource for future research on dialects of Konkani and for voice-based search of words in the wordnet. This approach can be adopted to enhance other wordnets as well.
Anthology ID:
2024.icon-1.62
Volume:
Proceedings of the 21st International Conference on Natural Language Processing (ICON)
Month:
December
Year:
2024
Address:
AU-KBC Research Centre, Chennai, India
Editors:
Sobha Lalitha Devi, Karunesh Arora
Venue:
ICON
Publisher:
NLP Association of India (NLPAI)
Pages:
531–536
URL:
https://aclanthology.org/2024.icon-1.62/
Cite (ACL):
Sunayana R. Gawde, Shrikrishna R. Parab, Jayram Ulhas Gawas, Shilpa Neenad Desai, and Jyoti Pawar. 2024. Shabdocchar: Konkani WordNet Enrichment with Audio Feature. In Proceedings of the 21st International Conference on Natural Language Processing (ICON), pages 531–536, AU-KBC Research Centre, Chennai, India. NLP Association of India (NLPAI).
Cite (Informal):
Shabdocchar: Konkani WordNet Enrichment with Audio Feature (Gawde et al., ICON 2024)
PDF:
https://aclanthology.org/2024.icon-1.62.pdf