Word Sense Disambiguation for Marathi language using Supervised Learning

Ransing Rasika, Gulati Archana


Abstract
The task of disambiguating word senses, often referred to as Word Sense Disambiguation (WSD), is a substantial difficulty in the realm of natural language processing. Marathi is widely acknowledged as a language that has a relatively restricted range of resources. Consequently, there has been a paucity of academic research undertaken on the Marathi language. There has been little research conducted on supervised learning for Marathi Word Sense Disambiguation (WSD) mostly owing to the scarcity of sense-annotated corpora. This work aims to construct a sense-annotated corpus for the Marathi language and further use supervised learning classifiers, such as Naïve Bayes, Support Vector Machine, Random Forest, and Logistic Regression, to disambiguate polysemous words in Marathi. The performance of these classifiers is evaluated.
Anthology ID:
2023.icon-1.76
Volume:
Proceedings of the 20th International Conference on Natural Language Processing (ICON)
Month:
December
Year:
2023
Address:
Goa University, Goa, India
Editors:
D. Pawar Jyoti, Lalitha Devi Sobha
Venue:
ICON
SIG:
SIGLEX
Publisher:
NLP Association of India (NLPAI)
Note:
Pages:
754–759
Language:
URL:
https://aclanthology.org/2023.icon-1.76
DOI:
Bibkey:
Cite (ACL):
Ransing Rasika and Gulati Archana. 2023. Word Sense Disambiguation for Marathi language using Supervised Learning. In Proceedings of the 20th International Conference on Natural Language Processing (ICON), pages 754–759, Goa University, Goa, India. NLP Association of India (NLPAI).
Cite (Informal):
Word Sense Disambiguation for Marathi language using Supervised Learning (Rasika & Archana, ICON 2023)
Copy Citation:
PDF:
https://aclanthology.org/2023.icon-1.76.pdf