2020
pdf
bib
abs
Part-of-Speech Annotation Challenges in Marathi
Gajanan Rane
|
Nilesh Joshi
|
Geetanjali Rane
|
Hanumant Redkar
|
Malhar Kulkarni
|
Pushpak Bhattacharyya
Proceedings of the WILDRE5– 5th Workshop on Indian Language Data: Resources and Evaluation
Part of Speech (POS) annotation is a significant challenge in natural language processing. The paper discusses issues and challenges faced in the process of POS annotation of the Marathi data from four domains viz., tourism, health, entertainment and agriculture. During POS annotation, a lot of issues were encountered. Some of the major ones are discussed in detail in this paper. Also, the two approaches viz., the lexical (L approach) and the functional (F approach) of POS tagging have been discussed and presented with examples. Further, some ambiguous cases in POS annotation are presented in the paper.
2019
pdf
bib
Introduction to Sanskrit Shabdamitra: An Educational Application of Sanskrit Wordnet
Malhar Kulkarni
|
Nilesh Joshi
|
Sayali Khare
|
Hanumant Redkar
|
Pushpak Bhattacharyya
Proceedings of the 6th International Sanskrit Computational Linguistics Symposium
2018
pdf
bib
abs
Hindi Wordnet for Language Teaching: Experiences and Lessons Learnt
Hanumant Redkar
|
Rajita Shukla
|
Sandhya Singh
|
Jaya Saraswati
|
Laxmi Kashyap
|
Diptesh Kanojia
|
Preethi Jyothi
|
Malhar Kulkarni
|
Pushpak Bhattacharyya
Proceedings of the 9th Global Wordnet Conference
This paper reports the work related to making Hindi Wordnet1 available as a digital resource for language learning and teaching, and the experiences and lessons that were learnt during the process. The language data of the Hindi Wordnet has been suitably modified and enhanced to make it into a language learning aid. This aid is based on modern pedagogical axioms and is aligned to the learning objectives of the syllabi of the school education in India. To make it into a comprehensive language tool, grammatical information has also been encoded, as far as these can be marked on the lexical items. The delivery of information is multi-layered, multi-sensory and is available across multiple digital platforms. The front end has been designed to offer an eye-catching user-friendly interface which is suitable for learners starting from age six onward. Preliminary testing of the tool has been done and it has been modified as per the feedbacks that were received. Above all, the entire exercise has offered gainful insights into learning based on associative networks and how knowledge based on such networks can be made available to modern learners.
2017
pdf
bib
abs
Hindi Shabdamitra: A Wordnet based E-Learning Tool for Language Learning and Teaching
Hanumant Redkar
|
Sandhya Singh
|
Meenakshi Somasundaram
|
Dhara Gorasia
|
Malhar Kulkarni
|
Pushpak Bhattacharyya
Proceedings of the 4th Workshop on Natural Language Processing Techniques for Educational Applications (NLPTEA 2017)
In today’s technology driven digital era, education domain is undergoing a transformation from traditional approaches to more learner controlled and flexible methods of learning. This transformation has opened the new avenues for interdisciplinary research in the field of educational technology and natural language processing in developing quality digital aids for learning and teaching. The tool presented here - Hindi Shabhadamitra, developed using Hindi Wordnet for Hindi language learning, is one such e-learning tool. It has been developed as a teaching and learning aid suitable for formal school based curriculum and informal setup for self learning users. Besides vocabulary, it also provides word based grammar along with images and pronunciation for better learning and retention. This aid demonstrates that how a rich lexical resource like wordnet can be systematically remodeled for practical usage in the educational domain.
pdf
bib
Hindi Shabdamitra: A Wordnet based E-Learning Tool for Language Learning and Teaching
Hanumant Redkar
|
Sandhya Singh
|
Dhara Gorasia
|
Meenakshi Somasundaram
|
Malhar Kulkarni
|
Pushpak Bhattacharyya
Proceedings of the 14th International Conference on Natural Language Processing (ICON-2017)
2016
pdf
bib
Verbframator:Semi-Automatic Verb Frame Annotator Tool with Special Reference to Marathi
Hanumant Redkar
|
Sandhya Singh
|
Nandini Ghag
|
Jai Paranjape
|
Nilesh Joshi
|
Malhar Kulkarni
|
Pushpak Bhattacharyya
Proceedings of the 13th International Conference on Natural Language Processing
pdf
bib
abs
IndoWordNet::Similarity- Computing Semantic Similarity and Relatedness using IndoWordNet
Sudha Bhingardive
|
Hanumant Redkar
|
Prateek Sappadla
|
Dhirendra Singh
|
Pushpak Bhattacharyya
Proceedings of the 8th Global WordNet Conference (GWC)
Semantic similarity and relatedness measures play an important role in natural language processing applications. In this paper, we present the IndoWordNet::Similarity tool and interface, designed for computing the semantic similarity and relatedness between two words in IndoWordNet. A java based tool and a web interface have been developed to compute this semantic similarity and relatedness. Also, Java API has been developed for this purpose. This tool, web interface and the API are made available for the research purpose.
pdf
bib
abs
Samāsa-Kartā: An Online Tool for Producing Compound Words using IndoWordNet
Hanumant Redkar
|
Nilesh Joshi
|
Sandhya Singh
|
Irawati Kulkarni
|
Malhar Kulkarni
|
Pushpak Bhattacharyya
Proceedings of the 8th Global WordNet Conference (GWC)
Samāsa or compounds are a regular feature of Indian Languages. They are also found in other languages like German, Italian, French, Russian, Spanish, etc. Compound word is constructed from two or more words to form a single word. The meaning of this word is derived from each of the individual words of the compound. To develop a system to generate, identify and interpret compounds, is an important task in Natural Language Processing. This paper introduces a web based tool - Samāsa-Kartā for producing compound words. Here, the focus is on Sanskrit language due to its richness in usage of compounds; however, this approach can be applied to any Indian language as well as other languages. IndoWordNet is used as a resource for words to be compounded. The motivation behind creating compound words is to create, to improve the vocabulary, to reduce sense ambiguity, etc. in order to enrich the WordNet. The Samāsa-Kartā can be used for various applications viz., compound categorization, sandhi creation, morphological analysis, paraphrasing, synset creation, etc.
2015
pdf
bib
Unsupervised Most Frequent Sense Detection using Word Embeddings
Sudha Bhingardive
|
Dhirendra Singh
|
Rudramurthy V
|
Hanumant Redkar
|
Pushpak Bhattacharyya
Proceedings of the 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
pdf
bib
IndoWordNet Dictionary: An Online Multilingual Dictionary using IndoWordNet
Hanumant Redkar
|
Sandhya Singh
|
Nilesh Joshi
|
Anupam Ghosh
|
Pushpak Bhattacharyya
Proceedings of the 12th International Conference on Natural Language Processing
2014
pdf
bib
Introduction to Synskarta: An Online Interface for Synset Creation with Special Reference to Sanskrit
Hanumant Redkar
|
Jai Paranjape
|
Nilesh Joshi
|
Irawati Kulkarni
|
Malhar Kulkarni
|
Pushpak Bhattacharyya
Proceedings of the 11th International Conference on Natural Language Processing
2012
pdf
bib
An Efficient Database Design for IndoWordNet Development Using Hybrid Approach
Venkatesh Prabhu
|
Shilpa Desai
|
Hanumant Redkar
|
Neha Prabhugaonkar
|
Apurva Nagvenkar
|
Ramdas Karmali
Proceedings of the 3rd Workshop on South and Southeast Asian Natural Language Processing