Samāsa-Kartā: An Online Tool for Producing Compound Words using IndoWordNet

Hanumant Redkar, Nilesh Joshi, Sandhya Singh, Irawati Kulkarni, Malhar Kulkarni, Pushpak Bhattacharyya


Abstract
Samāsa, or compounds, are a regular feature of Indian languages. They are also found in other languages such as German, Italian, French, Russian, and Spanish. A compound word is constructed from two or more words that combine into a single word, and its meaning is derived from the individual words of the compound. Developing a system to generate, identify, and interpret compounds is an important task in Natural Language Processing. This paper introduces Samāsa-Kartā, a web-based tool for producing compound words. The focus here is on Sanskrit because of its rich use of compounds; however, the approach can be applied to any Indian language as well as to other languages. IndoWordNet is used as the resource for the words to be compounded. The motivation behind creating compound words is to create and improve vocabulary and to reduce sense ambiguity, among other goals, in order to enrich the WordNet. Samāsa-Kartā can be used for various applications, such as compound categorization, sandhi creation, morphological analysis, paraphrasing, and synset creation.
Anthology ID:
2016.gwc-1.46
Volume:
Proceedings of the 8th Global WordNet Conference (GWC)
Month:
27–30 January
Year:
2016
Address:
Bucharest, Romania
Editors:
Christiane Fellbaum, Piek Vossen, Verginica Barbu Mititelu, Corina Forascu
Venue:
GWC
SIG:
SIGLEX
Publisher:
Global Wordnet Association
Pages:
325–332
URL:
https://aclanthology.org/2016.gwc-1.46
Cite (ACL):
Hanumant Redkar, Nilesh Joshi, Sandhya Singh, Irawati Kulkarni, Malhar Kulkarni, and Pushpak Bhattacharyya. 2016. Samāsa-Kartā: An Online Tool for Producing Compound Words using IndoWordNet. In Proceedings of the 8th Global WordNet Conference (GWC), pages 325–332, Bucharest, Romania. Global Wordnet Association.
Cite (Informal):
Samāsa-Kartā: An Online Tool for Producing Compound Words using IndoWordNet (Redkar et al., GWC 2016)
PDF:
https://aclanthology.org/2016.gwc-1.46.pdf