MalUpama - Figurative Language Identification in Malayalam -An Experimental Study

Reenu Paul, Wincy Abraham, Anitha S. Pillai


Abstract
Figurative language, particularly in underrepresented languages within the Dravidian family, serves as a critical medium for conveying emotions and cultural meaning. Despite the rich literary traditions of languages such as Malayalam, Tamil, Telugu, and Kannada, there has been minimal progress in developing computational techniques to analyze figurative expressions. Historically, Malayalam was known by various names, such as Malayanma and Malabari. Similarly, Kerala was known as Malanadu before adopting its current name, which metaphorically refers to the land between the Indian Ocean and the Western Ghats. In this study, we introduce the UPAMA Model (MalUpama), designed to identify similes in Malayalam, an under-resourced Dravidian language spoken mainly in the southern Indian state of Kerala. The current research focuses on detecting the presence of similes in Malayalam prose using the ‘Upama model’. The paper outlines how similes are detected in Malayalam sentences, and the proposed method achieves a detection accuracy of 94.5%. To the best of our knowledge, this is the first work in the Malayalam language that explores computational techniques, with a particular focus on applying machine learning, to analyze figurative expressions, and the approach can be adopted for other Dravidian languages too. The dataset developed for this study is made publicly available, allowing scholars to contribute to and further explore the ‘Upama’ category of figures of speech (‘Alankarangal’) in Malayalam.
Anthology ID:
2024.icon-1.41
Volume:
Proceedings of the 21st International Conference on Natural Language Processing (ICON)
Month:
December
Year:
2024
Address:
AU-KBC Research Centre, Chennai, India
Editors:
Sobha Lalitha Devi, Karunesh Arora
Venue:
ICON
Publisher:
NLP Association of India (NLPAI)
Pages:
357–367
URL:
https://aclanthology.org/2024.icon-1.41/
Cite (ACL):
Reenu Paul, Wincy Abraham, and Anitha S. Pillai. 2024. MalUpama - Figurative Language Identification in Malayalam -An Experimental Study. In Proceedings of the 21st International Conference on Natural Language Processing (ICON), pages 357–367, AU-KBC Research Centre, Chennai, India. NLP Association of India (NLPAI).
Cite (Informal):
MalUpama - Figurative Language Identification in Malayalam -An Experimental Study (Paul et al., ICON 2024)
PDF:
https://aclanthology.org/2024.icon-1.41.pdf