Automatic Sanskrit Poetry Classification Based on Kāvyaguṇa

Amruta Barbadikar, Amba Kulkarni


Abstract
Kāvyaguṇa denotes the syntactic and phonetic attributes or qualities of Sanskrit poetry that enhance its artistic appeal, commonly classified into three categories: Mādhyurya (Sweetness), Oja (Floridity), and Prasāda (Lucidity). This paper presents the Kāvyaguṇa Classifier, a machine learning module, designed to classify Sanskrit literary texts into three distinct guṇas, by employing a diverse range of machine learning algorithms, including Random Forest, Gradient Boosting, XGBoost, Multi-Layer Perceptron and Support Vector Machine. For vectorization, we employed two methods: the neural network-based Word2vec and a custom feature engineering approach grounded in the theoretical understanding of Kāvyaguṇas as described in Sanskrit poetics. The feature engineering model significantly outperformed, achieving an accuracy of up to 90.6%
Anthology ID:
2024.icon-1.13
Volume:
Proceedings of the 21st International Conference on Natural Language Processing (ICON)
Month:
December
Year:
2024
Address:
AU-KBC Research Centre, Chennai, India
Editors:
Sobha Lalitha Devi, Karunesh Arora
Venue:
ICON
SIG:
Publisher:
NLP Association of India (NLPAI)
Note:
Pages:
109–119
Language:
URL:
https://aclanthology.org/2024.icon-1.13/
DOI:
Bibkey:
Cite (ACL):
Amruta Barbadikar and Amba Kulkarni. 2024. Automatic Sanskrit Poetry Classification Based on Kāvyaguṇa. In Proceedings of the 21st International Conference on Natural Language Processing (ICON), pages 109–119, AU-KBC Research Centre, Chennai, India. NLP Association of India (NLPAI).
Cite (Informal):
Automatic Sanskrit Poetry Classification Based on Kāvyaguṇa (Barbadikar & Kulkarni, ICON 2024)
Copy Citation:
PDF:
https://aclanthology.org/2024.icon-1.13.pdf