Automatic Authorship Classification for German Lyrics Using Naïve Bayes

Akshay Mendhakar, Mesian Tilmatine


Abstract
Text classification is a prevalent and essential machine-learning task. Machine learning classifiers have developed immensely since their inception. The naïve Bayes classifier is one of the most prominent supervised machine learning classifiers. In this experiment, we highlight the performance of Naïve Bayes for classifying of authors/artists on the German lyrics corpus (“Songkorpus”) and compare the classification results with other classifier algorithms. The corpus of investigation consists of six artists with 970 songs in total. Bayes model evaluation measures revealed a precision of 0.91, recall of 0.94, and F1-measure of 0.9. Furthermore, the classification performance with other classifier algorithms did not reveal any statistically significant difference in performance. The results of the study add to the high volume of reports on the classification accuracy of Naive Bayes for the task of lyrical classification.
Anthology ID:
2023.jlcl-1.9
Volume:
Journal for Language Technology and Computational Linguistics, Vol. 36 No. 1
Month:
May
Year:
2023
Address:
unknown
Editors:
Roman Schneider, Faaß Gertrud
Venue:
JLCL
SIG:
Publisher:
German Society for Computational Lingustics and Language Technology
Note:
Pages:
171–182
Language:
URL:
https://aclanthology.org/2023.jlcl-1.9
DOI:
10.21248/jlcl.36.2023.242
Bibkey:
Cite (ACL):
Akshay Mendhakar and Mesian Tilmatine. 2023. Automatic Authorship Classification for German Lyrics Using Naïve Bayes. In Journal for Language Technology and Computational Linguistics, Vol. 36 No. 1, pages 171–182, unknown. German Society for Computational Lingustics and Language Technology.
Cite (Informal):
Automatic Authorship Classification for German Lyrics Using Naïve Bayes (Mendhakar & Tilmatine, JLCL 2023)
Copy Citation:
PDF:
https://aclanthology.org/2023.jlcl-1.9.pdf