Modeling Moravian Memoirs: Ternary Sentiment Analysis in a Low Resource Setting

Patrick Brookshire, Nils Reiter


Abstract
The Moravians are a Christian group that has emerged from a 15th century movement. In this paper, we investigate how memoirs written by the devotees of this group can be analyzed with methods from computational linguistics, in particular sentiment analysis. To this end, we experiment with two different fine-tuning strategies and find that the best performance for ternary sentiment analysis (81% accuracy) is achieved by fine-tuning a German BERT model, outperforming in particular models trained on much larger German sentiment datasets. We further investigate the model(s) using SHAP scores and find that the best performing model struggles with multiple negations and mixed statements. Finally, we show two application scenarios motivated by research questions from religious studies.
Anthology ID:
2024.latechclfl-1.10
Volume:
Proceedings of the 8th Joint SIGHUM Workshop on Computational Linguistics for Cultural Heritage, Social Sciences, Humanities and Literature (LaTeCH-CLfL 2024)
Month:
March
Year:
2024
Address:
St. Julians, Malta
Editors:
Yuri Bizzoni, Stefania Degaetano-Ortlieb, Anna Kazantseva, Stan Szpakowicz
Venues:
LaTeCHCLfL | WS
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
91–100
Language:
URL:
https://aclanthology.org/2024.latechclfl-1.10
DOI:
Bibkey:
Cite (ACL):
Patrick Brookshire and Nils Reiter. 2024. Modeling Moravian Memoirs: Ternary Sentiment Analysis in a Low Resource Setting. In Proceedings of the 8th Joint SIGHUM Workshop on Computational Linguistics for Cultural Heritage, Social Sciences, Humanities and Literature (LaTeCH-CLfL 2024), pages 91–100, St. Julians, Malta. Association for Computational Linguistics.
Cite (Informal):
Modeling Moravian Memoirs: Ternary Sentiment Analysis in a Low Resource Setting (Brookshire & Reiter, LaTeCHCLfL-WS 2024)
Copy Citation:
PDF:
https://aclanthology.org/2024.latechclfl-1.10.pdf