Evaluating LLM Performance in Character Analysis: A Study of Artificial Beings in Recent Korean Science Fiction

Woori Jang, Seohyon Jung


Abstract
Literary works present diverse and complex character behaviors, often implicit or intentionally obscured, making character analysis an inherently challenging task. This study explores LLMs’ capability to identify and interpret behaviors of artificial beings in 11 award-winning contemporary Korean science fiction short stories. Focusing on artificial beings as a distinct class of characters, rather than on conventional human characters, adds to the multi-layered complexity of analysis. We compared two LLMs, Claude 3.5 Sonnet and GPT-4o, with human experts using a custom eight-label system and a unique agreement metric developed to capture the cognitive intricacies of literary interpretation. Human inter-annotator agreement was around 50%, confirming the subjectivity of literary comprehension. LLMs differed from humans in selected text spans but demonstrated high agreement in label assignment for correctly identified spans. LLMs notably excelled at discerning ‘actions’ as semantic units rather than isolated grammatical components. This study reaffirms literary interpretation’s multifaceted nature while expanding the boundaries of NLP, contributing to discussions about AI’s capacity to understand and interpret creative works.
Anthology ID:
2024.nlp4dh-1.34
Volume:
Proceedings of the 4th International Conference on Natural Language Processing for Digital Humanities
Month:
November
Year:
2024
Address:
Miami, USA
Editors:
Mika Hämäläinen, Emily Öhman, So Miyagawa, Khalid Alnajjar, Yuri Bizzoni
Venue:
NLP4DH
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
339–351
Language:
URL:
https://aclanthology.org/2024.nlp4dh-1.34
DOI:
Bibkey:
Cite (ACL):
Woori Jang and Seohyon Jung. 2024. Evaluating LLM Performance in Character Analysis: A Study of Artificial Beings in Recent Korean Science Fiction. In Proceedings of the 4th International Conference on Natural Language Processing for Digital Humanities, pages 339–351, Miami, USA. Association for Computational Linguistics.
Cite (Informal):
Evaluating LLM Performance in Character Analysis: A Study of Artificial Beings in Recent Korean Science Fiction (Jang & Jung, NLP4DH 2024)
Copy Citation:
PDF:
https://aclanthology.org/2024.nlp4dh-1.34.pdf