The Role of Natural Language Processing Tasks in Automatic Literary Character Network Construction

Arthur Amalvy, Vincent Labatut, Richard Dufour


Abstract
The automatic extraction of character networks from literary texts is generally carried out using natural language processing (NLP) cascading pipelines. While this approach is widespread, no study exists on the impact of low-level NLP tasks on their performance. In this article, we conduct such a study on a literary dataset, focusing on the role of named entity recognition (NER) and coreference resolution when extracting co-occurrence networks. To highlight the impact of these tasks’ performance, we start with gold-standard annotations, progressively add uniformly distributed errors, and observe their impact in terms of character network quality. We demonstrate that NER performance depends on the tested novel and strongly affects character detection. We also show that NER-detected mentions alone miss a lot of character co-occurrences, and that coreference resolution is needed to prevent this. Finally, we present comparison points with 2 methods based on large language models (LLMs), including a fully end-to-end one, and show that these models are outperformed by traditional NLP pipelines in terms of recall.
Anthology ID:
2025.coling-main.566
Volume:
Proceedings of the 31st International Conference on Computational Linguistics
Month:
January
Year:
2025
Address:
Abu Dhabi, UAE
Editors:
Owen Rambow, Leo Wanner, Marianna Apidianaki, Hend Al-Khalifa, Barbara Di Eugenio, Steven Schockaert
Venue:
COLING
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
8462–8473
Language:
URL:
https://aclanthology.org/2025.coling-main.566/
DOI:
Bibkey:
Cite (ACL):
Arthur Amalvy, Vincent Labatut, and Richard Dufour. 2025. The Role of Natural Language Processing Tasks in Automatic Literary Character Network Construction. In Proceedings of the 31st International Conference on Computational Linguistics, pages 8462–8473, Abu Dhabi, UAE. Association for Computational Linguistics.
Cite (Informal):
The Role of Natural Language Processing Tasks in Automatic Literary Character Network Construction (Amalvy et al., COLING 2025)
Copy Citation:
PDF:
https://aclanthology.org/2025.coling-main.566.pdf