Assessing the State of the Art in Scene Segmentation

Albin Zehe; Elisabeth Fischer; Andreas Hotho

doi:10.18653/v1/2025.naacl-long.500

Assessing the State of the Art in Scene Segmentation

Albin Zehe, Elisabeth Fischer, Andreas Hotho

Abstract

The detection of scenes in literary texts is a recently introduced segmentation task in computational literary studies. Its goal is to partition a fictional text into segments that are coherent across the dimensions time, space, action and character constellation. This task is very challenging for automatic methods, since it requires a high-level understanding of the text. In this paper, we provide a thorough analysis of the State of the Art and challenges in this task, identifying and solving a problem in the training procedure for previous approaches, analysing the generalisation capabilities of the models and comparing the BERT-based SotA to current Llama models, as well as providing an analysis of what causes errors in the models. Our change in training procedure provides a significant increase in performance. We find that Llama-based models are more robust to different types of texts, while their overall performance is slightly worse than that of BERT-based models.

Anthology ID:: 2025.naacl-long.500
Volume:: Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers)
Month:: April
Year:: 2025
Address:: Albuquerque, New Mexico
Editors:: Luis Chiruzzo, Alan Ritter, Lu Wang
Venue:: NAACL
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 9922–9941
Language:
URL:: https://aclanthology.org/2025.naacl-long.500/
DOI:: 10.18653/v1/2025.naacl-long.500
Bibkey:
Cite (ACL):: Albin Zehe, Elisabeth Fischer, and Andreas Hotho. 2025. Assessing the State of the Art in Scene Segmentation. In Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), pages 9922–9941, Albuquerque, New Mexico. Association for Computational Linguistics.
Cite (Informal):: Assessing the State of the Art in Scene Segmentation (Zehe et al., NAACL 2025)
Copy Citation:
PDF:: https://aclanthology.org/2025.naacl-long.500.pdf

PDF Cite Search Fix data