@inproceedings{sunny-etal-2025-stories,
title = "From Stories to Statistics: Methodological Biases in {LLM}-Based Narrative Flow Quantification",
author = "Sunny, Amal and
Gupta, Advay and
Chandak, Yashashree and
Sreekumar, Vishnu",
editor = "Boleda, Gemma and
Roth, Michael",
booktitle = "Proceedings of the 29th Conference on Computational Natural Language Learning",
month = jul,
year = "2025",
address = "Vienna, Austria",
publisher = "Association for Computational Linguistics",
url = "https://aclanthology.org/2025.conll-1.14/",
doi = "10.18653/v1/2025.conll-1.14",
pages = "201--215",
ISBN = "979-8-89176-271-8",
abstract = "Large Language Models (LLMs) have made significant contributions to cognitive science research. One area of application is narrative understanding. Sap et al. (2022) introduced $\textit{sequentiality}$, an LLM-derived measure that assesses the coherence of a story based on word probability distributions. They reported that recalled stories flowed less sequentially than imagined stories. However, the robustness and generalizability of this narrative flow measure remain unverified. To assess generalizability, we apply $\textit{sequentiality}$ derived from three different LLMs to a new dataset of matched autobiographical and biographical paragraphs. Contrary to previous results, we fail to find a significant difference in narrative flow between autobiographies and biographies. Further investigation reveals biases in the original data collection process, where topic selection systematically influences sequentiality scores. Adjusting for these biases substantially reduces the originally reported effect size. A validation exercise using LLM-generated stories with ``good'' and ``poor'' flow further highlights the flaws in the original formulation of sequentiality. Our findings suggest that LLM-based narrative flow quantification is susceptible to methodological artifacts. Finally, we provide some suggestions for modifying the $\textit{sequentiality}$ formula to accurately capture narrative flow."
}

<?xml version="1.0" encoding="UTF-8"?>
<modsCollection xmlns="http://www.loc.gov/mods/v3">
<mods ID="sunny-etal-2025-stories">
<titleInfo>
<title>From Stories to Statistics: Methodological Biases in LLM-Based Narrative Flow Quantification</title>
</titleInfo>
<name type="personal">
<namePart type="given">Amal</namePart>
<namePart type="family">Sunny</namePart>
<role>
<roleTerm authority="marcrelator" type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Advay</namePart>
<namePart type="family">Gupta</namePart>
<role>
<roleTerm authority="marcrelator" type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Yashashree</namePart>
<namePart type="family">Chandak</namePart>
<role>
<roleTerm authority="marcrelator" type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Vishnu</namePart>
<namePart type="family">Sreekumar</namePart>
<role>
<roleTerm authority="marcrelator" type="text">author</roleTerm>
</role>
</name>
<originInfo>
<dateIssued>2025-07</dateIssued>
</originInfo>
<typeOfResource>text</typeOfResource>
<relatedItem type="host">
<titleInfo>
<title>Proceedings of the 29th Conference on Computational Natural Language Learning</title>
</titleInfo>
<name type="personal">
<namePart type="given">Gemma</namePart>
<namePart type="family">Boleda</namePart>
<role>
<roleTerm authority="marcrelator" type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Michael</namePart>
<namePart type="family">Roth</namePart>
<role>
<roleTerm authority="marcrelator" type="text">editor</roleTerm>
</role>
</name>
<originInfo>
<publisher>Association for Computational Linguistics</publisher>
<place>
<placeTerm type="text">Vienna, Austria</placeTerm>
</place>
</originInfo>
<genre authority="marcgt">conference publication</genre>
<identifier type="isbn">979-8-89176-271-8</identifier>
</relatedItem>
<abstract>Large Language Models (LLMs) have made significant contributions to cognitive science research. One area of application is narrative understanding. Sap et al. (2022) introduced sequentiality, an LLM-derived measure that assesses the coherence of a story based on word probability distributions. They reported that recalled stories flowed less sequentially than imagined stories. However, the robustness and generalizability of this narrative flow measure remain unverified. To assess generalizability, we apply sequentiality derived from three different LLMs to a new dataset of matched autobiographical and biographical paragraphs. Contrary to previous results, we fail to find a significant difference in narrative flow between autobiographies and biographies. Further investigation reveals biases in the original data collection process, where topic selection systematically influences sequentiality scores. Adjusting for these biases substantially reduces the originally reported effect size. A validation exercise using LLM-generated stories with “good” and “poor” flow further highlights the flaws in the original formulation of sequentiality. Our findings suggest that LLM-based narrative flow quantification is susceptible to methodological artifacts. Finally, we provide some suggestions for modifying the sequentiality formula to accurately capture narrative flow.</abstract>
<identifier type="citekey">sunny-etal-2025-stories</identifier>
<identifier type="doi">10.18653/v1/2025.conll-1.14</identifier>
<location>
<url>https://aclanthology.org/2025.conll-1.14/</url>
</location>
<part>
<date>2025-07</date>
<extent unit="page">
<start>201</start>
<end>215</end>
</extent>
</part>
</mods>
</modsCollection>

%0 Conference Proceedings
%T From Stories to Statistics: Methodological Biases in LLM-Based Narrative Flow Quantification
%A Sunny, Amal
%A Gupta, Advay
%A Chandak, Yashashree
%A Sreekumar, Vishnu
%Y Boleda, Gemma
%Y Roth, Michael
%S Proceedings of the 29th Conference on Computational Natural Language Learning
%D 2025
%8 July
%I Association for Computational Linguistics
%C Vienna, Austria
%@ 979-8-89176-271-8
%F sunny-etal-2025-stories
%X Large Language Models (LLMs) have made significant contributions to cognitive science research. One area of application is narrative understanding. Sap et al. (2022) introduced sequentiality, an LLM-derived measure that assesses the coherence of a story based on word probability distributions. They reported that recalled stories flowed less sequentially than imagined stories. However, the robustness and generalizability of this narrative flow measure remain unverified. To assess generalizability, we apply sequentiality derived from three different LLMs to a new dataset of matched autobiographical and biographical paragraphs. Contrary to previous results, we fail to find a significant difference in narrative flow between autobiographies and biographies. Further investigation reveals biases in the original data collection process, where topic selection systematically influences sequentiality scores. Adjusting for these biases substantially reduces the originally reported effect size. A validation exercise using LLM-generated stories with “good” and “poor” flow further highlights the flaws in the original formulation of sequentiality. Our findings suggest that LLM-based narrative flow quantification is susceptible to methodological artifacts. Finally, we provide some suggestions for modifying the sequentiality formula to accurately capture narrative flow.
%R 10.18653/v1/2025.conll-1.14
%U https://aclanthology.org/2025.conll-1.14/
%U https://doi.org/10.18653/v1/2025.conll-1.14
%P 201-215

Markdown (Informal)
[From Stories to Statistics: Methodological Biases in LLM-Based Narrative Flow Quantification](https://aclanthology.org/2025.conll-1.14/) (Sunny et al., CoNLL 2025)

ACL
Amal Sunny, Advay Gupta, Yashashree Chandak, and Vishnu Sreekumar. 2025. [From Stories to Statistics: Methodological Biases in LLM-Based Narrative Flow Quantification](https://aclanthology.org/2025.conll-1.14/). In Proceedings of the 29th Conference on Computational Natural Language Learning, pages 201–215, Vienna, Austria. Association for Computational Linguistics.