Boundary Matters: Leveraging Structured Text Plots for Long Text Outline Generation

Yuanchi Ma, Jiamou Liu, Hui He, Libo Zhang, Haoyuan Li, Zhendong Niu


Abstract
Outline generation aims to uncover the internal content structure of a document by identifying potential chapter connections and generating corresponding summaries. A robust outline generation model strives for coherence both between and within plots. However, existing methods perform well on short- and medium-length texts but struggle to generate readable outlines for very long texts (e.g., fictional literary works). The primary challenge lies in their inability to accurately segment plots within long texts. To address this issue, we propose a novel unsupervised guidance framework, LeStrTP, to guide large language model (LLM) outline generation. This framework ensures that each structured plot encapsulates complete causality by accurately identifying plot boundaries. Specifically, the LeStrTP framework constructs chapter-level graphs from long texts and learns their embeddings. Subsequently, chapter dependence is modeled as a Markov chain, and a unique search operator is designed to perform plot segmentation. To facilitate research on this task, we introduce a new annotated benchmark dataset, NovOutlineSet. Experimental results demonstrate that structured plots not only enhance the coherence and integrity of generated outlines but also significantly improve their quality.
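The pipeline outlined in the abstract (chapter embeddings, a Markov chain over chapters, then a search for plot boundaries) can be illustrated with a toy sketch. This is not LeStrTP itself: the abstract does not specify the graph construction, the embedding model, or the search operator, so the cosine-similarity weights, the row normalization, the fixed threshold, and all function names below are assumptions chosen only to make the idea concrete.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors (0.0 if either is zero)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def transition_matrix(embeddings):
    """Row-normalize non-negative chapter similarities into Markov transition
    probabilities. Hypothetical stand-in for the chapter-dependence model."""
    n = len(embeddings)
    rows = []
    for i in range(n):
        weights = [
            max(cosine(embeddings[i], embeddings[j]), 0.0) if i != j else 0.0
            for j in range(n)
        ]
        total = sum(weights)
        rows.append([w / total if total else 0.0 for w in weights])
    return rows


def segment_plots(embeddings, threshold):
    """Group consecutive chapters into plots, cutting wherever the probability
    of moving from chapter i to chapter i + 1 falls below `threshold`.
    A simple threshold scan, assumed here in place of the paper's search operator."""
    if not embeddings:
        return []
    probs = transition_matrix(embeddings)
    plots, current = [], [0]
    for i in range(len(embeddings) - 1):
        if probs[i][i + 1] < threshold:
            plots.append(current)  # close the current plot at a weak transition
            current = []
        current.append(i + 1)
    plots.append(current)
    return plots


# Four toy chapter embeddings: chapters 0-1 and 2-3 point in similar directions.
chapters = [[1.0, 0.0], [0.9, 0.1], [0.0, 1.0], [0.1, 0.9]]
print(segment_plots(chapters, threshold=0.3))  # [[0, 1], [2, 3]]
```

In this toy setup the transition from chapter 1 to chapter 2 has a low probability, so the scan places a single boundary there. Each resulting group of chapters would then be passed to an LLM for summarization.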
Anthology ID:
2025.findings-emnlp.4
Volume:
Findings of the Association for Computational Linguistics: EMNLP 2025
Month:
November
Year:
2025
Address:
Suzhou, China
Editors:
Christos Christodoulopoulos, Tanmoy Chakraborty, Carolyn Rose, Violet Peng
Venue:
Findings
Publisher:
Association for Computational Linguistics
Pages:
49–63
URL:
https://aclanthology.org/2025.findings-emnlp.4/
Cite (ACL):
Yuanchi Ma, Jiamou Liu, Hui He, Libo Zhang, Haoyuan Li, and Zhendong Niu. 2025. Boundary Matters: Leveraging Structured Text Plots for Long Text Outline Generation. In Findings of the Association for Computational Linguistics: EMNLP 2025, pages 49–63, Suzhou, China. Association for Computational Linguistics.
Cite (Informal):
Boundary Matters: Leveraging Structured Text Plots for Long Text Outline Generation (Ma et al., Findings 2025)
PDF:
https://aclanthology.org/2025.findings-emnlp.4.pdf
Checklist:
2025.findings-emnlp.4.checklist.pdf