Facilitating Long Context Understanding via Supervised Chain-of-Thought Reasoning

Jingyang Lin; Andy Wong; Tian Xia; Shenghua He; Hui Wei; Mei Han; Jiebo Luo

doi:10.18653/v1/2025.emnlp-main.615

Facilitating Long Context Understanding via Supervised Chain-of-Thought Reasoning

Jingyang Lin, Andy Wong, Tian Xia, Shenghua He, Hui Wei, Mei Han, Jiebo Luo

Abstract

Recent advances in Large Language Models (LLMs) have enabled them to process increasingly longer sequences, ranging from 2K to 2M tokens and even beyond. However, simply extending the input sequence length does not necessarily lead to effective long-context understanding. In this study, we integrate Chain-of-Thought (CoT) reasoning into LLMs in a supervised manner to facilitate effective long-context understanding. To achieve this, we introduce LongFinanceQA, a synthetic dataset in the financial domain designed to improve long-context reasoning. Unlike existing long-context synthetic data, LongFinanceQA includes intermediate CoT reasoning before the final conclusion, which encourages LLMs to perform explicit reasoning, improving accuracy and interpretability in long-context understanding. To generate synthetic CoT reasoning, we propose Property-based Agentic Inference (PAI), an agentic framework that simulates human-like reasoning steps, including property extraction, retrieval, and summarization. We evaluate PAI’s reasoning capabilities by assessing GPT-4o-mini w/ PAI on the Loong benchmark, outperforming standard GPT-4o-mini by 20.0%. Furthermore, we fine-tune LLaMA-3.1-8B-Instruct on LongFinanceQA, achieving a 28.0% gain on Loong’s financial subset.

Anthology ID:: 2025.emnlp-main.615
Volume:: Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing
Month:: November
Year:: 2025
Address:: Suzhou, China
Editors:: Christos Christodoulopoulos, Tanmoy Chakraborty, Carolyn Rose, Violet Peng
Venue:: EMNLP
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 12232–12248
Language:
URL:: https://aclanthology.org/2025.emnlp-main.615/
DOI:: 10.18653/v1/2025.emnlp-main.615
Bibkey:
Cite (ACL):: Jingyang Lin, Andy Wong, Tian Xia, Shenghua He, Hui Wei, Mei Han, and Jiebo Luo. 2025. Facilitating Long Context Understanding via Supervised Chain-of-Thought Reasoning. In Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, pages 12232–12248, Suzhou, China. Association for Computational Linguistics.
Cite (Informal):: Facilitating Long Context Understanding via Supervised Chain-of-Thought Reasoning (Lin et al., EMNLP 2025)
Copy Citation:
PDF:: https://aclanthology.org/2025.emnlp-main.615.pdf
Checklist:: 2025.emnlp-main.615.checklist.pdf

PDF Cite Search Checklist Fix data