Do LLMs Encode Functional Importance of Reasoning Tokens ?

Janvijay Singh; Dilek Hakkani-Tur

Do LLMs Encode Functional Importance of Reasoning Tokens ?

Abstract

Large language models solve complex tasks by generating long reasoning chains, achieving higher accuracy at the cost of increased computational cost and reduced ability to isolate functionally relevant reasoning. Prior work on compact reasoning shortens such chains through probabilistic sampling, heuristics, or supervision from frontier models, but offers limited insight into whether models internally encode token-level functional importance for answer generation. We address this gap diagnostically and propose greedy pruning, a likelihood-preserving deletion procedure that iteratively removes reasoning tokens whose removal minimally degrades model likelihood under a specified objective, yielding length-controlled reasoning chains. We evaluate pruned reasoning in a distillation framework and show that students trained on pruned chains outperform a frontier-model–supervised compression baseline at matched reasoning lengths. Finally, our analysis reveals systematic pruning patterns and shows that attention scores can predict greedy pruning ranks, further suggesting that models encode a nontrivial functional importance structure over reasoning tokens.

Anthology ID:: 2026.acl-long.1419
Volume:: Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Month:: July
Year:: 2026
Address:: San Diego, California, United States
Editors:: Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:: ACL
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 30749–30773
Language:
URL:: https://aclanthology.org/2026.acl-long.1419/
DOI:
Bibkey:
Cite (ACL):: Janvijay Singh and Dilek Hakkani-Tür. 2026. Do LLMs Encode Functional Importance of Reasoning Tokens ?. In Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 30749–30773, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):: Do LLMs Encode Functional Importance of Reasoning Tokens ? (Singh & Hakkani-Tür, ACL 2026)
Copy Citation:
PDF:: https://aclanthology.org/2026.acl-long.1419.pdf
Checklist:: 2026.acl-long.1419.checklist.pdf

PDF Cite Search Checklist Fix data