Look Before You Leap: A Lookahead Reasoning Quality Gate for Speculative Decoding

Hiroaki Kingetsu; Kaoru Yokoo; Kenji Fukumizu; Manohar Kaul

Look Before You Leap: A Lookahead Reasoning Quality Gate for Speculative Decoding

Hiroaki Kingetsu, Kaoru Yokoo, Kenji Fukumizu, Manohar Kaul

Abstract

We present a lookahead quality gate (verifier) for speculative decoding for reasoning or chain-of-thought language models. The gate accepts the longest reliable prefix of each k-token lookahead (block-wise) draft. Unlike token-level likelihood search, which is myopic and often rewards verbosity, or tree-level sampling methods that trade accuracy for latency, our approach works at an intermediate granularity. It uses only the base model’s hidden states to compute a geometry-based quality score for each prefix, then accepts the longest prefix whose score exceeds a quantile-calibrated threshold estimated from unlabeled prompts. The method integrates seamlessly with speculative/blockwise decoding and adds minimal runtime overhead, requiring no auxiliary heads, reward models, or finetuning. On math and science benchmarks, it improves accuracy over sampling baselines while achieving 2.6-7.9× faster generation.

Anthology ID:: 2026.eacl-long.367
Volume:: Proceedings of the 19th Conference of the European Chapter of the Association for Computational Linguistics (Volume 1: Long Papers)
Month:: March
Year:: 2026
Address:: Rabat, Morocco
Editors:: Vera Demberg, Kentaro Inui, Lluís Marquez
Venue:: EACL
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 7831–7847
Language:
URL:: https://aclanthology.org/2026.eacl-long.367/
DOI:
Bibkey:
Cite (ACL):: Hiroaki Kingetsu, Kaoru Yokoo, Kenji Fukumizu, and Manohar Kaul. 2026. Look Before You Leap: A Lookahead Reasoning Quality Gate for Speculative Decoding. In Proceedings of the 19th Conference of the European Chapter of the Association for Computational Linguistics (Volume 1: Long Papers), pages 7831–7847, Rabat, Morocco. Association for Computational Linguistics.
Cite (Informal):: Look Before You Leap: A Lookahead Reasoning Quality Gate for Speculative Decoding (Kingetsu et al., EACL 2026)
Copy Citation:
PDF:: https://aclanthology.org/2026.eacl-long.367.pdf
Checklist:: 2026.eacl-long.367.checklist.pdf

PDF Cite Search Checklist Fix data