Steering Language Models in Multi-Token Generation: A Case Study on Tense and Aspect

Alina Klerings; Jannik Brinkmann; Daniel Ruffinelli; Simone Paolo Ponzetto

doi:10.18653/v1/2025.emnlp-main.435

Steering Language Models in Multi-Token Generation: A Case Study on Tense and Aspect

Alina Klerings, Jannik Brinkmann, Daniel Ruffinelli, Simone Paolo Ponzetto

Abstract

Large language models (LLMs) are able to generate grammatically well-formed text, but how do they encode their syntactic knowledge internally? While prior work has focused largely on binary grammatical contrasts, in this work, we study the representation and control of two multidimensional hierarchical grammar phenomena—verb tense and aspect—and for each, identify distinct, orthogonal directions in residual space using linear discriminant analysis. Next, we demonstrate causal control over both grammatical features through concept steering across three generation tasks. Then, we use these identified features in a case study to investigate factors influencing effective steering in multi-token generation. We find that steering strength, location, and duration are crucial parameters for reducing undesirable side effects such as topic shift and degeneration. Our findings suggest that models encode tense and aspect in structurally organized, human-like ways, but effective control of such features during generation is sensitive to multiple factors and requires manual tuning or automated optimization.

Anthology ID:: 2025.emnlp-main.435
Volume:: Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing
Month:: November
Year:: 2025
Address:: Suzhou, China
Editors:: Christos Christodoulopoulos, Tanmoy Chakraborty, Carolyn Rose, Violet Peng
Venue:: EMNLP
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 8621–8639
Language:
URL:: https://aclanthology.org/2025.emnlp-main.435/
DOI:: 10.18653/v1/2025.emnlp-main.435
Bibkey:
Cite (ACL):: Alina Klerings, Jannik Brinkmann, Daniel Ruffinelli, and Simone Paolo Ponzetto. 2025. Steering Language Models in Multi-Token Generation: A Case Study on Tense and Aspect. In Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, pages 8621–8639, Suzhou, China. Association for Computational Linguistics.
Cite (Informal):: Steering Language Models in Multi-Token Generation: A Case Study on Tense and Aspect (Klerings et al., EMNLP 2025)
Copy Citation:
PDF:: https://aclanthology.org/2025.emnlp-main.435.pdf
Checklist:: 2025.emnlp-main.435.checklist.pdf

PDF Cite Search Checklist Fix data