On the Effect of Hyperparameters in Language Modeling for Computational Linguistics

Ruoxi Ning; Yongpeng Zhu; Qingcheng Zeng; Tatsuki Kuribayashi; Freda Shi

On the Effect of Hyperparameters in Language Modeling for Computational Linguistics

Ruoxi Ning, Yongpeng Zhu, Qingcheng Zeng, Tatsuki Kuribayashi, Freda Shi

Abstract

Training language models and examining their linguistic behaviors have been a common protocol in computational linguistics for studying linguistic phenomena and modeling human language processing. However, work in this area is often limited to proof-of-concept demonstrations with arbitrary model configurations, without considering hyperparameter sensitivity, an important source of variation in model performance. In this work, we replicate three prior studies (Chang and Bergen, 2022; Hu et al., 2020b; Kuribayashi et al., 2024) with hyperparameters varied within a practical range, and show that modest hyperparameter changes can alter some qualitative conclusions about models’ linguistic abilities and even reverse the ranking of model performance. Our results highlight the risk that prior work may have reflected optimization artifacts rather than the genuine inductive biases of model classes, and that hyperparameter sensitivity should receive more attention as a factor that can meaningfully influence model behavior. We suggest future work to report the variation of performance across the configuration space to enhance the reliability and generalizability of conclusions. Code: https://github.com/compling-wat/tune-linguistic-lms.

Anthology ID:: 2026.acl-long.1939
Volume:: Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Month:: July
Year:: 2026
Address:: San Diego, California, United States
Editors:: Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:: ACL
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 41863–41880
Language:
URL:: https://aclanthology.org/2026.acl-long.1939/
DOI:
Bibkey:
Cite (ACL):: Ruoxi Ning, Yongpeng Zhu, Qingcheng Zeng, Tatsuki Kuribayashi, and Freda Shi. 2026. On the Effect of Hyperparameters in Language Modeling for Computational Linguistics. In Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 41863–41880, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):: On the Effect of Hyperparameters in Language Modeling for Computational Linguistics (Ning et al., ACL 2026)
Copy Citation:
PDF:: https://aclanthology.org/2026.acl-long.1939.pdf
Checklist:: 2026.acl-long.1939.checklist.pdf

PDF Cite Search Checklist Fix data