Language Models Learn Universal Representations of Numbers and Here’s Why You Should Care

Michal Štefánik; Timothee Mickus; Marek Kadlčík; Bertram Højer; Michal Spiegel; Raúl Vázquez; Aman Sinha; Josef Kuchař; Philipp Mondorf; Pontus Stenetorp

Language Models Learn Universal Representations of Numbers and Here’s Why You Should Care

Michal Štefánik, Timothee Mickus, Marek Kadlčík, Bertram Højer, Michal Spiegel, Raúl Vázquez, Aman Sinha, Josef Kuchař, Philipp Mondorf, Pontus Stenetorp

Abstract

Prior work has shown that large language models (LLMs) often converge to accurate input embedding for numbers, based on sinusoidal representations.In this work, we demonstrate that these representations are in fact strikingly systematic, to the point of being almost perfectly universal: different LLM families develop equivalent sinusoidal structures, and number representations are broadly interchangeable in a large swathe of experimental setups.We show that properly factoring in this characteristic is crucial when it comes to assessing how accurately LLMs encode numeric and other ordinal information, and that mechanistically enhancing this sinusoidality can also lead to reductions of LLMs’ arithmetic errors.

Anthology ID:: 2026.acl-long.1415
Volume:: Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Month:: July
Year:: 2026
Address:: San Diego, California, United States
Editors:: Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:: ACL
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 30663–30681
Language:
URL:: https://aclanthology.org/2026.acl-long.1415/
DOI:
Bibkey:
Cite (ACL):: Michal Štefánik, Timothee Mickus, Marek Kadlčík, Bertram Højer, Michal Spiegel, Raúl Vázquez, Aman Sinha, Josef Kuchař, Philipp Mondorf, and Pontus Stenetorp. 2026. Language Models Learn Universal Representations of Numbers and Here’s Why You Should Care. In Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 30663–30681, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):: Language Models Learn Universal Representations of Numbers and Here’s Why You Should Care (Štefánik et al., ACL 2026)
Copy Citation:
PDF:: https://aclanthology.org/2026.acl-long.1415.pdf
Checklist:: 2026.acl-long.1415.checklist.pdf

PDF Cite Search Checklist Fix data