How to Leverage Digit Embeddings to Represent Numbers?

Jasivan Alex Sivakumar, Nafise Sadat Moosavi


Abstract
Within numerical reasoning, understanding numbers themselves is still a challenge for existing language models. Simple generalisations, such as solving 100+200 instead of 1+2, can substantially affect model performance (Sivakumar and Moosavi, 2023). Among various techniques, character-level embeddings of numbers have emerged as a promising approach to improve number representation. However, this method has limitations as it leaves the task of aggregating digit representations to the model, which lacks direct supervision for this process. In this paper, we explore the use of mathematical priors to compute aggregated digit embeddings and explicitly incorporate these aggregates into transformer models. This can be achieved either by adding a special token to the input embeddings or by introducing an additional loss function to enhance correct predictions. We evaluate the effectiveness of incorporating this explicit aggregation, analysing its strengths and shortcomings, and discuss future directions to better benefit from this approach. Our methods, while simple, are compatible with any pretrained model, easy to implement, and have been made publicly available.
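To make the idea of an explicit aggregate concrete, here is a minimal sketch in plain Python. It is an illustration under stated assumptions, not the authors' implementation: the abstract does not specify the mathematical prior, so the position-based weighting below is hypothetical, as are the function names, the randomly initialised embedding table, and the choice to prepend the aggregate vector as an extra token.

```python
import random


def make_digit_embeddings(dim, seed=0):
    """Build a toy embedding table for the digits 0-9 (random values, illustration only)."""
    rng = random.Random(seed)
    return {str(d): [rng.uniform(-1.0, 1.0) for _ in range(dim)] for d in range(10)}


def aggregate_digits(number, table):
    """Combine the digit embeddings of `number` into a single vector.

    ASSUMPTION: the weighting is a hypothetical prior in which more
    significant (leftmost) digits get larger weights, normalised to sum to 1.
    The paper's actual aggregation scheme is not given in the abstract.
    """
    digits = [c for c in number if c.isdigit()]
    if not digits:
        raise ValueError("number must contain at least one digit")
    n = len(digits)
    raw = [n - i for i in range(n)]  # leftmost digit gets weight n, rightmost gets 1
    total = sum(raw)
    weights = [w / total for w in raw]
    dim = len(table[digits[0]])
    aggregate = [0.0] * dim
    for weight, digit in zip(weights, digits):
        for k in range(dim):
            aggregate[k] += weight * table[digit][k]
    return aggregate


def with_aggregate_token(number, table):
    """Return the digit embeddings preceded by one extra aggregate vector.

    ASSUMPTION: placing the aggregate first is an illustrative choice.
    """
    digit_vectors = [table[c] for c in number if c.isdigit()]
    return [aggregate_digits(number, table)] + digit_vectors


table = make_digit_embeddings(dim=4)
sequence = with_aggregate_token("120", table)
print(len(sequence), "vectors of dimension", len(sequence[0]))
```

Because the weights depend on position, "12" and "21" receive different aggregate vectors even though they share the same digits, which is the kind of signal a plain sequence of digit embeddings leaves the model to work out for itself.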
Anthology ID:
2025.coling-main.514
Volume:
Proceedings of the 31st International Conference on Computational Linguistics
Month:
January
Year:
2025
Address:
Abu Dhabi, UAE
Editors:
Owen Rambow, Leo Wanner, Marianna Apidianaki, Hend Al-Khalifa, Barbara Di Eugenio, Steven Schockaert
Venue:
COLING
Publisher:
Association for Computational Linguistics
Pages:
7685–7697
URL:
https://aclanthology.org/2025.coling-main.514/
Cite (ACL):
Jasivan Alex Sivakumar and Nafise Sadat Moosavi. 2025. How to Leverage Digit Embeddings to Represent Numbers? In Proceedings of the 31st International Conference on Computational Linguistics, pages 7685–7697, Abu Dhabi, UAE. Association for Computational Linguistics.
Cite (Informal):
How to Leverage Digit Embeddings to Represent Numbers? (Sivakumar & Moosavi, COLING 2025)
PDF:
https://aclanthology.org/2025.coling-main.514.pdf