Can a Remedy Find a Researcher? Exploring the Development of Semantic Knowledge in Italian BabyLMs

Alice Suozzi; Luca Capone; Gianluca E. Lebani; Alessandro Lenci

Can a Remedy Find a Researcher? Exploring the Development of Semantic Knowledge in Italian BabyLMs

Alice Suozzi, Luca Capone, Gianluca Lebani, Alessandro Lenci

Abstract

A large body of research has examined the linguistic abilities of language models (LMs) across various languages. However, conclusive evidence regarding their semantic competence and world knowledge remains limited, especially for low-resource languages. In this study, we explore the semantic competence of Italian BabyLMs, focusing on their sensitivity to semantic violations. To this end, we adapt a minimal pair benchmark targeting semantic violations to evaluate the semantic abilities of BAMBI, a family of small-scale models trained on progressively larger and more complex datasets. We further compare their performance, assessed through accuracy, mean log-likelihood offset, and expected calibration error, with that of three larger Italian LMs. Our findings shed light on this aspect of semantic competence in small-scale models and how this is affected by data scale and training strategies.

Anthology ID:: 2026.starsem-conference.24
Volume:: Proceedings of the 15th Joint Conference on Lexical and Computational Semantics (*SEM 2026)
Month:: July
Year:: 2026
Address:: San Diego, California, United States
Editors:: Saif M. Mohammad, Nedjma Ousidhoum
Venues:: *SEM | WS
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 366–377
Language:
URL:: https://aclanthology.org/2026.starsem-conference.24/
DOI:
Bibkey:
Cite (ACL):: Alice Suozzi, Luca Capone, Gianluca Lebani, and Alessandro Lenci. 2026. Can a Remedy Find a Researcher? Exploring the Development of Semantic Knowledge in Italian BabyLMs. In Proceedings of the 15th Joint Conference on Lexical and Computational Semantics (*SEM 2026), pages 366–377, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):: Can a Remedy Find a Researcher? Exploring the Development of Semantic Knowledge in Italian BabyLMs (Suozzi et al., *SEM 2026)
Copy Citation:
PDF:: https://aclanthology.org/2026.starsem-conference.24.pdf

PDF Cite Search Fix data