Large Language Models and Children Have Different Learning Trajectories in Determiner Acquisition

Olivia La Fiandra; Nathalie Fernandez Echeverri; Patrick Shafto; Naomi Feldman

doi:10.18653/v1/2025.babylm-main.8

Large Language Models and Children Have Different Learning Trajectories in Determiner Acquisition

Olivia La Fiandra, Nathalie Fernandez Echeverri, Patrick Shafto, Naomi H. Feldman

Abstract

Large language models are often compared to human learners based on the amount of training data required or the end state capabilities of a learner, yet less attention has been given to differences in their language learning process. This study uses determiner acquisition as a case study to characterize how LLMs and children differ in their learning processes. By analyzing annotated speech samples from specified age ranges of four children and intermediate training checkpoints of the Pythia-70m language model, we trace the learners’ learning paths of definite and indefinite determiner use. Our results reveal a divergence: the children first produce the indefinite determiner, while the model first produces the definite determiner. This difference reflects underlying differences in the learning goals and mechanisms of models and children. Framing language learning as movement over distributions of linguistic features makes the learning process visible and offers an alternative approach for comparing humans and language models.

Anthology ID:: 2025.babylm-main.8
Volume:: Proceedings of the First BabyLM Workshop
Month:: November
Year:: 2025
Address:: Suzhou, China
Editors:: Lucas Charpentier, Leshem Choshen, Ryan Cotterell, Mustafa Omer Gul, Michael Y. Hu, Jing Liu, Jaap Jumelet, Tal Linzen, Aaron Mueller, Candace Ross, Raj Sanjay Shah, Alex Warstadt, Ethan Gotlieb Wilcox, Adina Williams
Venue:: BabyLM
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 100–108
Language:
URL:: https://aclanthology.org/2025.babylm-main.8/
DOI:: 10.18653/v1/2025.babylm-main.8
Bibkey:
Cite (ACL):: Olivia La Fiandra, Nathalie Fernandez Echeverri, Patrick Shafto, and Naomi H. Feldman. 2025. Large Language Models and Children Have Different Learning Trajectories in Determiner Acquisition. In Proceedings of the First BabyLM Workshop, pages 100–108, Suzhou, China. Association for Computational Linguistics.
Cite (Informal):: Large Language Models and Children Have Different Learning Trajectories in Determiner Acquisition (Fiandra et al., BabyLM 2025)
Copy Citation:
PDF:: https://aclanthology.org/2025.babylm-main.8.pdf

PDF Cite Search Fix data