Once Upon a Time: Interactive Learning for Storytelling with Small Language Models

Jonas Mayer Martins; Ali Hamza Bashir; Muhammad Rehan Khalid; Lisa Beinborn

doi:10.18653/v1/2025.babylm-main.32

Once Upon a Time: Interactive Learning for Storytelling with Small Language Models

Jonas Mayer Martins, Ali Hamza Bashir, Muhammad Rehan Khalid, Lisa Beinborn

Abstract

Children efficiently acquire language not just by listening, but by interacting with others in their social environment. Conversely, large language models are typically trained with next-word prediction on massive amounts of text. Motivated by this contrast, we investigate whether language models can be trained with less data by learning not only from next-word prediction but also from high-level, cognitively inspired feedback. We train a student model to generate stories, which a teacher model rates on readability, narrative coherence, and creativity. By varying the amount of pretraining before the feedback loop, we assess the impact of this interactive learning on formal and functional linguistic competence. We find that the high-level feedback is highly data efficient: With just 1 M words of input in interactive learning, storytelling skills can improve as much as with 410 M words of next-word prediction.

Anthology ID:: 2025.babylm-main.32
Original:: 2025.babylm-main.32v1
Version 2:: 2025.babylm-main.32v2
Volume:: Proceedings of the First BabyLM Workshop
Month:: November
Year:: 2025
Address:: Suzhou, China
Editors:: Lucas Charpentier, Leshem Choshen, Ryan Cotterell, Mustafa Omer Gul, Michael Y. Hu, Jing Liu, Jaap Jumelet, Tal Linzen, Aaron Mueller, Candace Ross, Raj Sanjay Shah, Alex Warstadt, Ethan Gotlieb Wilcox, Adina Williams
Venue:: BabyLM
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 454–468
Language:
URL:: https://aclanthology.org/2025.babylm-main.32/
DOI:: 10.18653/v1/2025.babylm-main.32
Bibkey:
Cite (ACL):: Jonas Mayer Martins, Ali Hamza Bashir, Muhammad Rehan Khalid, and Lisa Beinborn. 2025. Once Upon a Time: Interactive Learning for Storytelling with Small Language Models. In Proceedings of the First BabyLM Workshop, pages 454–468, Suzhou, China. Association for Computational Linguistics.
Cite (Informal):: Once Upon a Time: Interactive Learning for Storytelling with Small Language Models (Mayer Martins et al., BabyLM 2025)
Copy Citation:
PDF:: https://aclanthology.org/2025.babylm-main.32.pdf

PDF (v2) PDF (v1) Cite Search Fix data