Automatic Generation of Corpus-Based Exercises Using Generative AI

Adrian Jan Zasina


Abstract
This study explores the automatic generation of corpus-based language exercises using generative AI models. We focus on the interaction between language models and corpus data, detailing a workflow in which lexical and syntactic patterns are extracted from a tagged corpus and structured prompts are constructed to guide the model in producing sentence-level exercises. The generated exercises reveal both the potential of AI-driven approaches. However, observations highlight the necessity of careful design and critical evaluation when integrating generative models with corpus-based language materials. By analysing these processes from a computational linguistics perspective, this study contributes to understanding how generative AI can interact with structured linguistic data, informing future applications in automated language resources.
Anthology ID:
2025.rocling-main.9
Volume:
Proceedings of the 37th Conference on Computational Linguistics and Speech Processing (ROCLING 2025)
Month:
November
Year:
2025
Address:
National Taiwan University, Taipei City, Taiwan
Editors:
Kai-Wei Chang, Ke-Han Lu, Chih-Kai Yang, Zhi-Rui Tam, Wen-Yu Chang, Chung-Che Wang
Venue:
ROCLING
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
80–86
Language:
URL:
https://aclanthology.org/2025.rocling-main.9/
DOI:
Bibkey:
Cite (ACL):
Adrian Jan Zasina. 2025. Automatic Generation of Corpus-Based Exercises Using Generative AI. In Proceedings of the 37th Conference on Computational Linguistics and Speech Processing (ROCLING 2025), pages 80–86, National Taiwan University, Taipei City, Taiwan. Association for Computational Linguistics.
Cite (Informal):
Automatic Generation of Corpus-Based Exercises Using Generative AI (Zasina, ROCLING 2025)
Copy Citation:
PDF:
https://aclanthology.org/2025.rocling-main.9.pdf