Paragraph-level Error Correction and Explanation Generation: Case Study for Estonian

Martin Vainikko; Taavi Kamarik; Karina Kert; Krista Liin; Silvia Maine; Kais Allkivi; Annekatrin Kaivapalu; Mark Fishel

doi:10.18653/v1/2025.bea-1.72

Paragraph-level Error Correction and Explanation Generation: Case Study for Estonian

Martin Vainikko, Taavi Kamarik, Karina Kert, Krista Liin, Silvia Maine, Kais Allkivi, Annekatrin Kaivapalu, Mark Fishel

Abstract

We present a case study on building task-specific models for grammatical error correction and explanation generation tailored to learners of Estonian. Our approach handles whole paragraphs instead of sentences and leverages prompting proprietary large language models for generating synthetic training data, addressing the limited availability of error correction data and the complete absence of correction justification/explanation data in Estonian. We describe the chosen approach and pipeline and provide technical details for the experimental part. The final outcome is a set of open-weight models, which are released with a permissive license along with the generated synthetic error correction and explanation data.

Anthology ID:: 2025.bea-1.72
Volume:: Proceedings of the 20th Workshop on Innovative Use of NLP for Building Educational Applications (BEA 2025)
Month:: July
Year:: 2025
Address:: Vienna, Austria
Editors:: Ekaterina Kochmar, Bashar Alhafni, Marie Bexte, Jill Burstein, Andrea Horbach, Ronja Laarmann-Quante, Anaïs Tack, Victoria Yaneva, Zheng Yuan
Venues:: BEA | WS
SIG:: SIGEDU
Publisher:: Association for Computational Linguistics
Note:
Pages:: 953–967
Language:
URL:: https://aclanthology.org/2025.bea-1.72/
DOI:: 10.18653/v1/2025.bea-1.72
Bibkey:
Cite (ACL):: Martin Vainikko, Taavi Kamarik, Karina Kert, Krista Liin, Silvia Maine, Kais Allkivi, Annekatrin Kaivapalu, and Mark Fishel. 2025. Paragraph-level Error Correction and Explanation Generation: Case Study for Estonian. In Proceedings of the 20th Workshop on Innovative Use of NLP for Building Educational Applications (BEA 2025), pages 953–967, Vienna, Austria. Association for Computational Linguistics.
Cite (Informal):: Paragraph-level Error Correction and Explanation Generation: Case Study for Estonian (Vainikko et al., BEA 2025)
Copy Citation:
PDF:: https://aclanthology.org/2025.bea-1.72.pdf

PDF Cite Search Fix data