Personalizing Grammatical Error Correction: Adaptation to Proficiency Level and L1

Maria Nadejde, Joel Tetreault


Abstract
Grammar error correction (GEC) systems have become ubiquitous in a variety of software applications, and have started to approach human-level performance for some datasets. However, very little is known about how to efficiently personalize these systems to the user’s characteristics, such as their proficiency level and first language, or to emerging domains of text. We present the first results on adapting a general purpose neural GEC system to both the proficiency level and the first language of a writer, using only a few thousand annotated sentences. Our study is the broadest of its kind, covering five proficiency levels and twelve different languages, and comparing three different adaptation scenarios: adapting to the proficiency level only, to the first language only, or to both aspects simultaneously. We show that tailoring to both scenarios achieves the largest performance improvement (3.6 F0.5) relative to a strong baseline.
Anthology ID:
D19-5504
Volume:
Proceedings of the 5th Workshop on Noisy User-generated Text (W-NUT 2019)
Month:
November
Year:
2019
Address:
Hong Kong, China
Editors:
Wei Xu, Alan Ritter, Tim Baldwin, Afshin Rahimi
Venue:
WNUT
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
27–33
Language:
URL:
https://aclanthology.org/D19-5504
DOI:
10.18653/v1/D19-5504
Bibkey:
Cite (ACL):
Maria Nadejde and Joel Tetreault. 2019. Personalizing Grammatical Error Correction: Adaptation to Proficiency Level and L1. In Proceedings of the 5th Workshop on Noisy User-generated Text (W-NUT 2019), pages 27–33, Hong Kong, China. Association for Computational Linguistics.
Cite (Informal):
Personalizing Grammatical Error Correction: Adaptation to Proficiency Level and L1 (Nadejde & Tetreault, WNUT 2019)
Copy Citation:
PDF:
https://aclanthology.org/D19-5504.pdf
Attachment:
 D19-5504.Attachment.pdf
Data
FCE