NTUCLE: Developing a Corpus of Learner English to Provide Writing Support for Engineering Students

Roger Vivek Placidus Winder, Joseph MacKinnon, Shu Yun Li, Benedict Christopher Tzer Liang Lin, Carmel Lee Hah Heah, Luís Morgado da Costa, Takayuki Kuribayashi, Francis Bond


Abstract
This paper describes the creation of a new annotated learner corpus. The aim is to use this corpus to develop an automated system for corrective feedback on students’ writing. With this system, students will be able to receive timely feedback on language errors before they submit their assignments for grading. A corpus of assignments submitted by first year engineering students was compiled, and a new error tag set for the NTU Corpus of Learner English (NTUCLE) was developed based on that of the NUS Corpus of Learner English (NUCLE), as well as marking rubrics used at NTU. After a description of the corpus, error tag set and annotation process, the paper presents the results of the annotation exercise as well as follow up actions. The final error tag set, which is significantly larger than that for the NUCLE error categories, is then presented before a brief conclusion summarising our experience and future plans.
Anthology ID:
W17-5901
Volume:
Proceedings of the 4th Workshop on Natural Language Processing Techniques for Educational Applications (NLPTEA 2017)
Month:
December
Year:
2017
Address:
Taipei, Taiwan
Editors:
Yuen-Hsien Tseng, Hsin-Hsi Chen, Lung-Hao Lee, Liang-Chih Yu
Venue:
NLP-TEA
SIG:
Publisher:
Asian Federation of Natural Language Processing
Note:
Pages:
1–11
Language:
URL:
https://aclanthology.org/W17-5901/
DOI:
Bibkey:
Cite (ACL):
Roger Vivek Placidus Winder, Joseph MacKinnon, Shu Yun Li, Benedict Christopher Tzer Liang Lin, Carmel Lee Hah Heah, Luís Morgado da Costa, Takayuki Kuribayashi, and Francis Bond. 2017. NTUCLE: Developing a Corpus of Learner English to Provide Writing Support for Engineering Students. In Proceedings of the 4th Workshop on Natural Language Processing Techniques for Educational Applications (NLPTEA 2017), pages 1–11, Taipei, Taiwan. Asian Federation of Natural Language Processing.
Cite (Informal):
NTUCLE: Developing a Corpus of Learner English to Provide Writing Support for Engineering Students (Winder et al., NLP-TEA 2017)
Copy Citation:
PDF:
https://aclanthology.org/W17-5901.pdf