Analyzing the Impact of Spelling Errors on POS-Tagging and Chunking in Learner English

Tomoya Mizumoto, Ryo Nagata


Abstract
Part-of-speech (POS) tagging and chunking have been used in tasks targeting learner English; however, to the best our knowledge, few studies have evaluated their performance and no studies have revealed the causes of POS-tagging/chunking errors in detail. Therefore, we investigate performance and analyze the causes of failure. We focus on spelling errors that occur frequently in learner English. We demonstrate that spelling errors reduced POS-tagging performance by 0.23% owing to spelling errors, and that a spell checker is not necessary for POS-tagging/chunking of learner English.
Anthology ID:
W17-5909
Volume:
Proceedings of the 4th Workshop on Natural Language Processing Techniques for Educational Applications (NLPTEA 2017)
Month:
December
Year:
2017
Address:
Taipei, Taiwan
Editors:
Yuen-Hsien Tseng, Hsin-Hsi Chen, Lung-Hao Lee, Liang-Chih Yu
Venue:
NLP-TEA
SIG:
Publisher:
Asian Federation of Natural Language Processing
Note:
Pages:
54–58
Language:
URL:
https://aclanthology.org/W17-5909
DOI:
Bibkey:
Cite (ACL):
Tomoya Mizumoto and Ryo Nagata. 2017. Analyzing the Impact of Spelling Errors on POS-Tagging and Chunking in Learner English. In Proceedings of the 4th Workshop on Natural Language Processing Techniques for Educational Applications (NLPTEA 2017), pages 54–58, Taipei, Taiwan. Asian Federation of Natural Language Processing.
Cite (Informal):
Analyzing the Impact of Spelling Errors on POS-Tagging and Chunking in Learner English (Mizumoto & Nagata, NLP-TEA 2017)
Copy Citation:
PDF:
https://aclanthology.org/W17-5909.pdf