Grammatical Error Detection Based on Machine Learning for Mandarin as Second Language Learning

Jui-Feng Yeh, Tsung-Wei Hsu, Chan-Kun Yeh


Abstract
Mandarin is not simple language for foreigner. Even using Mandarin as the mother tongue, they have to spend more time to learn when they were child. The following issues are the reason why causes learning problem. First, the word is envolved by Hieroglyphic. So a character can express meanings independently, but become a word has another semantic. Second, the Mandarin’s grammars have flexible rule and special usage. Therefore, the common grammatical errors can classify to missing, redundant, selection and disorder. In this paper, we proposed the structure of the Recurrent Neural Networks using Long Short-term memory (RNN-LSTM). It can detect the error type from the foreign learner writing. The features based on the word vector and part-of-speech vector. In the test data found that our method in the detection level of recall better than the others, even as high as 0.9755. That is because we give the possibility of greater choice in detecting errors.
Anthology ID:
W16-4918
Volume:
Proceedings of the 3rd Workshop on Natural Language Processing Techniques for Educational Applications (NLPTEA2016)
Month:
December
Year:
2016
Address:
Osaka, Japan
Venues:
NLP-TEA | WS
SIG:
Publisher:
The COLING 2016 Organizing Committee
Note:
Pages:
140–147
Language:
URL:
https://aclanthology.org/W16-4918
DOI:
Bibkey:
Cite (ACL):
Jui-Feng Yeh, Tsung-Wei Hsu, and Chan-Kun Yeh. 2016. Grammatical Error Detection Based on Machine Learning for Mandarin as Second Language Learning. In Proceedings of the 3rd Workshop on Natural Language Processing Techniques for Educational Applications (NLPTEA2016), pages 140–147, Osaka, Japan. The COLING 2016 Organizing Committee.
Cite (Informal):
Grammatical Error Detection Based on Machine Learning for Mandarin as Second Language Learning (Yeh et al., 2016)
Copy Citation:
PDF:
https://aclanthology.org/W16-4918.pdf