Chinese Grammatical Error Diagnosis Based on Policy Gradient LSTM Model

Changliang Li, Ji Qi


Abstract
Chinese Grammatical Error Diagnosis (CGED) is a natural language processing task for the NLPTEA2018 workshop held during ACL2018. The goal of this task is to diagnose Chinese sentences containing four kinds of grammatical errors through the model and find out the sentence errors. Chinese grammatical error diagnosis system is a very important tool, which can help Chinese learners automatically diagnose grammatical errors in many scenarios. However, due to the limitations of the Chinese language’s own characteristics and datasets, the traditional model faces the problem of extreme imbalances in the positive and negative samples and the disappearance of gradients. In this paper, we propose a sequence labeling method based on the Policy Gradient LSTM model and apply it to this task to solve the above problems. The results show that our model can achieve higher precision scores in the case of lower False positive rate (FPR) and it is convenient to optimize the model on-line.
Anthology ID:
W18-3710
Volume:
Proceedings of the 5th Workshop on Natural Language Processing Techniques for Educational Applications
Month:
July
Year:
2018
Address:
Melbourne, Australia
Editors:
Yuen-Hsien Tseng, Hsin-Hsi Chen, Vincent Ng, Mamoru Komachi
Venue:
NLP-TEA
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
77–82
Language:
URL:
https://aclanthology.org/W18-3710
DOI:
10.18653/v1/W18-3710
Bibkey:
Cite (ACL):
Changliang Li and Ji Qi. 2018. Chinese Grammatical Error Diagnosis Based on Policy Gradient LSTM Model. In Proceedings of the 5th Workshop on Natural Language Processing Techniques for Educational Applications, pages 77–82, Melbourne, Australia. Association for Computational Linguistics.
Cite (Informal):
Chinese Grammatical Error Diagnosis Based on Policy Gradient LSTM Model (Li & Qi, NLP-TEA 2018)
Copy Citation:
PDF:
https://aclanthology.org/W18-3710.pdf