Chinese Grammatical Error Diagnosis using Statistical and Prior Knowledge driven Features with Probabilistic Ensemble Enhancement

Ruiji Fu; Zhengqi Pei; Jiefu Gong; Wei Song; Dechuan Teng; Wanxiang Che; Shijin Wang; Guoping Hu; Ting Liu

doi:10.18653/v1/W18-3707

Chinese Grammatical Error Diagnosis using Statistical and Prior Knowledge driven Features with Probabilistic Ensemble Enhancement

Ruiji Fu, Zhengqi Pei, Jiefu Gong, Wei Song, Dechuan Teng, Wanxiang Che, Shijin Wang, Guoping Hu, Ting Liu

Abstract

This paper describes our system at NLPTEA-2018 Task #1: Chinese Grammatical Error Diagnosis. Grammatical Error Diagnosis is one of the most challenging NLP tasks, which is to locate grammar errors and tell error types. Our system is built on the model of bidirectional Long Short-Term Memory with a conditional random field layer (BiLSTM-CRF) but integrates with several new features. First, richer features are considered in the BiLSTM-CRF model; second, a probabilistic ensemble approach is adopted; third, Template Matcher are used during a post-processing to bring in human knowledge. In official evaluation, our system obtains the highest F1 scores at identifying error types and locating error positions, the second highest F1 score at sentence level error detection. We also recommend error corrections for specific error types and achieve the best F1 performance among all participants.

Anthology ID:: W18-3707
Volume:: Proceedings of the 5th Workshop on Natural Language Processing Techniques for Educational Applications
Month:: July
Year:: 2018
Address:: Melbourne, Australia
Editors:: Yuen-Hsien Tseng, Hsin-Hsi Chen, Vincent Ng, Mamoru Komachi
Venue:: NLP-TEA
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 52–59
Language:
URL:: https://aclanthology.org/W18-3707/
DOI:: 10.18653/v1/W18-3707
Bibkey:
Cite (ACL):: Ruiji Fu, Zhengqi Pei, Jiefu Gong, Wei Song, Dechuan Teng, Wanxiang Che, Shijin Wang, Guoping Hu, and Ting Liu. 2018. Chinese Grammatical Error Diagnosis using Statistical and Prior Knowledge driven Features with Probabilistic Ensemble Enhancement. In Proceedings of the 5th Workshop on Natural Language Processing Techniques for Educational Applications, pages 52–59, Melbourne, Australia. Association for Computational Linguistics.
Cite (Informal):: Chinese Grammatical Error Diagnosis using Statistical and Prior Knowledge driven Features with Probabilistic Ensemble Enhancement (Fu et al., NLP-TEA 2018)
Copy Citation:
PDF:: https://aclanthology.org/W18-3707.pdf

PDF Cite Search Fix data