System Combination for Grammatical Error Correction Based on Integer Programming

Ruixi Lin, Hwee Tou Ng


Abstract
In this paper, we propose a system combination method for grammatical error correction (GEC), based on nonlinear integer programming (IP). Our method optimizes a novel F score objective based on error types, and combines multiple end-to-end GEC systems. The proposed IP approach optimizes the selection of a single best system for each grammatical error type present in the data. Experiments of the IP approach on combining state-of-the-art standalone GEC systems show that the combined system outperforms all standalone systems. It improves F0.5 score by 3.61% when combining the two best participating systems in the BEA 2019 shared task, and achieves F0.5 score of 73.08%. We also perform experiments to compare our IP approach with another state-of-the-art system combination method for GEC, demonstrating IP’s competitive combination capability.
Anthology ID:
2021.ranlp-1.94
Volume:
Proceedings of the International Conference on Recent Advances in Natural Language Processing (RANLP 2021)
Month:
September
Year:
2021
Address:
Held Online
Editors:
Ruslan Mitkov, Galia Angelova
Venue:
RANLP
SIG:
Publisher:
INCOMA Ltd.
Note:
Pages:
824–829
Language:
URL:
https://aclanthology.org/2021.ranlp-1.94
DOI:
Bibkey:
Cite (ACL):
Ruixi Lin and Hwee Tou Ng. 2021. System Combination for Grammatical Error Correction Based on Integer Programming. In Proceedings of the International Conference on Recent Advances in Natural Language Processing (RANLP 2021), pages 824–829, Held Online. INCOMA Ltd..
Cite (Informal):
System Combination for Grammatical Error Correction Based on Integer Programming (Lin & Ng, RANLP 2021)
Copy Citation:
PDF:
https://aclanthology.org/2021.ranlp-1.94.pdf
Code
 nusnlp/gec_ip