Detection-Correction Structure via General Language Model for Grammatical Error Correction

Wei Li (李维, 李炜); Houfeng Wang

doi:10.18653/v1/2024.acl-long.96

Detection-Correction Structure via General Language Model for Grammatical Error Correction

Abstract

Grammatical error correction (GEC) is a task dedicated to rectifying texts with minimal edits, which can be decoupled into two components: detection and correction. However, previous works have predominantly focused on direct correction, with no prior efforts to integrate both into a single model. Moreover, the exploration of the detection-correction paradigm by large language models (LLMs) remains underdeveloped. This paper introduces an integrated detection-correction structure, named DeCoGLM, based on the General Language Model (GLM). The detection phase employs a fault-tolerant detection template, while the correction phase leverages autoregressive mask infilling for localized error correction. Through the strategic organization of input tokens and modification of attention masks, we facilitate multi-task learning within a single model. Our model demonstrates competitive performance against the state-of-the-art models on English and Chinese GEC datasets. Further experiments present the effectiveness of the detection-correction structure in LLMs, suggesting a promising direction for GEC.

Anthology ID:: 2024.luhme-long.96
Volume:: Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Month:: August
Year:: 2024
Address:: Bangkok, Thailand
Editors:: Lun-Wei Ku, Andre Martins, Vivek Srikumar
Venue:: ACL
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 1748–1763
Language:
URL:: https://aclanthology.org/2024.luhme-long.96/
DOI:: 10.18653/v1/2024.acl-long.96
Bibkey:
Cite (ACL):: Wei Li and Houfeng Wang. 2024. Detection-Correction Structure via General Language Model for Grammatical Error Correction. In Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 1748–1763, Bangkok, Thailand. Association for Computational Linguistics.
Cite (Informal):: Detection-Correction Structure via General Language Model for Grammatical Error Correction (Li & Wang, ACL 2024)
Copy Citation:
PDF:: https://aclanthology.org/2024.acl-long.96.pdf

PDF Cite Search Fix data