Title: Encoder-Decoder Models Can Benefit from Pre-trained Masked Language Models in Grammatical Error Correction
Authors: Masahiro Kaneko, Masato Mita, Shun Kiyono, Jun Suzuki, Kentaro Inui
Date: 2020-07
Resource type: text
Published in: Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics
Editors: Dan Jurafsky, Joyce Chai, Natalie Schluter, Joel Tetreault
Publisher: Association for Computational Linguistics
Place: Online
Genre: conference publication
Identifier: kaneko-etal-2020-encoder
DOI: 10.18653/v1/2020.acl-main.391
URL: https://aclanthology.org/2020.acl-main.391/
Pages: 4248-4254