Shota Koyama
2024
n-gram F-score for Evaluating Grammatical Error Correction
Shota Koyama
|
Ryo Nagata
|
Hiroya Takamura
|
Naoaki Okazaki
Proceedings of the 17th International Natural Language Generation Conference
M2 and its variants are the most widely used automatic evaluation metrics for grammatical error correction (GEC), which calculate an F-score using a phrase-based alignment between sentences. However, it is not straightforward at all to align learner sentences containing errors to their correct sentences. In addition, alignment calculations are computationally expensive. We propose GREEN, an alignment-free F-score for GEC evaluation. GREEN treats a sentence as a multiset of n-grams and extracts edits between sentences by set operations instead of computing an alignment. Our experiments confirm that GREEN performs better than existing methods for the corpus-level metrics and comparably for the sentence-level metrics even without computing an alignment. GREEN is available at https://github.com/shotakoyama/green.
2023
The Tokyo Tech and AIST System at the GenChal 2022 Shared Task on Feedback Comment Generation
Shota Koyama
|
Hiroya Takamura
|
Naoaki Okazaki
Proceedings of the 16th International Natural Language Generation Conference: Generation Challenges
This paper describes the Tokyo Tech and AIST system in the GenChal 2022 shared task, which is the first shared task of feedback comment generation. We adopted five methods: data cleaning, fine-tuning pre-trained models, correcting errors in learners’ sentences, appending a correcting operation, and filtering out irrelevant outputs. Our system achieved F1 = 43.4 on the test dataset.
2021
Various Errors Improve Neural Grammatical Error Correction
Shota Koyama
|
Hiroya Takamura
|
Naoaki Okazaki
Proceedings of the 35th Pacific Asia Conference on Language, Information and Computation