ELERRANT: Automatic Grammatical Error Type Classification for Greek

Katerina Korre, Marita Chatzipanagiotou, John Pavlopoulos


Abstract
In this paper, we introduce the Greek version of the automatic annotation tool ERRANT (Bryant et al., 2017), which we named ELERRANT. ERRANT functions as a rule-based error type classifier and was used as the main evaluation tool of the systems participating in the BEA-2019 (Bryant et al., 2019) shared task. Here, we discuss grammatical and morphological differences between English and Greek and how these differences affected the development of ELERRANT. We also introduce the first Greek Native Corpus (GNC) and the Greek WikiEdits Corpus (GWE), two new evaluation datasets with errors from native Greek learners and Wikipedia Talk Pages edits respectively. These two datasets are used for the evaluation of ELERRANT. This paper is a sole fragment of a bigger picture which illustrates the attempt to solve the problem of low-resource languages in NLP, in our case Greek.
Anthology ID:
2021.ranlp-1.81
Volume:
Proceedings of the International Conference on Recent Advances in Natural Language Processing (RANLP 2021)
Month:
September
Year:
2021
Address:
Held Online
Editors:
Ruslan Mitkov, Galia Angelova
Venue:
RANLP
SIG:
Publisher:
INCOMA Ltd.
Note:
Pages:
708–717
Language:
URL:
https://aclanthology.org/2021.ranlp-1.81
DOI:
Bibkey:
Cite (ACL):
Katerina Korre, Marita Chatzipanagiotou, and John Pavlopoulos. 2021. ELERRANT: Automatic Grammatical Error Type Classification for Greek. In Proceedings of the International Conference on Recent Advances in Natural Language Processing (RANLP 2021), pages 708–717, Held Online. INCOMA Ltd..
Cite (Informal):
ELERRANT: Automatic Grammatical Error Type Classification for Greek (Korre et al., RANLP 2021)
Copy Citation:
PDF:
https://aclanthology.org/2021.ranlp-1.81.pdf
Data
WikiConv