ActiveAED: A Human in the Loop Improves Annotation Error Detection

Leon Weber, Barbara Plank


Abstract
Manually annotated datasets are crucial for training and evaluating Natural Language Processing models. However, recent work has discovered that even widely-used benchmark datasets contain a substantial number of erroneous annotations. This problem has been addressed with Annotation Error Detection (AED) models, which can flag such errors for human re-annotation. However, even though many of these AED methods assume a final curation step in which a human annotator decides whether the annotation is erroneous, they have been developed as static models without any human-in-the-loop component. In this work, we propose ActiveAED, an AED method that can detect errors more accurately by repeatedly querying a human for error corrections in its prediction loop. We evaluate ActiveAED on eight datasets spanning five different tasks and find that it leads to improvements over the state of the art on seven of them, with gains of up to six percentage points in average precision.
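The abstract describes the method only at a high level. For illustration, below is a minimal, hypothetical Python sketch of a human-in-the-loop error-detection loop of this flavour: a cross-validated model scores each example by how strongly it disagrees with the given label, the most suspicious annotations are shown to a human for correction, and the corrected labels feed into the next scoring round. The scoring model, disagreement score, batch size, and stopping rule are illustrative assumptions and are not taken from the paper.

```python
# Hypothetical sketch of a human-in-the-loop annotation error detection loop
# in the spirit of ActiveAED. The model, scoring function, and query strategy
# are illustrative assumptions, not the paper's actual implementation.
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_predict


def error_scores(features, labels):
    """Score each example by how strongly a cross-validated model
    disagrees with its current label (higher = more suspicious)."""
    probs = cross_val_predict(
        LogisticRegression(max_iter=1000), features, labels,
        cv=5, method="predict_proba",
    )
    classes = np.unique(labels)                      # sorted class order
    label_idx = np.searchsorted(classes, labels)     # column of each given label
    return 1.0 - probs[np.arange(len(labels)), label_idx]


def active_aed(features, labels, ask_human, rounds=10, batch_size=20):
    """Repeatedly flag the most suspicious annotations, let a human
    confirm or correct them, and re-score with the updated labels."""
    labels = list(labels)
    reviewed = set()
    for _ in range(rounds):
        scores = error_scores(features, labels)
        candidates = [i for i in np.argsort(-scores) if i not in reviewed]
        batch = candidates[:batch_size]
        if not batch:
            break
        for i in batch:
            labels[i] = ask_human(i, labels[i])  # human confirms or corrects
            reviewed.add(i)
    return labels, reviewed
```

In practice, `ask_human` would be whatever annotation interface collects the correction; for small datasets a simple command-line prompt suffices.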
Anthology ID: 2023.findings-acl.562
Volume: Findings of the Association for Computational Linguistics: ACL 2023
Month: July
Year: 2023
Address: Toronto, Canada
Editors: Anna Rogers, Jordan Boyd-Graber, Naoaki Okazaki
Venue: Findings
Publisher: Association for Computational Linguistics
Pages: 8834–8845
URL: https://aclanthology.org/2023.findings-acl.562
DOI: 10.18653/v1/2023.findings-acl.562
Cite (ACL): Leon Weber and Barbara Plank. 2023. ActiveAED: A Human in the Loop Improves Annotation Error Detection. In Findings of the Association for Computational Linguistics: ACL 2023, pages 8834–8845, Toronto, Canada. Association for Computational Linguistics.
Cite (Informal): ActiveAED: A Human in the Loop Improves Annotation Error Detection (Weber & Plank, Findings 2023)
PDF: https://aclanthology.org/2023.findings-acl.562.pdf