ARAIDA: Analogical Reasoning-Augmented Interactive Data Annotation

Chen Huang, Yiping Jin, Ilija Ilievski, Wenqiang Lei, Jiancheng Lv


Abstract
Human annotation is a time-consuming and labor-intensive task. To address this issue, interactive data annotation uses an annotation model to suggest labels for humans to approve or correct. However, annotation models trained with limited labeled data are prone to generating incorrect suggestions, leading to extra human correction effort. To tackle this challenge, we propose Araida, an analogical reasoning-based approach that improves automatic annotation accuracy in the interactive data annotation setting and reduces the need for human corrections. Araida employs an error-aware integration strategy that dynamically coordinates an annotation model and a k-nearest neighbors (KNN) model, giving more weight to the KNN model’s predictions when the annotation model’s predictions are deemed inaccurate. Empirical studies demonstrate that Araida is adaptable to different annotation tasks and models. On average, it reduces human correction labor by 11.02% compared to vanilla interactive data annotation methods.
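The error-aware integration described in the abstract can be pictured as a weighted combination of two class distributions, where the weight on the KNN side grows with the estimated error of the annotation model. The Python sketch below is illustrative only: the function names (araida_combine, knn_label_distribution), the linear mixing rule, and the cosine-similarity neighbor lookup are assumptions for exposition, not the paper's exact formulation.

```python
import numpy as np

def knn_label_distribution(query_emb, example_embs, example_labels,
                           n_classes, k=5):
    """Label distribution over the k nearest labeled examples,
    using cosine similarity (one plausible choice of metric)."""
    sims = example_embs @ query_emb / (
        np.linalg.norm(example_embs, axis=1) * np.linalg.norm(query_emb) + 1e-8
    )
    top_k = np.argsort(-sims)[:k]
    counts = np.bincount(example_labels[top_k],
                         minlength=n_classes).astype(float)
    return counts / counts.sum()

def araida_combine(model_probs, knn_probs, error_score):
    """Hypothetical error-aware integration: mix the annotation
    model's distribution with the KNN distribution, shifting weight
    toward KNN as the model's estimated error grows.

    model_probs: class probabilities from the annotation model
    knn_probs:   class distribution from the KNN lookup above
    error_score: estimated probability in [0, 1] that the annotation
                 model's prediction is wrong (the paper's actual
                 error estimator may differ)
    """
    combined = (1.0 - error_score) * model_probs + error_score * knn_probs
    return int(np.argmax(combined))
```

In this reading, an error_score near 0 keeps the annotation model's suggestion, while a score near 1 defers to analogous labeled neighbors; the suggestion shown to the annotator is the argmax of the mixed distribution.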
Anthology ID:
2024.acl-long.574
Volume:
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Month:
August
Year:
2024
Address:
Bangkok, Thailand
Editors:
Lun-Wei Ku, Andre Martins, Vivek Srikumar
Venue:
ACL
Publisher:
Association for Computational Linguistics
Pages:
10660–10675
URL:
https://aclanthology.org/2024.acl-long.574
Cite (ACL):
Chen Huang, Yiping Jin, Ilija Ilievski, Wenqiang Lei, and Jiancheng Lv. 2024. ARAIDA: Analogical Reasoning-Augmented Interactive Data Annotation. In Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 10660–10675, Bangkok, Thailand. Association for Computational Linguistics.
Cite (Informal):
ARAIDA: Analogical Reasoning-Augmented Interactive Data Annotation (Huang et al., ACL 2024)
PDF:
https://aclanthology.org/2024.acl-long.574.pdf