Can Active Label Correction Improve LLM-based Modular AI Systems?

Karan Taneja, Ashok Goel


Abstract
Modular AI systems can be developed using LLM-prompts-based modules to minimize deployment time even for complex tasks. However, these systems do not always perform well and improving them using the data traces collected from a deployment remains an open challenge. The data traces contain LLM inputs and outputs, but the annotations from LLMs are noisy. We hypothesize that Active Label Correction (ALC) can be use on the collected data to train smaller task-specific improved models that can replace LLM-based modules. In this paper, we study the noise in three GPT-3.5-annotated datasets and their denoising with human feedback. We also propose a novel method ALC3 that iteratively applies three updates to the training dataset: auto-correction, correction using human feedback and filtering. Our results show that ALC3 can lead to oracle performance with feedback on 17-24% fewer examples than the number of noisy examples in the dataset across three different NLP tasks.
Anthology ID:
2024.emnlp-main.509
Volume:
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing
Month:
November
Year:
2024
Address:
Miami, Florida, USA
Editors:
Yaser Al-Onaizan, Mohit Bansal, Yun-Nung Chen
Venue:
EMNLP
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
9019–9031
Language:
URL:
https://aclanthology.org/2024.emnlp-main.509
DOI:
Bibkey:
Cite (ACL):
Karan Taneja and Ashok Goel. 2024. Can Active Label Correction Improve LLM-based Modular AI Systems?. In Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, pages 9019–9031, Miami, Florida, USA. Association for Computational Linguistics.
Cite (Informal):
Can Active Label Correction Improve LLM-based Modular AI Systems? (Taneja & Goel, EMNLP 2024)
Copy Citation:
PDF:
https://aclanthology.org/2024.emnlp-main.509.pdf