FreeAL: Towards Human-Free Active Learning in the Era of Large Language Models

Ruixuan Xiao; Yiwen Dong; Junbo Zhao; Runze Wu; Minmin Lin; Gang Chen; Haobo Wang

doi:10.18653/v1/2023.emnlp-main.896

FreeAL: Towards Human-Free Active Learning in the Era of Large Language Models

Ruixuan Xiao, Yiwen Dong, Junbo Zhao, Runze Wu, Minmin Lin, Gang Chen, Haobo Wang

Abstract

Collecting high-quality labeled data for model training is notoriously time-consuming and labor-intensive for various NLP tasks. While copious solutions, such as active learning for small language models (SLMs) and prevalent in-context learning in the era of large language models (LLMs), have been proposed and alleviate the labeling burden to some extent, their performances are still subject to human intervention. It is still underexplored how to reduce the annotation cost in the LLMs era. To bridge this, we revolutionize traditional active learning and propose an innovative collaborative learning framework FreeAL to interactively distill and filter the task-specific knowledge from LLMs. During collaborative training, an LLM serves as an active annotator inculcating its coarse-grained knowledge, while a downstream SLM is incurred as a student to filter out high-quality in-context samples to feedback LLM for the subsequent label refinery. Extensive experiments on eight benchmark datasets demonstrate that FreeAL largely enhances the zero-shot performances for both SLM and LLM without any human supervision.

Anthology ID:: 2023.emnlp-main.896
Volume:: Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing
Month:: December
Year:: 2023
Address:: Singapore
Editors:: Houda Bouamor, Juan Pino, Kalika Bali
Venue:: EMNLP
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 14520–14535
Language:
URL:: https://aclanthology.org/2023.emnlp-main.896/
DOI:: 10.18653/v1/2023.emnlp-main.896
Bibkey:
Cite (ACL):: Ruixuan Xiao, Yiwen Dong, Junbo Zhao, Runze Wu, Minmin Lin, Gang Chen, and Haobo Wang. 2023. FreeAL: Towards Human-Free Active Learning in the Era of Large Language Models. In Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, pages 14520–14535, Singapore. Association for Computational Linguistics.
Cite (Informal):: FreeAL: Towards Human-Free Active Learning in the Era of Large Language Models (Xiao et al., EMNLP 2023)
Copy Citation:
PDF:: https://aclanthology.org/2023.emnlp-main.896.pdf
Video:: https://aclanthology.org/2023.emnlp-main.896.mp4

PDF Cite Search Video Fix data