@inproceedings{mao-etal-2025-lightweight,
title = "Lightweight Contenders: Navigating Semi-Supervised Text Mining through Peer Collaboration and Self Transcendence",
author = "Mao, Qianren and
Jiang, Weifeng and
Liu, Junnan and
Lin, Chenghua and
Li, Qian and
Wen, Xianqing and
Li, Jianxin and
Lu, Jinhu",
editor = "Chiruzzo, Luis and
Ritter, Alan and
Wang, Lu",
booktitle = "Findings of the Association for Computational Linguistics: NAACL 2025",
month = apr,
year = "2025",
address = "Albuquerque, New Mexico",
publisher = "Association for Computational Linguistics",
url = "https://aclanthology.org/2025.findings-naacl.309/",
doi = "10.18653/v1/2025.findings-naacl.309",
pages = "5571--5585",
ISBN = "979-8-89176-195-7",
abstract = "The semi-supervised learning (SSL) strategy in lightweight models requires reducing annotated samples and facilitating cost-effective inference. However, the constraint on model parameters, imposed by the scarcity of training labels, limits the SSL performance. In this paper, we introduce PS-NET, a novel framework tailored for semi-supervised text mining with lightweight models. PS-NET incorporates online distillation to train lightweight student models by imitating the Teacher model. It also integrates an ensemble of student peers that collaboratively instruct each other. Additionally, PS-NET implements a constant adversarial perturbation schema to further self-augmentation by progressive generalizing. Our PS-NET, equipped with a 2-layer distilled BERT, exhibits notable performance enhancements over SOTA lightweight SSL frameworks of FLiText and Disco in SSL text classification with extremely rare labelled data."
}
Qianren Mao, Weifeng Jiang, Junnan Liu, Chenghua Lin, Qian Li, Xianqing Wen, Jianxin Li, and Jinhu Lu. 2025. Lightweight Contenders: Navigating Semi-Supervised Text Mining through Peer Collaboration and Self Transcendence. In Findings of the Association for Computational Linguistics: NAACL 2025, pages 5571–5585, Albuquerque, New Mexico. Association for Computational Linguistics.
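
For context on what the abstract describes, below is a minimal, hypothetical sketch of the combined training signal: online distillation from a teacher, mutual instruction among student peers, and consistency under input perturbation. The MLP stand-ins, the random-noise perturbation (in place of the paper's adversarial schema), the temperature, and every other choice here are illustrative assumptions, not the authors' implementation.

import torch
import torch.nn as nn
import torch.nn.functional as F

# Toy stand-ins for the real models: PS-NET distils a BERT teacher into
# 2-layer students; plain MLPs keep this sketch self-contained and runnable.
teacher = nn.Sequential(nn.Linear(32, 64), nn.ReLU(), nn.Linear(64, 4))
students = [nn.Sequential(nn.Linear(32, 16), nn.ReLU(), nn.Linear(16, 4))
            for _ in range(2)]
opt = torch.optim.Adam([p for s in students for p in s.parameters()], lr=1e-3)

def soft_kl(target_logits, logits, T=2.0):
    # Soft-label distillation loss: push `logits` toward the (detached)
    # target distribution, softened by temperature T.
    return F.kl_div(F.log_softmax(logits / T, dim=-1),
                    F.softmax(target_logits / T, dim=-1),
                    reduction="batchmean") * T * T

x = torch.randn(8, 32)                    # a batch of unlabelled inputs
x_pert = x + 0.05 * torch.randn_like(x)   # constant-magnitude perturbation
                                          # (random noise here; the paper
                                          # describes an adversarial schema)
with torch.no_grad():
    t_logits = teacher(x)                 # teacher guidance, no gradients

s_logits = [s(x) for s in students]
loss = torch.zeros(())
for i, s in enumerate(students):
    loss = loss + soft_kl(t_logits, s_logits[i])            # imitate teacher
    for j in range(len(students)):                          # peers teach peers
        if j != i:
            loss = loss + soft_kl(s_logits[j].detach(), s_logits[i])
    loss = loss + soft_kl(s_logits[i].detach(), s(x_pert))  # self-consistency

opt.zero_grad()
loss.backward()
opt.step()

Each student thus receives three signals per batch: the teacher's soft labels, its peers' soft labels, and its own predictions on a perturbed copy of the input; how these are weighted and scheduled is left to the paper itself.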