Ran Cheng
2023
NanoNER: Named Entity Recognition for Nanobiology Using Experts’ Knowledge and Distant Supervision
Ran Cheng | Martin Lentschat | Cyril Labbé
Proceedings of the Second Workshop on Information Extraction from Scientific Publications
2021
Revisiting Self-training for Few-shot Learning of Language Model
Yiming Chen | Yan Zhang | Chen Zhang | Grandee Lee | Ran Cheng | Haizhou Li
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing
As unlabeled data carry rich task-relevant information, they have proven useful for few-shot learning of language models. The question is how to make effective use of such data. In this work, we revisit the self-training technique for language model fine-tuning and present a state-of-the-art prompt-based few-shot learner, SFLM. Given two views of a text sample produced by weak and strong augmentation techniques, SFLM generates a pseudo label on the weakly augmented version. The model is then fine-tuned to predict the same pseudo label on the strongly augmented version. This simple approach is shown to outperform other state-of-the-art supervised and semi-supervised counterparts on six sentence classification and six sentence-pair classification benchmarking tasks. In addition, SFLM relies on only a small amount of in-domain unlabeled data. We conduct a comprehensive analysis to demonstrate the robustness of our proposed approach under various settings, including augmentation techniques, model scale, and few-shot knowledge transfer across tasks.
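The weak/strong pseudo-labeling step described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the use of raw logits as inputs, and the confidence threshold of 0.9 are all assumptions made for illustration, and the abstract does not state whether SFLM filters pseudo labels by confidence.

```python
import math
from typing import List, Optional, Sequence


def softmax(logits: Sequence[float]) -> List[float]:
    """Numerically stable softmax over one example's class logits."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def pseudo_label(weak_logits: Sequence[float],
                 threshold: float = 0.9) -> Optional[int]:
    """Return the argmax class of the weakly augmented view.

    Assumption: predictions below `threshold` are discarded (None).
    The threshold is a hypothetical parameter, not taken from the abstract.
    """
    probs = softmax(weak_logits)
    best = max(range(len(probs)), key=probs.__getitem__)
    return best if probs[best] >= threshold else None


def self_training_loss(weak_batch: Sequence[Sequence[float]],
                       strong_batch: Sequence[Sequence[float]],
                       threshold: float = 0.9) -> float:
    """Cross-entropy of strong-view predictions against weak-view pseudo labels.

    Examples whose pseudo label is discarded contribute zero loss;
    the sum is averaged over the full batch size.
    """
    total = 0.0
    for weak_logits, strong_logits in zip(weak_batch, strong_batch):
        label = pseudo_label(weak_logits, threshold)
        if label is None:
            continue  # low-confidence pseudo label: skip this example
        total += -math.log(softmax(strong_logits)[label])
    return total / len(weak_batch) if weak_batch else 0.0
```

In a real fine-tuning loop the logits would come from the language model applied to the two augmented views of each unlabeled sample, and this term would typically be combined with a supervised loss on the few labeled examples.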
Co-authors
- Yiming Chen 1
- Cyril Labbé 1
- Grandee Lee 1
- Martin Lentschat 1
- Haizhou Li 1