Self-Evolution Learning for Discriminative Language Model Pretraining

Qihuang Zhong, Liang Ding, Juhua Liu, Bo Du, Dacheng Tao


Abstract
Masked language modeling, widely used in discriminative language model (e.g., BERT) pretraining, commonly adopts a random masking strategy. However, random masking ignores the fact that words contribute unequally to the meaning of a sentence, and some of them are more worth predicting than others. Various alternative masking strategies (e.g., entity-level masking) have therefore been proposed, but most of them require expensive prior knowledge and generally train from scratch without reusing existing model weights. In this paper, we present Self-Evolution learning (SE), a simple and effective token masking and learning method that fully and wisely exploits the knowledge in the data. SE focuses on learning the informative yet under-explored tokens and adaptively regularizes training by introducing a novel Token-specific Label Smoothing approach. Experiments on 10 tasks show that our SE brings consistent and significant improvements (+1.43∼2.12 average scores) upon different PLMs. In-depth analyses demonstrate that SE improves linguistic knowledge learning and generalization.
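To make the token-specific label smoothing idea concrete, the snippet below is only an illustrative PyTorch sketch, not the authors' implementation: it assumes each masked token gets its own smoothing distribution ref_probs (e.g., produced by a frozen copy of the model), and the function name and the alpha parameter are hypothetical.

import torch
import torch.nn.functional as F

def token_specific_smoothing_loss(logits, targets, ref_probs, alpha=0.1):
    # logits:    (num_masked, vocab) predictions at masked positions
    # targets:   (num_masked,)       gold token ids
    # ref_probs: (num_masked, vocab) per-token reference distribution
    #            (assumed here; the exact choice follows the paper)
    vocab = logits.size(-1)
    one_hot = F.one_hot(targets, vocab).float()
    # Mix the hard label with a token-specific distribution instead of the
    # uniform distribution used in vanilla label smoothing.
    soft_target = (1.0 - alpha) * one_hot + alpha * ref_probs
    log_probs = F.log_softmax(logits, dim=-1)
    return -(soft_target * log_probs).sum(dim=-1).mean()

# Toy usage: 4 masked positions, vocabulary of 10 tokens.
logits = torch.randn(4, 10)
targets = torch.randint(0, 10, (4,))
ref_probs = torch.softmax(torch.randn(4, 10), dim=-1)
loss = token_specific_smoothing_loss(logits, targets, ref_probs, alpha=0.1)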
Anthology ID:
2023.findings-acl.254
Volume:
Findings of the Association for Computational Linguistics: ACL 2023
Month:
July
Year:
2023
Address:
Toronto, Canada
Editors:
Anna Rogers, Jordan Boyd-Graber, Naoaki Okazaki
Venue:
Findings
Publisher:
Association for Computational Linguistics
Pages:
4130–4145
URL:
https://aclanthology.org/2023.findings-acl.254
DOI:
10.18653/v1/2023.findings-acl.254
Cite (ACL):
Qihuang Zhong, Liang Ding, Juhua Liu, Bo Du, and Dacheng Tao. 2023. Self-Evolution Learning for Discriminative Language Model Pretraining. In Findings of the Association for Computational Linguistics: ACL 2023, pages 4130–4145, Toronto, Canada. Association for Computational Linguistics.
Cite (Informal):
Self-Evolution Learning for Discriminative Language Model Pretraining (Zhong et al., Findings 2023)
PDF:
https://aclanthology.org/2023.findings-acl.254.pdf
Video:
https://aclanthology.org/2023.findings-acl.254.mp4