Qun Chen
2024
Transfer Learning for Text Classification via Model Risk Analysis
Yujie Sun
|
Chuyi Fan
|
Qun Chen
Findings of the Association for Computational Linguistics: EMNLP 2024
It has been well recognized that text classification can be satisfactorily performed by Deep Neural Network (DNN) models, provided that there are sufficient in-distribution training data. However, in the presence of distribution drift, a well trained DNN model may not perform well on a new dataset even though class labels are aligned between training and target datasets. To alleviate this limitation, we propose a novel approach based on model risk analysis to adapt a pre-trained DNN model towards a new dataset given only a small set of representative data. We first present a solution of model risk analysis for text classification, which can effectively quantify misprediction risk of a classifier on a dataset. Built upon the existing framework of LearnRisk, the proposed solution, denoted by LearnRisk-TC, first generates interpretable risk features, then constructs a risk model by aggregating these features, and finally trains the risk model on a small set of labeled data. Furthermore, we present a transfer learning solution based on model risk analysis, which can effectively fine-tune a pre-trained model toward a target dataset by minimizing its misprediction risk. We have conducted extensive experiments on real datasets. Our experimental results show that the proposed solution performs considerably better than the existing alternative approaches. By using text classification as a test case, we demonstrate the potential applicability of risk-based transfer learning to various challenging NLP tasks. Our codes are available at https://github.com/syjcomputer/LRTC.
2023
Supervised Gradual Machine Learning for Aspect-Term Sentiment Analysis
Yanyan Wang
|
Qun Chen
|
Murtadha H.M. Ahmed
|
Zhaoqiang Chen
|
Jing Su
|
Wei Pan
|
Zhanhuai Li
Transactions of the Association for Computational Linguistics, Volume 11
Recent work has shown that Aspect-Term Sentiment Analysis (ATSA) can be effectively performed by Gradual Machine Learning (GML). However, the performance of the current unsupervised solution is limited by inaccurate and insufficient knowledge conveyance. In this paper, we propose a supervised GML approach for ATSA, which can effectively exploit labeled training data to improve knowledge conveyance. It leverages binary polarity relations between instances, which can be either similar or opposite, to enable supervised knowledge conveyance. Besides the explicit polarity relations indicated by discourse structures, it also separately supervises a polarity classification DNN and a binary Siamese network to extract implicit polarity relations. The proposed approach fulfills knowledge conveyance by modeling detected relations as binary features in a factor graph. Our extensive experiments on real benchmark data show that it achieves the state-of-the-art performance across all the test workloads. Our work demonstrates clearly that, in collaboration with DNN for feature extraction, GML outperforms pure DNN solutions.
2021
DNN-driven Gradual Machine Learning for Aspect-term Sentiment Analysis
Murtadha Ahmed
|
Qun Chen
|
Yanyan Wang
|
Youcef Nafa
|
Zhanhuai Li
|
Tianyi Duan
Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021
Search
Co-authors
- Yanyan Wang 2
- Zhanhuai Li 2
- Yujie Sun 1
- Chuyi Fan 1
- Murtadha H.M. Ahmed 1
- show all...