Guidong Zheng
2022
All Information is Valuable: Question Matching over Full Information Transmission Network
Le Qi
|
Yu Zhang
|
Qingyu Yin
|
Guidong Zheng
|
Wen Junjie
|
Jinlong Li
|
Ting Liu
Findings of the Association for Computational Linguistics: NAACL 2022
Question matching is the task of identifying whether two questions have the same intent. For better reasoning the relationship between questions, existing studies adopt multiple interaction modules and perform multi-round reasoning via deep neural networks. In this process, there are two kinds of critical information that are commonly employed: the representation information of original questions and the interactive information between pairs of questions. However, previous studies tend to transmit only one kind of information, while failing to utilize both kinds of information simultaneously. To address this problem, in this paper, we propose a Full Information Transmission Network (FITN) that can transmit both representation and interactive information together in a simultaneous fashion. More specifically, we employ a novel memory-based attention for keeping and transmitting the interactive information through a global interaction matrix. Besides, we apply an original-average mixed connection method to effectively transmit the representation information between different reasoning rounds, which helps to preserve the original representation features of questions along with the historical hidden features. Experiments on two standard benchmarks demonstrate that our approach outperforms strong baseline models.
Learning to Detect Noisy Labels Using Model-Based Features
Zhihao Wang
|
Zongyu Lin
|
Junjie Wen
|
Xianxin Chen
|
Peiqi Liu
|
Guidong Zheng
|
Yujun Chen
|
Zhilin Yang
Findings of the Association for Computational Linguistics: EMNLP 2022
Label noise is ubiquitous in various machine learning scenarios such as self-labeling with model predictions and erroneous data annotation. Many existing approaches are based on heuristics such as sample losses, which might not be flexible enough to achieve optimal solutions. Meta learning based methods address this issue by learning a data selection function, but can be hard to optimize. In light of these pros and cons, we propose SENT (Selection-Enhanced Noisy label Training) that does not rely on meta learning while having the flexibility of being data-driven. SENT transfers the noise distribution to a clean set and trains a model to distinguish noisy labels from clean ones using model-based features. Empirically, on a wide range of tasks including text classification and speech recognition, SENT improves performance over strong baselines under the settings of self-training and label corruption.
Search
Fix data
Co-authors
- Xianxin Chen 1
- Yujun Chen 1
- Wen Junjie 1
- Jinlong Li 1
- Zongyu Lin 1
- show all...