Qingsong Lv
2022
Parameter-Efficient Tuning Makes a Good Classification Head
Zhuoyi Yang | Ming Ding | Yanhui Guo | Qingsong Lv | Jie Tang
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing
In recent years, pretrained models have revolutionized the paradigm of natural language understanding (NLU), where we append a randomly initialized classification head after the pretrained backbone, e.g., BERT, and finetune the whole model. As the pretrained backbone makes a major contribution to the improvement, we naturally expect that a good pretrained classification head can also benefit the training. However, the final-layer output of the backbone, i.e., the input of the classification head, changes greatly during finetuning, making the usual head-only pretraining ineffective. In this paper, we find that parameter-efficient tuning makes a good classification head, with which we can simply replace the randomly initialized heads for a stable performance gain. Our experiments demonstrate that the classification head jointly pretrained with parameter-efficient tuning consistently improves the performance on 9 tasks in GLUE and SuperGLUE.
2018
Cross-lingual Knowledge Graph Alignment via Graph Convolutional Networks
Zhichun Wang | Qingsong Lv | Xiaohan Lan | Yu Zhang
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing
Multilingual knowledge graphs (KGs) such as DBpedia and YAGO contain structured knowledge of entities in several distinct languages, and they are useful resources for cross-lingual AI and NLP applications. Cross-lingual KG alignment is the task of matching entities with their counterparts in different languages, which is an important way to enrich the cross-lingual links in multilingual KGs. In this paper, we propose a novel approach for cross-lingual KG alignment via graph convolutional networks (GCNs). Given a set of pre-aligned entities, our approach trains GCNs to embed entities of each language into a unified vector space. Entity alignments are discovered based on the distances between entities in the embedding space. Embeddings can be learned from both the structural and attribute information of entities, and the results of structure embedding and attribute embedding are combined to obtain accurate alignments. In experiments on aligning real multilingual KGs, our approach achieves the best performance compared with other embedding-based KG alignment approaches.
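The distance-based alignment step described in this abstract can be illustrated with a minimal sketch. It assumes the entity embeddings have already been produced (the GCN training itself is not shown), and the L1 distance, the per-dimension normalization, the weight `beta`, and all function and entity names are assumptions made for illustration, not details stated in the abstract.

```python
from typing import Dict, List, Tuple

Vector = List[float]
# Hypothetical layout: entity name -> (structure embedding, attribute embedding)
Embeddings = Dict[str, Tuple[Vector, Vector]]


def l1_distance(a: Vector, b: Vector) -> float:
    """Sum of absolute coordinate differences (an assumed choice of metric)."""
    return sum(abs(x - y) for x, y in zip(a, b))


def combined_distance(
    src: Tuple[Vector, Vector], tgt: Tuple[Vector, Vector], beta: float
) -> float:
    """Weighted mix of structure and attribute distances.

    Each distance is divided by its embedding dimension so that the two
    terms are on a comparable scale; beta is a hypothetical weight.
    """
    d_struct = l1_distance(src[0], tgt[0]) / len(src[0])
    d_attr = l1_distance(src[1], tgt[1]) / len(src[1])
    return beta * d_struct + (1.0 - beta) * d_attr


def align(source: Embeddings, target: Embeddings, beta: float = 0.9) -> Dict[str, str]:
    """Map every source entity to its nearest target entity."""
    return {
        s_name: min(
            target,
            key=lambda t_name: combined_distance(s_emb, target[t_name], beta),
        )
        for s_name, s_emb in source.items()
    }


# Toy embeddings, invented purely for illustration.
toy_source: Embeddings = {
    "en:Paris": ([0.10, 0.90], [1.0]),
    "en:Berlin": ([0.80, 0.20], [0.0]),
}
toy_target: Embeddings = {
    "fr:Paris": ([0.15, 0.85], [0.9]),
    "fr:Berlin": ([0.75, 0.25], [0.1]),
}

toy_alignment = align(toy_source, toy_target)
```

In this toy setting each English entity is matched to the French entity with the closest embedding. A real system would typically return a ranked candidate list per entity rather than a single nearest neighbor.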