Yu-Chen Lin
2024
ACCEPT: Adaptive Codebook for Composite and Efficient Prompt Tuning
Yu-Chen Lin
|
Wei-Hua Li
|
Jun-cheng Chen
|
Chu-Song Chen
Findings of the Association for Computational Linguistics: EMNLP 2024
Prompt Tuning has been a popular Parameter-Efficient Fine-Tuning method attributed to its remarkable performance with few updated parameters on various large-scale pretrained Language Models (PLMs). Traditionally, each prompt has been considered indivisible and updated independently, leading the parameters increase proportionally as prompt length grows. To address this issue, we propose Adaptive Codebook for Composite and Efficient Prompt Tuning (ACCEPT). In our method, we refer to the concept of product quantization (PQ), allowing all soft prompts to share a set of learnable codebook vectors in each subspace, with each prompt differentiated by a set of adaptive weights. We achieve the superior performance on 17 diverse natural language tasks including natural language understanding (NLU) and question answering (QA) tasks by tuning only 0.3% of parameters of the PLMs. Our approach also excels in few-shot and large model settings, highlighting its significant potential.
2023
Linear Classifier: An Often-Forgotten Baseline for Text Classification
Yu-Chen Lin
|
Si-An Chen
|
Jie-Jyun Liu
|
Chih-Jen Lin
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers)
Large-scale pre-trained language models such as BERT are popular solutions for text classification. Due to the superior performance of these advanced methods, nowadays, people often directly train them for a few epochs and deploy the obtained model. In this opinion paper, we point out that this way may only sometimes get satisfactory results. We argue the importance of running a simple baseline like linear classifiers on bag-of-words features along with advanced methods. First, for many text data, linear methods show competitive performance, high efficiency, and robustness. Second, advanced models such as BERT may only achieve the best results if properly applied. Simple baselines help to confirm whether the results of advanced models are acceptable. Our experimental results fully support these points.
2019
基於深度學習之簡答題問答系統初步探討(A Preliminary Study on Deep Learning-based Short Answer Question Answering System)
Yu-Chen Lin
|
Yuan-Fu Liao
|
Matúš Pleva
|
Daniel Hládek
Proceedings of the 31st Conference on Computational Linguistics and Speech Processing (ROCLING 2019)
Search
Fix data
Co-authors
- Si-An Chen 1
- Jun-cheng Chen 1
- Chu-Song Chen 1
- Daniel Hládek 1
- Wei-Hua Li 1
- show all...