Li Shen
2023
Towards Making the Most of ChatGPT for Machine Translation
Keqin Peng
|
Liang Ding
|
Qihuang Zhong
|
Li Shen
|
Xuebo Liu
|
Min Zhang
|
Yuanxin Ouyang
|
Dacheng Tao
Findings of the Association for Computational Linguistics: EMNLP 2023
Zero-shot Sharpness-Aware Quantization for Pre-trained Language Models
Miaoxi Zhu
|
Qihuang Zhong
|
Li Shen
|
Liang Ding
|
Juhua Liu
|
Bo Du
|
Dacheng Tao
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing
Merging Experts into One: Improving Computational Efficiency of Mixture of Experts
Shwai He
|
Run-Ze Fan
|
Liang Ding
|
Li Shen
|
Tianyi Zhou
|
Dacheng Tao
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing
2022
On the Complementarity between Pre-Training and Random-Initialization for Resource-Rich Machine Translation
Changtong Zan
|
Liang Ding
|
Li Shen
|
Yu Cao
|
Weifeng Liu
|
Dacheng Tao
Proceedings of the 29th International Conference on Computational Linguistics
Improving Sharpness-Aware Minimization with Fisher Mask for Better Generalization on Language Models
Qihuang Zhong
|
Liang Ding
|
Li Shen
|
Peng Mi
|
Juhua Liu
|
Bo Du
|
Dacheng Tao
Findings of the Association for Computational Linguistics: EMNLP 2022
Co-authors
- Liang Ding 5
- Dacheng Tao 5
- Qihuang Zhong 3
- Juhua Liu 2
- Bo Du 2
- show all...