Shwai He
2023
Merging Experts into One: Improving Computational Efficiency of Mixture of Experts
Shwai He
|
Run-Ze Fan
|
Liang Ding
|
Li Shen
|
Tianyi Zhou
|
Dacheng Tao
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing
PAD-Net: An Efficient Framework for Dynamic Networks
Shwai He
|
Liang Ding
|
Daize Dong
|
Boan Liu
|
Fuqiang Yu
|
Dacheng Tao
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
2022
Vega-MT: The JD Explore Academy Machine Translation System for WMT22
Changtong Zan
|
Keqin Peng
|
Liang Ding
|
Baopu Qiu
|
Boan Liu
|
Shwai He
|
Qingyu Lu
|
Zheng Zhang
|
Chuang Liu
|
Weifeng Liu
|
Yibing Zhan
|
Dacheng Tao
Proceedings of the Seventh Conference on Machine Translation (WMT)
SparseAdapter: An Easy Approach for Improving the Parameter-Efficiency of Adapters
Shwai He
|
Liang Ding
|
Daize Dong
|
Jeremy Zhang
|
Dacheng Tao
Findings of the Association for Computational Linguistics: EMNLP 2022
Co-authors
- Liang Ding 4
- Dacheng Tao 4
- Boan Liu 2
- Daize Dong 2
- Run-Ze Fan 1
- show all...