Wei Xiong
2024
Mitigating the Alignment Tax of RLHF
Yong Lin | Hangyu Lin | Wei Xiong | Shizhe Diao | Jianmeng Liu | Jipeng Zhang | Rui Pan | Haoxiang Wang | Wenbin Hu | Hanning Zhang | Hanze Dong | Renjie Pi | Han Zhao | Nan Jiang | Heng Ji | Yuan Yao | Tong Zhang
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing
Interpretable Preferences via Multi-Objective Reward Modeling and Mixture-of-Experts
Haoxiang Wang | Wei Xiong | Tengyang Xie | Han Zhao | Tong Zhang
Findings of the Association for Computational Linguistics: EMNLP 2024
LMFlow: An Extensible Toolkit for Finetuning and Inference of Large Foundation Models
Shizhe Diao | Rui Pan | Hanze Dong | KaShun Shum | Jipeng Zhang | Wei Xiong | Tong Zhang
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 3: System Demonstrations)
Arithmetic Control of LLMs for Diverse User Preferences: Directional Preference Alignment with Multi-Objective Rewards
Haoxiang Wang | Yong Lin | Wei Xiong | Rui Yang | Shizhe Diao | Shuang Qiu | Han Zhao | Tong Zhang
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
2022
ZhichunRoad at SemEval-2022 Task 2: Adversarial Training and Contrastive Learning for Multiword Representations
Xuange Cui | Wei Xiong | Songlin Wang
Proceedings of the 16th International Workshop on Semantic Evaluation (SemEval-2022)
Co-authors
- Tong Zhang 4
- Shizhe Diao 3
- Haoxiang Wang 3
- Han Zhao 3
- Yong Lin 2