Ruoxi Jia
2024
Skin-in-the-Game: Decision Making via Multi-Stakeholder Alignment in LLMs
Bilgehan Sel
|
Priya Shanmugasundaram
|
Mohammad Kachuee
|
Kun Zhou
|
Ruoxi Jia
|
Ming Jin
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
How Johnny Can Persuade LLMs to Jailbreak Them: Rethinking Persuasion to Challenge AI Safety by Humanizing LLMs
Yi Zeng
|
Hongpeng Lin
|
Jingwen Zhang
|
Diyi Yang
|
Ruoxi Jia
|
Weiyan Shi
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
2022
Selective Differential Privacy for Language Modeling
Weiyan Shi
|
Aiqi Cui
|
Evan Li
|
Ruoxi Jia
|
Zhou Yu
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
Just Fine-tune Twice: Selective Differential Privacy for Large Language Models
Weiyan Shi
|
Ryan Shea
|
Si Chen
|
Chiyuan Zhang
|
Ruoxi Jia
|
Zhou Yu
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing