Private Language Models via Truncated Laplacian Mechanism

Tianhao Huang, Tao Yang, Ivan Habernal, Lijie Hu, Di Wang


Abstract
Recently, it has been shown that deep learning models for NLP tasks are prone to attacks that can even reconstruct verbatim training texts. To prevent privacy leakage, researchers have investigated word-level perturbations, relying on the formal guarantees of differential privacy (DP) in the embedding space. However, many existing approaches either achieve unsatisfactory performance in the high privacy regime when using the Laplacian or Gaussian mechanism, or resort to weaker relaxations of DP that are inferior to the canonical DP in terms of privacy strength. This raises the question of whether a new method for private word embedding can be designed to overcome these limitations. In this paper, we propose a novel private embedding method called the high-dimensional truncated Laplacian mechanism. Specifically, we introduce a non-trivial extension of the truncated Laplacian mechanism, which had previously been investigated only in the one-dimensional case. Theoretically, we show that our method has a lower variance compared to previous private word embedding methods. To further validate its effectiveness, we conduct comprehensive experiments on private embedding and downstream tasks using three datasets. Remarkably, even in the high privacy regime, our approach incurs only a slight decrease in utility compared to the non-private scenario.
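The abstract describes perturbing word embeddings with noise drawn from a truncated Laplacian distribution. As a rough illustration only, and not the paper's exact mechanism, the sketch below draws zero-mean Laplace noise truncated to [-bound, bound] via inverse-CDF sampling and adds it coordinate-wise to an embedding. The function names and the example values of scale and bound are hypothetical placeholders; in the paper these parameters are calibrated to the privacy budget and the sensitivity of the embedding map.

```python
import numpy as np

def sample_truncated_laplace(scale, bound, size, rng):
    """Inverse-CDF sampling from a zero-mean Laplace(scale) density
    truncated to [-bound, bound]: draw the magnitude from the
    truncated exponential, then attach a random sign."""
    u = rng.uniform(size=size)
    # CDF of |X| on [0, bound] is (1 - exp(-x/scale)) / (1 - exp(-bound/scale)),
    # so inverting it gives the magnitude below.
    magnitude = -scale * np.log1p(-u * (1.0 - np.exp(-bound / scale)))
    sign = rng.choice(np.array([-1.0, 1.0]), size=size)
    return sign * magnitude

def privatize_embedding(embedding, scale, bound, rng=None):
    """Perturb each coordinate of a word embedding with independent
    truncated Laplacian noise (illustrative sketch; the paper's
    calibration of scale and bound to (epsilon, delta) differs)."""
    rng = rng or np.random.default_rng()
    noise = sample_truncated_laplace(scale, bound, embedding.shape, rng)
    return embedding + noise

# Example: perturb a toy 300-dimensional embedding with placeholder parameters.
vec = np.random.default_rng(0).normal(size=300)
private_vec = privatize_embedding(vec, scale=0.5, bound=2.0)
```

Because the noise support is bounded, each coordinate's noise variance is strictly smaller than that of an untruncated Laplace with the same scale, which is the intuition behind the abstract's variance claim.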
Anthology ID:
2024.emnlp-main.231
Volume:
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing
Month:
November
Year:
2024
Address:
Miami, Florida, USA
Editors:
Yaser Al-Onaizan, Mohit Bansal, Yun-Nung Chen
Venue:
EMNLP
Publisher:
Association for Computational Linguistics
Pages:
3980–3993
URL:
https://aclanthology.org/2024.emnlp-main.231
Cite (ACL):
Tianhao Huang, Tao Yang, Ivan Habernal, Lijie Hu, and Di Wang. 2024. Private Language Models via Truncated Laplacian Mechanism. In Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, pages 3980–3993, Miami, Florida, USA. Association for Computational Linguistics.
Cite (Informal):
Private Language Models via Truncated Laplacian Mechanism (Huang et al., EMNLP 2024)
PDF:
https://aclanthology.org/2024.emnlp-main.231.pdf