Privacy Ripple Effects from Adding or Removing Personal Information in Language Model Training

Jaydeep Borkar; Matthew Jagielski; Katherine Lee; Niloofar Mireshghallah; David A. Smith; Christopher A. Choquette-Choo

doi:10.18653/v1/2025.findings-acl.959

Privacy Ripple Effects from Adding or Removing Personal Information in Language Model Training

Jaydeep Borkar, Matthew Jagielski, Katherine Lee, Niloofar Mireshghallah, David A. Smith, Christopher A. Choquette-Choo

Abstract

Due to the sensitive nature of personally identifiable information (PII), its owners may have the authority to control its inclusion or request its removal from large-language model (LLM) training. Beyond this, PII may be added or removed from training datasets due to evolving dataset curation techniques, because they were newly scraped for retraining, or because they were included in a new downstream fine-tuning stage. We find that the amount and ease of PII memorization is a dynamic property of a model that evolves throughout training pipelines and depends on commonly altered design choices. We characterize three such novel phenomena: (1) similar-appearing PII seen later in training can elicit memorization of earlier-seen sequences in what we call assisted memorization, and this is a significant factor (in our settings, up to 1/3); (2) adding PII can increase memorization of other PII; and (3) removing PII can lead to other PII being memorized.

Anthology ID:: 2025.findings-acl.959
Volume:: Findings of the Association for Computational Linguistics: ACL 2025
Month:: July
Year:: 2025
Address:: Vienna, Austria
Editors:: Wanxiang Che, Joyce Nabende, Ekaterina Shutova, Mohammad Taher Pilehvar
Venue:: Findings
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 18703–18726
Language:
URL:: https://aclanthology.org/2025.findings-acl.959/
DOI:: 10.18653/v1/2025.findings-acl.959
Bibkey:
Cite (ACL):: Jaydeep Borkar, Matthew Jagielski, Katherine Lee, Niloofar Mireshghallah, David A. Smith, and Christopher A. Choquette-Choo. 2025. Privacy Ripple Effects from Adding or Removing Personal Information in Language Model Training. In Findings of the Association for Computational Linguistics: ACL 2025, pages 18703–18726, Vienna, Austria. Association for Computational Linguistics.
Cite (Informal):: Privacy Ripple Effects from Adding or Removing Personal Information in Language Model Training (Borkar et al., Findings 2025)
Copy Citation:
PDF:: https://aclanthology.org/2025.findings-acl.959.pdf

PDF Cite Search Fix data