@inproceedings{pasch-cha-2025-balancing,
title = "Balancing Privacy and Utility in Personal {LLM} Writing Tasks: An Automated Pipeline for Evaluating Anonymizations",
author = "Pasch, Stefan and
Cha, Min Chul",
editor = "Habernal, Ivan and
Ghanavati, Sepideh and
Jain, Vijayanta and
Igamberdiev, Timour and
Wilson, Shomir",
booktitle = "Proceedings of the Sixth Workshop on Privacy in Natural Language Processing",
month = apr,
year = "2025",
address = "Albuquerque, New Mexico",
publisher = "Association for Computational Linguistics",
url = "https://aclanthology.org/2025.privatenlp-main.3/",
doi = "10.18653/v1/2025.privatenlp-main.3",
pages = "32--41",
ISBN = "979-8-89176-246-6",
abstract = "Large language models (LLMs) are widely used for personalized tasks involving sensitive information, raising privacy concerns. While anonymization techniques exist, their impact on response quality remains underexplored. This paper introduces a fully automated evaluation framework to assess anonymization strategies in LLM-generated responses. We generate synthetic prompts for three personal tasks{---}personal introductions, cover letters, and email writing{---}and apply anonymization techniques that preserve fluency while enabling entity backmapping. We test three anonymization strategies: simple masking, adding context to masked entities, and pseudonymization. Results show minimal response quality loss (roughly 1 point on a 10-point scale) while achieving 97{\%}-99{\%} entity masking. Responses generated with Llama 3.3:70b perform best with simple entity masking, while GPT-4o benefits from contextual cues. This study provides a framework and empirical insights into balancing privacy protection and response quality in LLM applications."
}
<?xml version="1.0" encoding="UTF-8"?>
<modsCollection xmlns="http://www.loc.gov/mods/v3">
<mods ID="pasch-cha-2025-balancing">
    <titleInfo>
        <title>Balancing Privacy and Utility in Personal LLM Writing Tasks: An Automated Pipeline for Evaluating Anonymizations</title>
    </titleInfo>
    <name type="personal">
        <namePart type="given">Stefan</namePart>
        <namePart type="family">Pasch</namePart>
        <role>
            <roleTerm authority="marcrelator" type="text">author</roleTerm>
        </role>
    </name>
    <name type="personal">
        <namePart type="given">Min</namePart>
        <namePart type="given">Chul</namePart>
        <namePart type="family">Cha</namePart>
        <role>
            <roleTerm authority="marcrelator" type="text">author</roleTerm>
        </role>
    </name>
    <originInfo>
        <dateIssued>2025-04</dateIssued>
    </originInfo>
    <typeOfResource>text</typeOfResource>
    <relatedItem type="host">
        <titleInfo>
            <title>Proceedings of the Sixth Workshop on Privacy in Natural Language Processing</title>
        </titleInfo>
        <name type="personal">
            <namePart type="given">Ivan</namePart>
            <namePart type="family">Habernal</namePart>
            <role>
                <roleTerm authority="marcrelator" type="text">editor</roleTerm>
            </role>
        </name>
        <name type="personal">
            <namePart type="given">Sepideh</namePart>
            <namePart type="family">Ghanavati</namePart>
            <role>
                <roleTerm authority="marcrelator" type="text">editor</roleTerm>
            </role>
        </name>
        <name type="personal">
            <namePart type="given">Vijayanta</namePart>
            <namePart type="family">Jain</namePart>
            <role>
                <roleTerm authority="marcrelator" type="text">editor</roleTerm>
            </role>
        </name>
        <name type="personal">
            <namePart type="given">Timour</namePart>
            <namePart type="family">Igamberdiev</namePart>
            <role>
                <roleTerm authority="marcrelator" type="text">editor</roleTerm>
            </role>
        </name>
        <name type="personal">
            <namePart type="given">Shomir</namePart>
            <namePart type="family">Wilson</namePart>
            <role>
                <roleTerm authority="marcrelator" type="text">editor</roleTerm>
            </role>
        </name>
        <originInfo>
            <publisher>Association for Computational Linguistics</publisher>
            <place>
                <placeTerm type="text">Albuquerque, New Mexico</placeTerm>
            </place>
        </originInfo>
        <genre authority="marcgt">conference publication</genre>
        <identifier type="isbn">979-8-89176-246-6</identifier>
    </relatedItem>
    <abstract>Large language models (LLMs) are widely used for personalized tasks involving sensitive information, raising privacy concerns. While anonymization techniques exist, their impact on response quality remains underexplored. This paper introduces a fully automated evaluation framework to assess anonymization strategies in LLM-generated responses. We generate synthetic prompts for three personal tasks—personal introductions, cover letters, and email writing—and apply anonymization techniques that preserve fluency while enabling entity backmapping. We test three anonymization strategies: simple masking, adding context to masked entities, and pseudonymization. Results show minimal response quality loss (roughly 1 point on a 10-point scale) while achieving 97%-99% entity masking. Responses generated with Llama 3.3:70b perform best with simple entity masking, while GPT-4o benefits from contextual cues. This study provides a framework and empirical insights into balancing privacy protection and response quality in LLM applications.</abstract>
    <identifier type="citekey">pasch-cha-2025-balancing</identifier>
    <identifier type="doi">10.18653/v1/2025.privatenlp-main.3</identifier>
    <location>
        <url>https://aclanthology.org/2025.privatenlp-main.3/</url>
    </location>
    <part>
        <date>2025-04</date>
        <extent unit="page">
            <start>32</start>
            <end>41</end>
        </extent>
    </part>
</mods>
</modsCollection>
%0 Conference Proceedings
%T Balancing Privacy and Utility in Personal LLM Writing Tasks: An Automated Pipeline for Evaluating Anonymizations
%A Pasch, Stefan
%A Cha, Min Chul
%Y Habernal, Ivan
%Y Ghanavati, Sepideh
%Y Jain, Vijayanta
%Y Igamberdiev, Timour
%Y Wilson, Shomir
%S Proceedings of the Sixth Workshop on Privacy in Natural Language Processing
%D 2025
%8 April
%I Association for Computational Linguistics
%C Albuquerque, New Mexico
%@ 979-8-89176-246-6
%F pasch-cha-2025-balancing
%X Large language models (LLMs) are widely used for personalized tasks involving sensitive information, raising privacy concerns. While anonymization techniques exist, their impact on response quality remains underexplored. This paper introduces a fully automated evaluation framework to assess anonymization strategies in LLM-generated responses. We generate synthetic prompts for three personal tasks—personal introductions, cover letters, and email writing—and apply anonymization techniques that preserve fluency while enabling entity backmapping. We test three anonymization strategies: simple masking, adding context to masked entities, and pseudonymization. Results show minimal response quality loss (roughly 1 point on a 10-point scale) while achieving 97%-99% entity masking. Responses generated with Llama 3.3:70b perform best with simple entity masking, while GPT-4o benefits from contextual cues. This study provides a framework and empirical insights into balancing privacy protection and response quality in LLM applications.
%R 10.18653/v1/2025.privatenlp-main.3
%U https://aclanthology.org/2025.privatenlp-main.3/
%U https://doi.org/10.18653/v1/2025.privatenlp-main.3
%P 32-41
Markdown (Informal)
[Balancing Privacy and Utility in Personal LLM Writing Tasks: An Automated Pipeline for Evaluating Anonymizations](https://aclanthology.org/2025.privatenlp-main.3/) (Pasch & Cha, PrivateNLP 2025)
ACL
Stefan Pasch and Min Chul Cha. 2025. [Balancing Privacy and Utility in Personal LLM Writing Tasks: An Automated Pipeline for Evaluating Anonymizations](https://aclanthology.org/2025.privatenlp-main.3/). In *Proceedings of the Sixth Workshop on Privacy in Natural Language Processing*, pages 32–41, Albuquerque, New Mexico. Association for Computational Linguistics.
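
The abstract describes anonymization "that preserves fluency while enabling entity backmapping": entities in the prompt are replaced before the LLM sees it, and the placeholders in the response are mapped back afterwards. Below is a minimal illustrative sketch of that mask → generate → backmap flow, assuming a pseudonymization strategy; all function names, the entity map, and the example strings are hypothetical and not taken from the paper, which uses LLM-based entity detection and automated quality scoring rather than a hand-supplied map.

```python
# Minimal sketch of the mask -> generate -> backmap idea from the abstract.
# A real pipeline would detect entities automatically and guard against
# placeholder collisions; this only shows the reversible substitution step.

def pseudonymize(text: str, entities: dict[str, str]) -> tuple[str, dict[str, str]]:
    """Replace each real entity with a fluent pseudonym; return the reverse map."""
    reverse_map = {}
    for real, fake in entities.items():
        text = text.replace(real, fake)
        reverse_map[fake] = real
    return text, reverse_map

def backmap(response: str, reverse_map: dict[str, str]) -> str:
    """Restore the original entities in the LLM's response."""
    for fake, real in reverse_map.items():
        response = response.replace(fake, real)
    return response

if __name__ == "__main__":
    prompt = "Write a cover letter for Jane Doe applying to Acme Corp."
    masked, rmap = pseudonymize(prompt, {"Jane Doe": "Alex Smith",
                                         "Acme Corp": "Globex Inc"})
    # The masked prompt is what gets sent to the LLM; its response is
    # then backmapped so the user sees their own entities again.
    llm_response = "Dear Hiring Manager, my name is Alex Smith and I admire Globex Inc."
    print(backmap(llm_response, rmap))
```

Because the pseudonyms are fluent stand-ins rather than opaque tokens, the LLM's output stays natural, which is the property the paper's quality evaluation measures against the roughly 1-point loss it reports.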