Brian Ulicny
2024
Improving Authorship Privacy: Adaptive Obfuscation with the Dynamic Selection of Techniques
Hemanth Kandula
|
Damianos Karakos
|
Haoling Qiu
|
Brian Ulicny
Proceedings of the Fifth Workshop on Privacy in Natural Language Processing
Authorship obfuscation, the task of rewriting text to protect the original author’s identity, is becoming increasingly important due to the rise of advanced NLP tools for authorship attribution techniques. Traditional methods for authorship obfuscation face significant challenges in balancing content preservation, fluency, and style concealment. This paper introduces a novel approach, the Obfuscation Strategy Optimizer (OSO), which dynamically selects the optimal obfuscation technique based on a combination of metrics including embedding distance, meaning similarity, and fluency. By leveraging an ensemble of language models OSO achieves superior performance in preserving the original content’s meaning and grammatical fluency while effectively concealing the author’s unique writing style. Experimental results demonstrate that the OSO outperforms existing methods and approaches the performance of larger language models. Our evaluation framework incorporates adversarial testing against state-of-the-art attribution systems to validate the robustness of the obfuscation techniques. We release our code publicly at https://github.com/BBN-E/ObfuscationStrategyOptimizer
2006
Lycos Retriever: An Information Fusion Engine
Brian Ulicny
Proceedings of the Human Language Technology Conference of the NAACL, Companion Volume: Short Papers
1997
Book Review: Corpus Processing for Lexical Acquisition
Brian Ulicny
Computational Linguistics, Volume 23, Number 1, March 1997
Search