Protecting Privacy in Classifiers by Token Manipulation

Re’em Harel; Yair Elboher; Yuval Pinter

Protecting Privacy in Classifiers by Token Manipulation

Correct Metadata for

Use this form to create a GitHub issue with structured data describing the correction. You will need a GitHub account. Once you create that issue, the correction will be reviewed by a staff member.

⚠️ Mobile Users: Submitting this form to create a new issue will only work with github.com, not the GitHub Mobile app.

Important: The Anthology treat PDFs as authoritative. Please use this form only to correct data that is out of line with the PDF. See our corrections guidelines if you need to change the PDF.

Title Adjust the title. Retain tags such as <fixed-case>.

Authors Adjust author names and order to match the PDF.

Abstract Correct abstract if needed. Retain XML formatting tags such as <tex-math>. You may use <b>...</b> for bold, <i>...</i> for italic, and <url>...</url> for URLs.

Verification against PDF Ensure that the new title/authors match the snapshot below. (If there is no snapshot or it is too small, consult the PDF.)

Authors concatenated from the text boxes above:

ALL author names match the snapshot above—including middle initials, hyphens, and accents.

Abstract

Using language models as a remote service entails sending private information to an untrusted provider. In addition, potential eavesdroppers can intercept the messages, thereby exposing the information. In this work, we explore the prospects of avoiding such data exposure at the level of text manipulation. We focus on text classification models, examining various token mapping and contextualized manipulation functions in order to see whether classifier accuracy may be maintained while keeping the original text unrecoverable. We find that although some token mapping functions are easy and straightforward to implement, they heavily influence performance on the downstream task, and via a sophisticated attacker can be reconstructed. In comparison, the contextualized manipulation provides an improvement in performance.

Anthology ID:: 2024.privatenlp-1.4
Volume:: Proceedings of the Fifth Workshop on Privacy in Natural Language Processing
Month:: August
Year:: 2024
Address:: Bangkok, Thailand
Editors:: Ivan Habernal, Sepideh Ghanavati, Abhilasha Ravichander, Vijayanta Jain, Patricia Thaine, Timour Igamberdiev, Niloofar Mireshghallah, Oluwaseyi Feyisetan
Venues:: PrivateNLP | WS
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 29–38
Language:
URL:: https://aclanthology.org/2024.privatenlp-1.4/
DOI:
Bibkey:
Cite (ACL):: Re’em Harel, Yair Elboher, and Yuval Pinter. 2024. Protecting Privacy in Classifiers by Token Manipulation. In Proceedings of the Fifth Workshop on Privacy in Natural Language Processing, pages 29–38, Bangkok, Thailand. Association for Computational Linguistics.
Cite (Informal):: Protecting Privacy in Classifiers by Token Manipulation (Harel et al., PrivateNLP 2024)
Copy Citation:
PDF:: https://aclanthology.org/2024.privatenlp-1.4.pdf

PDF Cite Search Fix data

Export citation

BibTeX
MODS XML
Endnote
Preformatted

@inproceedings{harel-etal-2024-protecting,
    title = "Protecting Privacy in Classifiers by Token Manipulation",
    author = "Harel, Re{'}em  and
      Elboher, Yair  and
      Pinter, Yuval",
    editor = "Habernal, Ivan  and
      Ghanavati, Sepideh  and
      Ravichander, Abhilasha  and
      Jain, Vijayanta  and
      Thaine, Patricia  and
      Igamberdiev, Timour  and
      Mireshghallah, Niloofar  and
      Feyisetan, Oluwaseyi",
    booktitle = "Proceedings of the Fifth Workshop on Privacy in Natural Language Processing",
    month = aug,
    year = "2024",
    address = "Bangkok, Thailand",
    publisher = "Association for Computational Linguistics",
    url = "https://aclanthology.org/2024.privatenlp-1.4/",
    pages = "29--38",
    abstract = "Using language models as a remote service entails sending private information to an untrusted provider. In addition, potential eavesdroppers can intercept the messages, thereby exposing the information. In this work, we explore the prospects of avoiding such data exposure at the level of text manipulation. We focus on text classification models, examining various token mapping and contextualized manipulation functions in order to see whether classifier accuracy may be maintained while keeping the original text unrecoverable. We find that although some token mapping functions are easy and straightforward to implement, they heavily influence performance on the downstream task, and via a sophisticated attacker can be reconstructed. In comparison, the contextualized manipulation provides an improvement in performance."
}

Download as File

<?xml version="1.0" encoding="UTF-8"?>
<modsCollection xmlns="http://www.loc.gov/mods/v3">
<mods ID="harel-etal-2024-protecting">
    <titleInfo>
        <title>Protecting Privacy in Classifiers by Token Manipulation</title>
    </titleInfo>
    <name type="personal">
        <namePart type="given">Re’em</namePart>
        <namePart type="family">Harel</namePart>
        <role>
            <roleTerm authority="marcrelator" type="text">author</roleTerm>
        </role>
    </name>
    <name type="personal">
        <namePart type="given">Yair</namePart>
        <namePart type="family">Elboher</namePart>
        <role>
            <roleTerm authority="marcrelator" type="text">author</roleTerm>
        </role>
    </name>
    <name type="personal">
        <namePart type="given">Yuval</namePart>
        <namePart type="family">Pinter</namePart>
        <role>
            <roleTerm authority="marcrelator" type="text">author</roleTerm>
        </role>
    </name>
    <originInfo>
        <dateIssued>2024-08</dateIssued>
    </originInfo>
    <typeOfResource>text</typeOfResource>
    <relatedItem type="host">
        <titleInfo>
            <title>Proceedings of the Fifth Workshop on Privacy in Natural Language Processing</title>
        </titleInfo>
        <name type="personal">
            <namePart type="given">Ivan</namePart>
            <namePart type="family">Habernal</namePart>
            <role>
                <roleTerm authority="marcrelator" type="text">editor</roleTerm>
            </role>
        </name>
        <name type="personal">
            <namePart type="given">Sepideh</namePart>
            <namePart type="family">Ghanavati</namePart>
            <role>
                <roleTerm authority="marcrelator" type="text">editor</roleTerm>
            </role>
        </name>
        <name type="personal">
            <namePart type="given">Abhilasha</namePart>
            <namePart type="family">Ravichander</namePart>
            <role>
                <roleTerm authority="marcrelator" type="text">editor</roleTerm>
            </role>
        </name>
        <name type="personal">
            <namePart type="given">Vijayanta</namePart>
            <namePart type="family">Jain</namePart>
            <role>
                <roleTerm authority="marcrelator" type="text">editor</roleTerm>
            </role>
        </name>
        <name type="personal">
            <namePart type="given">Patricia</namePart>
            <namePart type="family">Thaine</namePart>
            <role>
                <roleTerm authority="marcrelator" type="text">editor</roleTerm>
            </role>
        </name>
        <name type="personal">
            <namePart type="given">Timour</namePart>
            <namePart type="family">Igamberdiev</namePart>
            <role>
                <roleTerm authority="marcrelator" type="text">editor</roleTerm>
            </role>
        </name>
        <name type="personal">
            <namePart type="given">Niloofar</namePart>
            <namePart type="family">Mireshghallah</namePart>
            <role>
                <roleTerm authority="marcrelator" type="text">editor</roleTerm>
            </role>
        </name>
        <name type="personal">
            <namePart type="given">Oluwaseyi</namePart>
            <namePart type="family">Feyisetan</namePart>
            <role>
                <roleTerm authority="marcrelator" type="text">editor</roleTerm>
            </role>
        </name>
        <originInfo>
            <publisher>Association for Computational Linguistics</publisher>
            <place>
                <placeTerm type="text">Bangkok, Thailand</placeTerm>
            </place>
        </originInfo>
        <genre authority="marcgt">conference publication</genre>
    </relatedItem>
    <abstract>Using language models as a remote service entails sending private information to an untrusted provider. In addition, potential eavesdroppers can intercept the messages, thereby exposing the information. In this work, we explore the prospects of avoiding such data exposure at the level of text manipulation. We focus on text classification models, examining various token mapping and contextualized manipulation functions in order to see whether classifier accuracy may be maintained while keeping the original text unrecoverable. We find that although some token mapping functions are easy and straightforward to implement, they heavily influence performance on the downstream task, and via a sophisticated attacker can be reconstructed. In comparison, the contextualized manipulation provides an improvement in performance.</abstract>
    <identifier type="citekey">harel-etal-2024-protecting</identifier>
    <location>
        <url>https://aclanthology.org/2024.privatenlp-1.4/</url>
    </location>
    <part>
        <date>2024-08</date>
        <extent unit="page">
            <start>29</start>
            <end>38</end>
        </extent>
    </part>
</mods>
</modsCollection>

Download as File

%0 Conference Proceedings
%T Protecting Privacy in Classifiers by Token Manipulation
%A Harel, Re’em
%A Elboher, Yair
%A Pinter, Yuval
%Y Habernal, Ivan
%Y Ghanavati, Sepideh
%Y Ravichander, Abhilasha
%Y Jain, Vijayanta
%Y Thaine, Patricia
%Y Igamberdiev, Timour
%Y Mireshghallah, Niloofar
%Y Feyisetan, Oluwaseyi
%S Proceedings of the Fifth Workshop on Privacy in Natural Language Processing
%D 2024
%8 August
%I Association for Computational Linguistics
%C Bangkok, Thailand
%F harel-etal-2024-protecting
%X Using language models as a remote service entails sending private information to an untrusted provider. In addition, potential eavesdroppers can intercept the messages, thereby exposing the information. In this work, we explore the prospects of avoiding such data exposure at the level of text manipulation. We focus on text classification models, examining various token mapping and contextualized manipulation functions in order to see whether classifier accuracy may be maintained while keeping the original text unrecoverable. We find that although some token mapping functions are easy and straightforward to implement, they heavily influence performance on the downstream task, and via a sophisticated attacker can be reconstructed. In comparison, the contextualized manipulation provides an improvement in performance.
%U https://aclanthology.org/2024.privatenlp-1.4/
%P 29-38

Download as File

Markdown (Informal)

[Protecting Privacy in Classifiers by Token Manipulation](https://aclanthology.org/2024.privatenlp-1.4/) (Harel et al., PrivateNLP 2024)

Protecting Privacy in Classifiers by Token Manipulation (Harel et al., PrivateNLP 2024)

ACL

Re’em Harel, Yair Elboher, and Yuval Pinter. 2024. Protecting Privacy in Classifiers by Token Manipulation. In Proceedings of the Fifth Workshop on Privacy in Natural Language Processing, pages 29–38, Bangkok, Thailand. Association for Computational Linguistics.