Evaluating and Explaining Natural Language Generation with GenX

Kayla Duskin; Shivam Sharma; Ji Young Yun; Emily Saldanha; Dustin Arendt

doi:10.18653/v1/2021.dash-1.12

Evaluating and Explaining Natural Language Generation with GenX

Kayla Duskin, Shivam Sharma, Ji Young Yun, Emily Saldanha, Dustin Arendt

Correct Metadata for

Use this form to create a GitHub issue with structured data describing the correction. You will need a GitHub account. Once you create that issue, the correction will be reviewed by a staff member.

⚠️ Mobile Users: Submitting this form to create a new issue will only work with github.com, not the GitHub Mobile app.

Important: The Anthology treat PDFs as authoritative. Please use this form only to correct data that is out of line with the PDF. See our corrections guidelines if you need to change the PDF.

Title Adjust the title. Retain tags such as <fixed-case>.

Authors Adjust author names and order to match the PDF.

Abstract Correct abstract if needed. Retain XML formatting tags such as <tex-math>. You may use <b>...</b> for bold, <i>...</i> for italic, and <url>...</url> for URLs.

Verification against PDF Ensure that the new title/authors match the snapshot below. (If there is no snapshot or it is too small, consult the PDF.)

Authors concatenated from the text boxes above:

ALL author names match the snapshot above—including middle initials, hyphens, and accents.

Abstract

Current methods for evaluation of natural language generation models focus on measuring text quality but fail to probe the model creativity, i.e., its ability to generate novel but coherent text sequences not seen in the training corpus. We present the GenX tool which is designed to enable interactive exploration and explanation of natural language generation outputs with a focus on the detection of memorization. We demonstrate the utility of the tool on two domain-conditioned generation use cases - phishing emails and ACL abstracts.

Anthology ID:: 2021.dash-1.12
Volume:: Proceedings of the Second Workshop on Data Science with Human in the Loop: Language Advances
Month:: June
Year:: 2021
Address:: Online
Editors:: Eduard Dragut, Yunyao Li, Lucian Popa, Slobodan Vucetic
Venue:: DaSH
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 70–78
Language:
URL:: https://aclanthology.org/2021.dash-1.12/
DOI:: 10.18653/v1/2021.dash-1.12
Bibkey:
Cite (ACL):: Kayla Duskin, Shivam Sharma, Ji Young Yun, Emily Saldanha, and Dustin Arendt. 2021. Evaluating and Explaining Natural Language Generation with GenX. In Proceedings of the Second Workshop on Data Science with Human in the Loop: Language Advances, pages 70–78, Online. Association for Computational Linguistics.
Cite (Informal):: Evaluating and Explaining Natural Language Generation with GenX (Duskin et al., DaSH 2021)
Copy Citation:
PDF:: https://aclanthology.org/2021.dash-1.12.pdf

PDF Cite Search Fix data

Export citation

BibTeX
MODS XML
Endnote
Preformatted

@inproceedings{duskin-etal-2021-evaluating,
    title = "Evaluating and Explaining Natural Language Generation with {G}en{X}",
    author = "Duskin, Kayla  and
      Sharma, Shivam  and
      Yun, Ji Young  and
      Saldanha, Emily  and
      Arendt, Dustin",
    editor = "Dragut, Eduard  and
      Li, Yunyao  and
      Popa, Lucian  and
      Vucetic, Slobodan",
    booktitle = "Proceedings of the Second Workshop on Data Science with Human in the Loop: Language Advances",
    month = jun,
    year = "2021",
    address = "Online",
    publisher = "Association for Computational Linguistics",
    url = "https://aclanthology.org/2021.dash-1.12/",
    doi = "10.18653/v1/2021.dash-1.12",
    pages = "70--78",
    abstract = "Current methods for evaluation of natural language generation models focus on measuring text quality but fail to probe the model creativity, i.e., its ability to generate novel but coherent text sequences not seen in the training corpus. We present the GenX tool which is designed to enable interactive exploration and explanation of natural language generation outputs with a focus on the detection of memorization. We demonstrate the utility of the tool on two domain-conditioned generation use cases - phishing emails and ACL abstracts."
}

Download as File

<?xml version="1.0" encoding="UTF-8"?>
<modsCollection xmlns="http://www.loc.gov/mods/v3">
<mods ID="duskin-etal-2021-evaluating">
    <titleInfo>
        <title>Evaluating and Explaining Natural Language Generation with GenX</title>
    </titleInfo>
    <name type="personal">
        <namePart type="given">Kayla</namePart>
        <namePart type="family">Duskin</namePart>
        <role>
            <roleTerm authority="marcrelator" type="text">author</roleTerm>
        </role>
    </name>
    <name type="personal">
        <namePart type="given">Shivam</namePart>
        <namePart type="family">Sharma</namePart>
        <role>
            <roleTerm authority="marcrelator" type="text">author</roleTerm>
        </role>
    </name>
    <name type="personal">
        <namePart type="given">Ji</namePart>
        <namePart type="given">Young</namePart>
        <namePart type="family">Yun</namePart>
        <role>
            <roleTerm authority="marcrelator" type="text">author</roleTerm>
        </role>
    </name>
    <name type="personal">
        <namePart type="given">Emily</namePart>
        <namePart type="family">Saldanha</namePart>
        <role>
            <roleTerm authority="marcrelator" type="text">author</roleTerm>
        </role>
    </name>
    <name type="personal">
        <namePart type="given">Dustin</namePart>
        <namePart type="family">Arendt</namePart>
        <role>
            <roleTerm authority="marcrelator" type="text">author</roleTerm>
        </role>
    </name>
    <originInfo>
        <dateIssued>2021-06</dateIssued>
    </originInfo>
    <typeOfResource>text</typeOfResource>
    <relatedItem type="host">
        <titleInfo>
            <title>Proceedings of the Second Workshop on Data Science with Human in the Loop: Language Advances</title>
        </titleInfo>
        <name type="personal">
            <namePart type="given">Eduard</namePart>
            <namePart type="family">Dragut</namePart>
            <role>
                <roleTerm authority="marcrelator" type="text">editor</roleTerm>
            </role>
        </name>
        <name type="personal">
            <namePart type="given">Yunyao</namePart>
            <namePart type="family">Li</namePart>
            <role>
                <roleTerm authority="marcrelator" type="text">editor</roleTerm>
            </role>
        </name>
        <name type="personal">
            <namePart type="given">Lucian</namePart>
            <namePart type="family">Popa</namePart>
            <role>
                <roleTerm authority="marcrelator" type="text">editor</roleTerm>
            </role>
        </name>
        <name type="personal">
            <namePart type="given">Slobodan</namePart>
            <namePart type="family">Vucetic</namePart>
            <role>
                <roleTerm authority="marcrelator" type="text">editor</roleTerm>
            </role>
        </name>
        <originInfo>
            <publisher>Association for Computational Linguistics</publisher>
            <place>
                <placeTerm type="text">Online</placeTerm>
            </place>
        </originInfo>
        <genre authority="marcgt">conference publication</genre>
    </relatedItem>
    <abstract>Current methods for evaluation of natural language generation models focus on measuring text quality but fail to probe the model creativity, i.e., its ability to generate novel but coherent text sequences not seen in the training corpus. We present the GenX tool which is designed to enable interactive exploration and explanation of natural language generation outputs with a focus on the detection of memorization. We demonstrate the utility of the tool on two domain-conditioned generation use cases - phishing emails and ACL abstracts.</abstract>
    <identifier type="citekey">duskin-etal-2021-evaluating</identifier>
    <identifier type="doi">10.18653/v1/2021.dash-1.12</identifier>
    <location>
        <url>https://aclanthology.org/2021.dash-1.12/</url>
    </location>
    <part>
        <date>2021-06</date>
        <extent unit="page">
            <start>70</start>
            <end>78</end>
        </extent>
    </part>
</mods>
</modsCollection>

Download as File

%0 Conference Proceedings
%T Evaluating and Explaining Natural Language Generation with GenX
%A Duskin, Kayla
%A Sharma, Shivam
%A Yun, Ji Young
%A Saldanha, Emily
%A Arendt, Dustin
%Y Dragut, Eduard
%Y Li, Yunyao
%Y Popa, Lucian
%Y Vucetic, Slobodan
%S Proceedings of the Second Workshop on Data Science with Human in the Loop: Language Advances
%D 2021
%8 June
%I Association for Computational Linguistics
%C Online
%F duskin-etal-2021-evaluating
%X Current methods for evaluation of natural language generation models focus on measuring text quality but fail to probe the model creativity, i.e., its ability to generate novel but coherent text sequences not seen in the training corpus. We present the GenX tool which is designed to enable interactive exploration and explanation of natural language generation outputs with a focus on the detection of memorization. We demonstrate the utility of the tool on two domain-conditioned generation use cases - phishing emails and ACL abstracts.
%R 10.18653/v1/2021.dash-1.12
%U https://aclanthology.org/2021.dash-1.12/
%U https://doi.org/10.18653/v1/2021.dash-1.12
%P 70-78

Download as File

Markdown (Informal)

[Evaluating and Explaining Natural Language Generation with GenX](https://aclanthology.org/2021.dash-1.12/) (Duskin et al., DaSH 2021)

Evaluating and Explaining Natural Language Generation with GenX (Duskin et al., DaSH 2021)

ACL

Kayla Duskin, Shivam Sharma, Ji Young Yun, Emily Saldanha, and Dustin Arendt. 2021. Evaluating and Explaining Natural Language Generation with GenX. In Proceedings of the Second Workshop on Data Science with Human in the Loop: Language Advances, pages 70–78, Online. Association for Computational Linguistics.