Augmenting Data for Sarcasm Detection with Unlabeled Conversation Context

Hankyol Lee; Youngjae Yu; Gunhee Kim

doi:10.18653/v1/2020.figlang-1.2

Augmenting Data for Sarcasm Detection with Unlabeled Conversation Context

Correct Metadata for

Use this form to create a GitHub issue with structured data describing the correction. You will need a GitHub account. Once you create that issue, the correction will be reviewed by a staff member.

⚠️ Mobile Users: Submitting this form to create a new issue will only work with github.com, not the GitHub Mobile app.

Important: The Anthology treat PDFs as authoritative. Please use this form only to correct data that is out of line with the PDF. See our corrections guidelines if you need to change the PDF.

Title Adjust the title. Retain tags such as <fixed-case>.

Authors Adjust author names and order to match the PDF.

Abstract Correct abstract if needed. Retain XML formatting tags such as <tex-math>. You may use <b>...</b> for bold, <i>...</i> for italic, and <url>...</url> for URLs.

Verification against PDF Ensure that the new title/authors match the snapshot below. (If there is no snapshot or it is too small, consult the PDF.)

Authors concatenated from the text boxes above:

ALL author names match the snapshot above—including middle initials, hyphens, and accents.

Abstract

We present a novel data augmentation technique, CRA (Contextual Response Augmentation), which utilizes conversational context to generate meaningful samples for training. We also mitigate the issues regarding unbalanced context lengths by changing the input output format of the model such that it can deal with varying context lengths effectively. Specifically, our proposed model, trained with the proposed data augmentation technique, participated in the sarcasm detection task of FigLang2020, have won and achieves the best performance in both Reddit and Twitter datasets.

Anthology ID:: 2020.figlang-1.2
Volume:: Proceedings of the Second Workshop on Figurative Language Processing
Month:: July
Year:: 2020
Address:: Online
Editors:: Beata Beigman Klebanov, Ekaterina Shutova, Patricia Lichtenstein, Smaranda Muresan, Chee Wee, Anna Feldman, Debanjan Ghosh
Venue:: Fig-Lang
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 12–17
Language:
URL:: https://aclanthology.org/2020.figlang-1.2/
DOI:: 10.18653/v1/2020.figlang-1.2
Bibkey:
Cite (ACL):: Hankyol Lee, Youngjae Yu, and Gunhee Kim. 2020. Augmenting Data for Sarcasm Detection with Unlabeled Conversation Context. In Proceedings of the Second Workshop on Figurative Language Processing, pages 12–17, Online. Association for Computational Linguistics.
Cite (Informal):: Augmenting Data for Sarcasm Detection with Unlabeled Conversation Context (Lee et al., Fig-Lang 2020)
Copy Citation:
PDF:: https://aclanthology.org/2020.figlang-1.2.pdf
Video:: http://slideslive.com/38929696

PDF Cite Search Video Fix data

Export citation

BibTeX
MODS XML
Endnote
Preformatted

@inproceedings{lee-etal-2020-augmenting,
    title = "Augmenting Data for Sarcasm Detection with Unlabeled Conversation Context",
    author = "Lee, Hankyol  and
      Yu, Youngjae  and
      Kim, Gunhee",
    editor = "Klebanov, Beata Beigman  and
      Shutova, Ekaterina  and
      Lichtenstein, Patricia  and
      Muresan, Smaranda  and
      Wee, Chee  and
      Feldman, Anna  and
      Ghosh, Debanjan",
    booktitle = "Proceedings of the Second Workshop on Figurative Language Processing",
    month = jul,
    year = "2020",
    address = "Online",
    publisher = "Association for Computational Linguistics",
    url = "https://aclanthology.org/2020.figlang-1.2/",
    doi = "10.18653/v1/2020.figlang-1.2",
    pages = "12--17",
    abstract = "We present a novel data augmentation technique, CRA (Contextual Response Augmentation), which utilizes conversational context to generate meaningful samples for training. We also mitigate the issues regarding unbalanced context lengths by changing the input output format of the model such that it can deal with varying context lengths effectively. Specifically, our proposed model, trained with the proposed data augmentation technique, participated in the sarcasm detection task of FigLang2020, have won and achieves the best performance in both Reddit and Twitter datasets."
}

Download as File

<?xml version="1.0" encoding="UTF-8"?>
<modsCollection xmlns="http://www.loc.gov/mods/v3">
<mods ID="lee-etal-2020-augmenting">
    <titleInfo>
        <title>Augmenting Data for Sarcasm Detection with Unlabeled Conversation Context</title>
    </titleInfo>
    <name type="personal">
        <namePart type="given">Hankyol</namePart>
        <namePart type="family">Lee</namePart>
        <role>
            <roleTerm authority="marcrelator" type="text">author</roleTerm>
        </role>
    </name>
    <name type="personal">
        <namePart type="given">Youngjae</namePart>
        <namePart type="family">Yu</namePart>
        <role>
            <roleTerm authority="marcrelator" type="text">author</roleTerm>
        </role>
    </name>
    <name type="personal">
        <namePart type="given">Gunhee</namePart>
        <namePart type="family">Kim</namePart>
        <role>
            <roleTerm authority="marcrelator" type="text">author</roleTerm>
        </role>
    </name>
    <originInfo>
        <dateIssued>2020-07</dateIssued>
    </originInfo>
    <typeOfResource>text</typeOfResource>
    <relatedItem type="host">
        <titleInfo>
            <title>Proceedings of the Second Workshop on Figurative Language Processing</title>
        </titleInfo>
        <name type="personal">
            <namePart type="given">Beata</namePart>
            <namePart type="given">Beigman</namePart>
            <namePart type="family">Klebanov</namePart>
            <role>
                <roleTerm authority="marcrelator" type="text">editor</roleTerm>
            </role>
        </name>
        <name type="personal">
            <namePart type="given">Ekaterina</namePart>
            <namePart type="family">Shutova</namePart>
            <role>
                <roleTerm authority="marcrelator" type="text">editor</roleTerm>
            </role>
        </name>
        <name type="personal">
            <namePart type="given">Patricia</namePart>
            <namePart type="family">Lichtenstein</namePart>
            <role>
                <roleTerm authority="marcrelator" type="text">editor</roleTerm>
            </role>
        </name>
        <name type="personal">
            <namePart type="given">Smaranda</namePart>
            <namePart type="family">Muresan</namePart>
            <role>
                <roleTerm authority="marcrelator" type="text">editor</roleTerm>
            </role>
        </name>
        <name type="personal">
            <namePart type="given">Chee</namePart>
            <namePart type="family">Wee</namePart>
            <role>
                <roleTerm authority="marcrelator" type="text">editor</roleTerm>
            </role>
        </name>
        <name type="personal">
            <namePart type="given">Anna</namePart>
            <namePart type="family">Feldman</namePart>
            <role>
                <roleTerm authority="marcrelator" type="text">editor</roleTerm>
            </role>
        </name>
        <name type="personal">
            <namePart type="given">Debanjan</namePart>
            <namePart type="family">Ghosh</namePart>
            <role>
                <roleTerm authority="marcrelator" type="text">editor</roleTerm>
            </role>
        </name>
        <originInfo>
            <publisher>Association for Computational Linguistics</publisher>
            <place>
                <placeTerm type="text">Online</placeTerm>
            </place>
        </originInfo>
        <genre authority="marcgt">conference publication</genre>
    </relatedItem>
    <abstract>We present a novel data augmentation technique, CRA (Contextual Response Augmentation), which utilizes conversational context to generate meaningful samples for training. We also mitigate the issues regarding unbalanced context lengths by changing the input output format of the model such that it can deal with varying context lengths effectively. Specifically, our proposed model, trained with the proposed data augmentation technique, participated in the sarcasm detection task of FigLang2020, have won and achieves the best performance in both Reddit and Twitter datasets.</abstract>
    <identifier type="citekey">lee-etal-2020-augmenting</identifier>
    <identifier type="doi">10.18653/v1/2020.figlang-1.2</identifier>
    <location>
        <url>https://aclanthology.org/2020.figlang-1.2/</url>
    </location>
    <part>
        <date>2020-07</date>
        <extent unit="page">
            <start>12</start>
            <end>17</end>
        </extent>
    </part>
</mods>
</modsCollection>

Download as File

%0 Conference Proceedings
%T Augmenting Data for Sarcasm Detection with Unlabeled Conversation Context
%A Lee, Hankyol
%A Yu, Youngjae
%A Kim, Gunhee
%Y Klebanov, Beata Beigman
%Y Shutova, Ekaterina
%Y Lichtenstein, Patricia
%Y Muresan, Smaranda
%Y Wee, Chee
%Y Feldman, Anna
%Y Ghosh, Debanjan
%S Proceedings of the Second Workshop on Figurative Language Processing
%D 2020
%8 July
%I Association for Computational Linguistics
%C Online
%F lee-etal-2020-augmenting
%X We present a novel data augmentation technique, CRA (Contextual Response Augmentation), which utilizes conversational context to generate meaningful samples for training. We also mitigate the issues regarding unbalanced context lengths by changing the input output format of the model such that it can deal with varying context lengths effectively. Specifically, our proposed model, trained with the proposed data augmentation technique, participated in the sarcasm detection task of FigLang2020, have won and achieves the best performance in both Reddit and Twitter datasets.
%R 10.18653/v1/2020.figlang-1.2
%U https://aclanthology.org/2020.figlang-1.2/
%U https://doi.org/10.18653/v1/2020.figlang-1.2
%P 12-17

Download as File

Markdown (Informal)

[Augmenting Data for Sarcasm Detection with Unlabeled Conversation Context](https://aclanthology.org/2020.figlang-1.2/) (Lee et al., Fig-Lang 2020)

Augmenting Data for Sarcasm Detection with Unlabeled Conversation Context (Lee et al., Fig-Lang 2020)

ACL

Hankyol Lee, Youngjae Yu, and Gunhee Kim. 2020. Augmenting Data for Sarcasm Detection with Unlabeled Conversation Context. In Proceedings of the Second Workshop on Figurative Language Processing, pages 12–17, Online. Association for Computational Linguistics.