Recognizing Textual Entailment in Twitter Using Word Embeddings

Octavia-Maria Şulea

doi:10.18653/v1/W17-5306

Abstract

In this paper, we investigate the application of machine learning techniques and word embeddings to the task of Recognizing Textual Entailment (RTE) in Social Media. We look at a manually labeled dataset consisting of user generated short texts posted on Twitter (tweets) and related to four recent media events (the Charlie Hebdo shooting, the Ottawa shooting, the Sydney Siege, and the German Wings crash) and test to what extent neural techniques and embeddings are able to distinguish between tweets that entail or contradict each other or that claim unrelated things. We obtain comparable results to the state of the art in a train-test setting, but we show that, due to the noisy aspect of the data, results plummet in an evaluation strategy crafted to better simulate a real-life train-test scenario.
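
The abstract does not say how tweet pairs are turned into classifier input. As a minimal sketch, assuming each tweet is represented by the average of its word vectors and each pair by a few hand-picked features (both choices are illustrative assumptions, not the paper's confirmed method), a pair encoder for a three-way entailment / contradiction / unrelated classifier might look like this. The `TOY_EMBEDDINGS` table and the function names are hypothetical; a real system would load pretrained embeddings instead.

```python
import math

# Hypothetical toy 3-dimensional embeddings, made up for illustration only.
# A real system would load pretrained word vectors here.
TOY_EMBEDDINGS = {
    "gunman": [0.9, 0.1, 0.0],
    "shooter": [0.8, 0.2, 0.1],
    "arrested": [0.1, 0.9, 0.2],
    "escaped": [0.1, 0.2, 0.9],
}


def average_embedding(tokens, table, dim=3):
    """Return the mean vector of the known tokens, or zeros if none are known."""
    vectors = [table[t] for t in tokens if t in table]
    if not vectors:
        return [0.0] * dim
    return [sum(column) / len(vectors) for column in zip(*vectors)]


def cosine(u, v):
    """Cosine similarity, defined as 0.0 when either vector is all zeros."""
    norm_u = math.sqrt(sum(x * x for x in u))
    norm_v = math.sqrt(sum(x * x for x in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return sum(x * y for x, y in zip(u, v)) / (norm_u * norm_v)


def pair_features(tweet_a, tweet_b, table=TOY_EMBEDDINGS):
    """Encode a tweet pair as [avg_a, avg_b, |avg_a - avg_b|, cosine(avg_a, avg_b)]."""
    a = average_embedding(tweet_a.lower().split(), table)
    b = average_embedding(tweet_b.lower().split(), table)
    diff = [abs(x - y) for x, y in zip(a, b)]
    return a + b + diff + [cosine(a, b)]


features = pair_features("gunman arrested", "shooter escaped")
```

The resulting feature vector could be passed to any standard classifier trained on the three labels. Whitespace tokenization is used only to keep the sketch short; noisy tweet text would normally need a dedicated tokenizer.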

Anthology ID: W17-5306
Volume: Proceedings of the 2nd Workshop on Evaluating Vector Space Representations for NLP
Month: September
Year: 2017
Address: Copenhagen, Denmark
Editors: Samuel R. Bowman, Yoav Goldberg, Felix Hill, Angeliki Lazaridou, Omer Levy, Roi Reichart, Anders Søgaard
Venue: RepEval
Publisher: Association for Computational Linguistics
Pages: 31–35
URL: https://aclanthology.org/W17-5306/
DOI: 10.18653/v1/W17-5306
Cite (ACL): Octavia-Maria Şulea. 2017. Recognizing Textual Entailment in Twitter Using Word Embeddings. In Proceedings of the 2nd Workshop on Evaluating Vector Space Representations for NLP, pages 31–35, Copenhagen, Denmark. Association for Computational Linguistics.
Cite (Informal): Recognizing Textual Entailment in Twitter Using Word Embeddings (Şulea, RepEval 2017)
PDF: https://aclanthology.org/W17-5306.pdf

Export citation

BibTeX

@inproceedings{sulea-2017-recognizing,
    title = "Recognizing Textual Entailment in {T}witter Using Word Embeddings",
    author = "{\c{S}}ulea, Octavia-Maria",
    editor = "Bowman, Samuel R.  and
      Goldberg, Yoav  and
      Hill, Felix  and
      Lazaridou, Angeliki  and
      Levy, Omer  and
      Reichart, Roi  and
      S{\o}gaard, Anders",
    booktitle = "Proceedings of the 2nd Workshop on Evaluating Vector Space Representations for {NLP}",
    month = sep,
    year = "2017",
    address = "Copenhagen, Denmark",
    publisher = "Association for Computational Linguistics",
    url = "https://aclanthology.org/W17-5306/",
    doi = "10.18653/v1/W17-5306",
    pages = "31--35",
    abstract = "In this paper, we investigate the application of machine learning techniques and word embeddings to the task of Recognizing Textual Entailment (RTE) in Social Media. We look at a manually labeled dataset consisting of user generated short texts posted on Twitter (tweets) and related to four recent media events (the Charlie Hebdo shooting, the Ottawa shooting, the Sydney Siege, and the German Wings crash) and test to what extent neural techniques and embeddings are able to distinguish between tweets that entail or contradict each other or that claim unrelated things. We obtain comparable results to the state of the art in a train-test setting, but we show that, due to the noisy aspect of the data, results plummet in an evaluation strategy crafted to better simulate a real-life train-test scenario."
}

MODS XML

<?xml version="1.0" encoding="UTF-8"?>
<modsCollection xmlns="http://www.loc.gov/mods/v3">
<mods ID="sulea-2017-recognizing">
    <titleInfo>
        <title>Recognizing Textual Entailment in Twitter Using Word Embeddings</title>
    </titleInfo>
    <name type="personal">
        <namePart type="given">Octavia-Maria</namePart>
        <namePart type="family">Şulea</namePart>
        <role>
            <roleTerm authority="marcrelator" type="text">author</roleTerm>
        </role>
    </name>
    <originInfo>
        <dateIssued>2017-09</dateIssued>
    </originInfo>
    <typeOfResource>text</typeOfResource>
    <relatedItem type="host">
        <titleInfo>
            <title>Proceedings of the 2nd Workshop on Evaluating Vector Space Representations for NLP</title>
        </titleInfo>
        <name type="personal">
            <namePart type="given">Samuel</namePart>
            <namePart type="given">R</namePart>
            <namePart type="family">Bowman</namePart>
            <role>
                <roleTerm authority="marcrelator" type="text">editor</roleTerm>
            </role>
        </name>
        <name type="personal">
            <namePart type="given">Yoav</namePart>
            <namePart type="family">Goldberg</namePart>
            <role>
                <roleTerm authority="marcrelator" type="text">editor</roleTerm>
            </role>
        </name>
        <name type="personal">
            <namePart type="given">Felix</namePart>
            <namePart type="family">Hill</namePart>
            <role>
                <roleTerm authority="marcrelator" type="text">editor</roleTerm>
            </role>
        </name>
        <name type="personal">
            <namePart type="given">Angeliki</namePart>
            <namePart type="family">Lazaridou</namePart>
            <role>
                <roleTerm authority="marcrelator" type="text">editor</roleTerm>
            </role>
        </name>
        <name type="personal">
            <namePart type="given">Omer</namePart>
            <namePart type="family">Levy</namePart>
            <role>
                <roleTerm authority="marcrelator" type="text">editor</roleTerm>
            </role>
        </name>
        <name type="personal">
            <namePart type="given">Roi</namePart>
            <namePart type="family">Reichart</namePart>
            <role>
                <roleTerm authority="marcrelator" type="text">editor</roleTerm>
            </role>
        </name>
        <name type="personal">
            <namePart type="given">Anders</namePart>
            <namePart type="family">Søgaard</namePart>
            <role>
                <roleTerm authority="marcrelator" type="text">editor</roleTerm>
            </role>
        </name>
        <originInfo>
            <publisher>Association for Computational Linguistics</publisher>
            <place>
                <placeTerm type="text">Copenhagen, Denmark</placeTerm>
            </place>
        </originInfo>
        <genre authority="marcgt">conference publication</genre>
    </relatedItem>
    <abstract>In this paper, we investigate the application of machine learning techniques and word embeddings to the task of Recognizing Textual Entailment (RTE) in Social Media. We look at a manually labeled dataset consisting of user generated short texts posted on Twitter (tweets) and related to four recent media events (the Charlie Hebdo shooting, the Ottawa shooting, the Sydney Siege, and the German Wings crash) and test to what extent neural techniques and embeddings are able to distinguish between tweets that entail or contradict each other or that claim unrelated things. We obtain comparable results to the state of the art in a train-test setting, but we show that, due to the noisy aspect of the data, results plummet in an evaluation strategy crafted to better simulate a real-life train-test scenario.</abstract>
    <identifier type="citekey">sulea-2017-recognizing</identifier>
    <identifier type="doi">10.18653/v1/W17-5306</identifier>
    <location>
        <url>https://aclanthology.org/W17-5306/</url>
    </location>
    <part>
        <date>2017-09</date>
        <extent unit="page">
            <start>31</start>
            <end>35</end>
        </extent>
    </part>
</mods>
</modsCollection>

Endnote

%0 Conference Proceedings
%T Recognizing Textual Entailment in Twitter Using Word Embeddings
%A Şulea, Octavia-Maria
%Y Bowman, Samuel R.
%Y Goldberg, Yoav
%Y Hill, Felix
%Y Lazaridou, Angeliki
%Y Levy, Omer
%Y Reichart, Roi
%Y Søgaard, Anders
%S Proceedings of the 2nd Workshop on Evaluating Vector Space Representations for NLP
%D 2017
%8 September
%I Association for Computational Linguistics
%C Copenhagen, Denmark
%F sulea-2017-recognizing
%X In this paper, we investigate the application of machine learning techniques and word embeddings to the task of Recognizing Textual Entailment (RTE) in Social Media. We look at a manually labeled dataset consisting of user generated short texts posted on Twitter (tweets) and related to four recent media events (the Charlie Hebdo shooting, the Ottawa shooting, the Sydney Siege, and the German Wings crash) and test to what extent neural techniques and embeddings are able to distinguish between tweets that entail or contradict each other or that claim unrelated things. We obtain comparable results to the state of the art in a train-test setting, but we show that, due to the noisy aspect of the data, results plummet in an evaluation strategy crafted to better simulate a real-life train-test scenario.
%R 10.18653/v1/W17-5306
%U https://aclanthology.org/W17-5306/
%U https://doi.org/10.18653/v1/W17-5306
%P 31-35

Markdown (Informal)

[Recognizing Textual Entailment in Twitter Using Word Embeddings](https://aclanthology.org/W17-5306/) (Şulea, RepEval 2017)

ACL

Octavia-Maria Şulea. 2017. Recognizing Textual Entailment in Twitter Using Word Embeddings. In Proceedings of the 2nd Workshop on Evaluating Vector Space Representations for NLP, pages 31–35, Copenhagen, Denmark. Association for Computational Linguistics.