Comparative Analysis of Neural QA models on SQuAD

Soumya Wadhwa; Khyathi Chandu; Eric Nyberg

doi:10.18653/v1/W18-2610

Comparative Analysis of Neural QA models on SQuAD

Soumya Wadhwa, Khyathi Chandu, Eric Nyberg

Correct Metadata for

Use this form to create a GitHub issue with structured data describing the correction. You will need a GitHub account. Once you create that issue, the correction will be reviewed by a staff member.

⚠️ Mobile Users: Submitting this form to create a new issue will only work with github.com, not the GitHub Mobile app.

Important: The Anthology treat PDFs as authoritative. Please use this form only to correct data that is out of line with the PDF. See our corrections guidelines if you need to change the PDF.

Title Adjust the title. Retain tags such as <fixed-case>.

Authors Adjust author names and order to match the PDF.

Abstract Correct abstract if needed. Retain XML formatting tags such as <tex-math>. You may use ... for bold, ... for italic, ... for underline, <sc>...</sc> for small-caps, <tt>...<tt> for typewriter text, <url>...</url> for URLs, <a href=...> for hyperlinks, and <par/> for paragraph breaks.

Verification against PDF Ensure that the new title/authors match the snapshot below. (If there is no snapshot or it is too small, consult the PDF.)

Authors concatenated from the text boxes above:

ALL author names match the snapshot above—including middle initials, hyphens, and accents.

Abstract

The task of Question Answering has gained prominence in the past few decades for testing the ability of machines to understand natural language. Large datasets for Machine Reading have led to the development of neural models that cater to deeper language understanding compared to information retrieval tasks. Different components in these neural architectures are intended to tackle different challenges. As a first step towards achieving generalization across multiple domains, we attempt to understand and compare the peculiarities of existing end-to-end neural models on the Stanford Question Answering Dataset (SQuAD) by performing quantitative as well as qualitative analysis of the results attained by each of them. We observed that prediction errors reflect certain model-specific biases, which we further discuss in this paper.

Anthology ID:: W18-2610
Volume:: Proceedings of the Workshop on Machine Reading for Question Answering
Month:: July
Year:: 2018
Address:: Melbourne, Australia
Editors:: Eunsol Choi, Minjoon Seo, Danqi Chen, Robin Jia, Jonathan Berant
Venue:: ACL
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 89–97
Language:
URL:: https://aclanthology.org/W18-2610/
DOI:: 10.18653/v1/W18-2610
Bibkey:
Cite (ACL):: Soumya Wadhwa, Khyathi Chandu, and Eric Nyberg. 2018. Comparative Analysis of Neural QA models on SQuAD. In Proceedings of the Workshop on Machine Reading for Question Answering, pages 89–97, Melbourne, Australia. Association for Computational Linguistics.
Cite (Informal):: Comparative Analysis of Neural QA models on SQuAD (Wadhwa et al., ACL 2018)
Copy Citation:
PDF:: https://aclanthology.org/W18-2610.pdf

PDF Cite Search Fix data

Export citation

BibTeX
MODS XML
Endnote
Preformatted

@inproceedings{wadhwa-etal-2018-comparative,
    title = "Comparative Analysis of Neural {QA} models on {SQ}u{AD}",
    author = "Wadhwa, Soumya  and
      Chandu, Khyathi  and
      Nyberg, Eric",
    editor = "Choi, Eunsol  and
      Seo, Minjoon  and
      Chen, Danqi  and
      Jia, Robin  and
      Berant, Jonathan",
    booktitle = "Proceedings of the Workshop on Machine Reading for Question Answering",
    month = jul,
    year = "2018",
    address = "Melbourne, Australia",
    publisher = "Association for Computational Linguistics",
    url = "https://aclanthology.org/W18-2610/",
    doi = "10.18653/v1/W18-2610",
    pages = "89--97",
    abstract = "The task of Question Answering has gained prominence in the past few decades for testing the ability of machines to understand natural language. Large datasets for Machine Reading have led to the development of neural models that cater to deeper language understanding compared to information retrieval tasks. Different components in these neural architectures are intended to tackle different challenges. As a first step towards achieving generalization across multiple domains, we attempt to understand and compare the peculiarities of existing end-to-end neural models on the Stanford Question Answering Dataset (SQuAD) by performing quantitative as well as qualitative analysis of the results attained by each of them. We observed that prediction errors reflect certain model-specific biases, which we further discuss in this paper."
}

Download as File

<?xml version="1.0" encoding="UTF-8"?>
<modsCollection xmlns="http://www.loc.gov/mods/v3">
<mods ID="wadhwa-etal-2018-comparative">
    <titleInfo>
        <title>Comparative Analysis of Neural QA models on SQuAD</title>
    </titleInfo>
    <name type="personal">
        <namePart type="given">Soumya</namePart>
        <namePart type="family">Wadhwa</namePart>
        <role>
            <roleTerm authority="marcrelator" type="text">author</roleTerm>
        </role>
    </name>
    <name type="personal">
        <namePart type="given">Khyathi</namePart>
        <namePart type="family">Chandu</namePart>
        <role>
            <roleTerm authority="marcrelator" type="text">author</roleTerm>
        </role>
    </name>
    <name type="personal">
        <namePart type="given">Eric</namePart>
        <namePart type="family">Nyberg</namePart>
        <role>
            <roleTerm authority="marcrelator" type="text">author</roleTerm>
        </role>
    </name>
    <originInfo>
        <dateIssued>2018-07</dateIssued>
    </originInfo>
    <typeOfResource>text</typeOfResource>
    <relatedItem type="host">
        <titleInfo>
            <title>Proceedings of the Workshop on Machine Reading for Question Answering</title>
        </titleInfo>
        <name type="personal">
            <namePart type="given">Eunsol</namePart>
            <namePart type="family">Choi</namePart>
            <role>
                <roleTerm authority="marcrelator" type="text">editor</roleTerm>
            </role>
        </name>
        <name type="personal">
            <namePart type="given">Minjoon</namePart>
            <namePart type="family">Seo</namePart>
            <role>
                <roleTerm authority="marcrelator" type="text">editor</roleTerm>
            </role>
        </name>
        <name type="personal">
            <namePart type="given">Danqi</namePart>
            <namePart type="family">Chen</namePart>
            <role>
                <roleTerm authority="marcrelator" type="text">editor</roleTerm>
            </role>
        </name>
        <name type="personal">
            <namePart type="given">Robin</namePart>
            <namePart type="family">Jia</namePart>
            <role>
                <roleTerm authority="marcrelator" type="text">editor</roleTerm>
            </role>
        </name>
        <name type="personal">
            <namePart type="given">Jonathan</namePart>
            <namePart type="family">Berant</namePart>
            <role>
                <roleTerm authority="marcrelator" type="text">editor</roleTerm>
            </role>
        </name>
        <originInfo>
            <publisher>Association for Computational Linguistics</publisher>
            <place>
                <placeTerm type="text">Melbourne, Australia</placeTerm>
            </place>
        </originInfo>
        <genre authority="marcgt">conference publication</genre>
    </relatedItem>
    <abstract>The task of Question Answering has gained prominence in the past few decades for testing the ability of machines to understand natural language. Large datasets for Machine Reading have led to the development of neural models that cater to deeper language understanding compared to information retrieval tasks. Different components in these neural architectures are intended to tackle different challenges. As a first step towards achieving generalization across multiple domains, we attempt to understand and compare the peculiarities of existing end-to-end neural models on the Stanford Question Answering Dataset (SQuAD) by performing quantitative as well as qualitative analysis of the results attained by each of them. We observed that prediction errors reflect certain model-specific biases, which we further discuss in this paper.</abstract>
    <identifier type="citekey">wadhwa-etal-2018-comparative</identifier>
    <identifier type="doi">10.18653/v1/W18-2610</identifier>
    <location>
        <url>https://aclanthology.org/W18-2610/</url>
    </location>
    <part>
        <date>2018-07</date>
        <extent unit="page">
            <start>89</start>
            <end>97</end>
        </extent>
    </part>
</mods>
</modsCollection>

Download as File

%0 Conference Proceedings
%T Comparative Analysis of Neural QA models on SQuAD
%A Wadhwa, Soumya
%A Chandu, Khyathi
%A Nyberg, Eric
%Y Choi, Eunsol
%Y Seo, Minjoon
%Y Chen, Danqi
%Y Jia, Robin
%Y Berant, Jonathan
%S Proceedings of the Workshop on Machine Reading for Question Answering
%D 2018
%8 July
%I Association for Computational Linguistics
%C Melbourne, Australia
%F wadhwa-etal-2018-comparative
%X The task of Question Answering has gained prominence in the past few decades for testing the ability of machines to understand natural language. Large datasets for Machine Reading have led to the development of neural models that cater to deeper language understanding compared to information retrieval tasks. Different components in these neural architectures are intended to tackle different challenges. As a first step towards achieving generalization across multiple domains, we attempt to understand and compare the peculiarities of existing end-to-end neural models on the Stanford Question Answering Dataset (SQuAD) by performing quantitative as well as qualitative analysis of the results attained by each of them. We observed that prediction errors reflect certain model-specific biases, which we further discuss in this paper.
%R 10.18653/v1/W18-2610
%U https://aclanthology.org/W18-2610/
%U https://doi.org/10.18653/v1/W18-2610
%P 89-97

Download as File

Markdown (Informal)

[Comparative Analysis of Neural QA models on SQuAD](https://aclanthology.org/W18-2610/) (Wadhwa et al., ACL 2018)

Comparative Analysis of Neural QA models on SQuAD (Wadhwa et al., ACL 2018)

ACL

Soumya Wadhwa, Khyathi Chandu, and Eric Nyberg. 2018. Comparative Analysis of Neural QA models on SQuAD. In Proceedings of the Workshop on Machine Reading for Question Answering, pages 89–97, Melbourne, Australia. Association for Computational Linguistics.