ViHealthNLI: A Dataset for Vietnamese Natural Language Inference in Healthcare

Huyen Nguyen; Quyen The Ngo; Thanh-Ha Do; Tuan-Anh Hoang

ViHealthNLI: A Dataset for Vietnamese Natural Language Inference in Healthcare

Huyen Nguyen, Quyen The Ngo, Thanh-Ha Do, Tuan-Anh Hoang

Correct Metadata for

Use this form to create a GitHub issue with structured data describing the correction. You will need a GitHub account. Once you create that issue, the correction will be reviewed by a staff member.

⚠️ Mobile Users: Submitting this form to create a new issue will only work with github.com, not the GitHub Mobile app.

Important: The Anthology treat PDFs as authoritative. Please use this form only to correct data that is out of line with the PDF. See our corrections guidelines if you need to change the PDF.

Title Adjust the title. Retain tags such as <fixed-case>.

Authors Adjust author names and order to match the PDF.

Abstract Correct abstract if needed. Retain XML formatting tags such as <tex-math>. You may use <b>...</b> for bold, <i>...</i> for italic, and <url>...</url> for URLs.

Verification against PDF Ensure that the new title/authors match the snapshot below. (If there is no snapshot or it is too small, consult the PDF.)

Authors concatenated from the text boxes above:

ALL author names match the snapshot above—including middle initials, hyphens, and accents.

Abstract

This paper introduces ViHealthNLI, a large dataset for the natural language inference problem for Vietnamese. Unlike the similar Vietnamese datasets, ours is specific to the healthcare domain. We conducted an exploratory analysis to characterize the dataset and evaluated the state-of-the-art methods on the dataset. Our findings indicate that the dataset poses significant challenges while also holding promise for further advanced research and the creation of practical applications.

Anthology ID:: 2024.sigul-1.48
Volume:: Proceedings of the 3rd Annual Meeting of the Special Interest Group on Under-resourced Languages @ LREC-COLING 2024
Month:: May
Year:: 2024
Address:: Torino, Italia
Editors:: Maite Melero, Sakriani Sakti, Claudia Soria
Venues:: SIGUL | WS
SIG:
Publisher:: ELRA and ICCL
Note:
Pages:: 404–409
Language:
URL:: https://aclanthology.org/2024.sigul-1.48/
DOI:
Bibkey:
Cite (ACL):: Huyen Nguyen, Quyen The Ngo, Thanh-Ha Do, and Tuan-Anh Hoang. 2024. ViHealthNLI: A Dataset for Vietnamese Natural Language Inference in Healthcare. In Proceedings of the 3rd Annual Meeting of the Special Interest Group on Under-resourced Languages @ LREC-COLING 2024, pages 404–409, Torino, Italia. ELRA and ICCL.
Cite (Informal):: ViHealthNLI: A Dataset for Vietnamese Natural Language Inference in Healthcare (Nguyen et al., SIGUL 2024)
Copy Citation:
PDF:: https://aclanthology.org/2024.sigul-1.48.pdf

PDF Cite Search Fix data

Export citation

BibTeX
MODS XML
Endnote
Preformatted

@inproceedings{nguyen-etal-2024-vihealthnli,
    title = "{V}i{H}ealth{NLI}: A Dataset for {V}ietnamese Natural Language Inference in Healthcare",
    author = "Nguyen, Huyen  and
      Ngo, Quyen The  and
      Do, Thanh-Ha  and
      Hoang, Tuan-Anh",
    editor = "Melero, Maite  and
      Sakti, Sakriani  and
      Soria, Claudia",
    booktitle = "Proceedings of the 3rd Annual Meeting of the Special Interest Group on Under-resourced Languages @ LREC-COLING 2024",
    month = may,
    year = "2024",
    address = "Torino, Italia",
    publisher = "ELRA and ICCL",
    url = "https://aclanthology.org/2024.sigul-1.48/",
    pages = "404--409",
    abstract = "This paper introduces ViHealthNLI, a large dataset for the natural language inference problem for Vietnamese. Unlike the similar Vietnamese datasets, ours is specific to the healthcare domain. We conducted an exploratory analysis to characterize the dataset and evaluated the state-of-the-art methods on the dataset. Our findings indicate that the dataset poses significant challenges while also holding promise for further advanced research and the creation of practical applications."
}

Download as File

<?xml version="1.0" encoding="UTF-8"?>
<modsCollection xmlns="http://www.loc.gov/mods/v3">
<mods ID="nguyen-etal-2024-vihealthnli">
    <titleInfo>
        <title>ViHealthNLI: A Dataset for Vietnamese Natural Language Inference in Healthcare</title>
    </titleInfo>
    <name type="personal">
        <namePart type="given">Huyen</namePart>
        <namePart type="family">Nguyen</namePart>
        <role>
            <roleTerm authority="marcrelator" type="text">author</roleTerm>
        </role>
    </name>
    <name type="personal">
        <namePart type="given">Quyen</namePart>
        <namePart type="given">The</namePart>
        <namePart type="family">Ngo</namePart>
        <role>
            <roleTerm authority="marcrelator" type="text">author</roleTerm>
        </role>
    </name>
    <name type="personal">
        <namePart type="given">Thanh-Ha</namePart>
        <namePart type="family">Do</namePart>
        <role>
            <roleTerm authority="marcrelator" type="text">author</roleTerm>
        </role>
    </name>
    <name type="personal">
        <namePart type="given">Tuan-Anh</namePart>
        <namePart type="family">Hoang</namePart>
        <role>
            <roleTerm authority="marcrelator" type="text">author</roleTerm>
        </role>
    </name>
    <originInfo>
        <dateIssued>2024-05</dateIssued>
    </originInfo>
    <typeOfResource>text</typeOfResource>
    <relatedItem type="host">
        <titleInfo>
            <title>Proceedings of the 3rd Annual Meeting of the Special Interest Group on Under-resourced Languages @ LREC-COLING 2024</title>
        </titleInfo>
        <name type="personal">
            <namePart type="given">Maite</namePart>
            <namePart type="family">Melero</namePart>
            <role>
                <roleTerm authority="marcrelator" type="text">editor</roleTerm>
            </role>
        </name>
        <name type="personal">
            <namePart type="given">Sakriani</namePart>
            <namePart type="family">Sakti</namePart>
            <role>
                <roleTerm authority="marcrelator" type="text">editor</roleTerm>
            </role>
        </name>
        <name type="personal">
            <namePart type="given">Claudia</namePart>
            <namePart type="family">Soria</namePart>
            <role>
                <roleTerm authority="marcrelator" type="text">editor</roleTerm>
            </role>
        </name>
        <originInfo>
            <publisher>ELRA and ICCL</publisher>
            <place>
                <placeTerm type="text">Torino, Italia</placeTerm>
            </place>
        </originInfo>
        <genre authority="marcgt">conference publication</genre>
    </relatedItem>
    <abstract>This paper introduces ViHealthNLI, a large dataset for the natural language inference problem for Vietnamese. Unlike the similar Vietnamese datasets, ours is specific to the healthcare domain. We conducted an exploratory analysis to characterize the dataset and evaluated the state-of-the-art methods on the dataset. Our findings indicate that the dataset poses significant challenges while also holding promise for further advanced research and the creation of practical applications.</abstract>
    <identifier type="citekey">nguyen-etal-2024-vihealthnli</identifier>
    <location>
        <url>https://aclanthology.org/2024.sigul-1.48/</url>
    </location>
    <part>
        <date>2024-05</date>
        <extent unit="page">
            <start>404</start>
            <end>409</end>
        </extent>
    </part>
</mods>
</modsCollection>

Download as File

%0 Conference Proceedings
%T ViHealthNLI: A Dataset for Vietnamese Natural Language Inference in Healthcare
%A Nguyen, Huyen
%A Ngo, Quyen The
%A Do, Thanh-Ha
%A Hoang, Tuan-Anh
%Y Melero, Maite
%Y Sakti, Sakriani
%Y Soria, Claudia
%S Proceedings of the 3rd Annual Meeting of the Special Interest Group on Under-resourced Languages @ LREC-COLING 2024
%D 2024
%8 May
%I ELRA and ICCL
%C Torino, Italia
%F nguyen-etal-2024-vihealthnli
%X This paper introduces ViHealthNLI, a large dataset for the natural language inference problem for Vietnamese. Unlike the similar Vietnamese datasets, ours is specific to the healthcare domain. We conducted an exploratory analysis to characterize the dataset and evaluated the state-of-the-art methods on the dataset. Our findings indicate that the dataset poses significant challenges while also holding promise for further advanced research and the creation of practical applications.
%U https://aclanthology.org/2024.sigul-1.48/
%P 404-409

Download as File

Markdown (Informal)

[ViHealthNLI: A Dataset for Vietnamese Natural Language Inference in Healthcare](https://aclanthology.org/2024.sigul-1.48/) (Nguyen et al., SIGUL 2024)

ViHealthNLI: A Dataset for Vietnamese Natural Language Inference in Healthcare (Nguyen et al., SIGUL 2024)

ACL

Huyen Nguyen, Quyen The Ngo, Thanh-Ha Do, and Tuan-Anh Hoang. 2024. ViHealthNLI: A Dataset for Vietnamese Natural Language Inference in Healthcare. In Proceedings of the 3rd Annual Meeting of the Special Interest Group on Under-resourced Languages @ LREC-COLING 2024, pages 404–409, Torino, Italia. ELRA and ICCL.