ChicHealth @ MEDIQA 2021: Exploring the limits of pre-trained seq2seq models for medical summarization

Liwen Xu; Yan Zhang; Lei Hong; Yi Cai; Szui Sung

doi:10.18653/v1/2021.bionlp-1.29

Abstract

In this article, we describe our system for the MEDIQA 2021 shared tasks. We first describe our method for the second task, multi-answer summarization (MAS). For extractive summarization, we follow the rules of (CITATION): candidate sentences are first roughly scored using the RoBERTa model, and a Markov chain model is then used to evaluate the sentences in a fine-grained manner. Our team won first place in overall performance, with fourth place in the MAS task, seventh place in the RRS task, and eleventh place in the QS task. For the QS and RRS tasks, we investigate the performance of end-to-end pre-trained seq2seq models. Experiments show that adversarial training and reverse translation improve fine-tuning performance.
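
The abstract does not say how the coarse scores or the Markov chain are constructed, so the following is only a minimal sketch of a two-stage extractive pipeline of the kind it describes. Everything in it is an assumption for illustration: the coarse scores are passed in as plain numbers (standing in for a RoBERTa-based scorer), the Markov chain is a LexRank-style random walk over word-overlap similarity, and the names `jaccard`, `markov_rank`, and `select` are hypothetical rather than taken from the paper.

```python
from typing import List


def jaccard(a: str, b: str) -> float:
    """Word-overlap similarity between two sentences (an assumed stand-in)."""
    wa, wb = set(a.lower().split()), set(b.lower().split())
    if not wa or not wb:
        return 0.0
    return len(wa & wb) / len(wa | wb)


def markov_rank(sentences: List[str], coarse: List[float],
                damping: float = 0.85, iters: int = 50) -> List[float]:
    """Approximate stationary distribution of a random walk over sentences.

    Transitions follow sentence similarity; the restart distribution is the
    normalized coarse score (assumed positive, e.g. from a RoBERTa scorer).
    """
    n = len(sentences)
    total = sum(coarse)
    restart = [c / total for c in coarse]
    # Similarity matrix without self-loops.
    sim = [[0.0 if i == j else jaccard(sentences[i], sentences[j])
            for j in range(n)] for i in range(n)]
    trans = []
    for row in sim:
        s = sum(row)
        # A sentence with no similar neighbour jumps to the restart distribution.
        trans.append([v / s for v in row] if s > 0 else restart[:])
    rank = restart[:]
    for _ in range(iters):  # power iteration
        rank = [(1 - damping) * restart[j]
                + damping * sum(rank[i] * trans[i][j] for i in range(n))
                for j in range(n)]
    return rank


def select(sentences: List[str], coarse: List[float],
           keep: int = 3, top_k: int = 2) -> List[str]:
    """Two-stage selection: coarse filter, then Markov-chain re-ranking."""
    # Stage 1: keep the `keep` best sentences by coarse score.
    order = sorted(range(len(sentences)),
                   key=lambda i: coarse[i], reverse=True)[:keep]
    cand = [sentences[i] for i in order]
    cand_scores = [coarse[i] for i in order]
    # Stage 2: re-rank the surviving candidates with the random walk.
    rank = markov_rank(cand, cand_scores)
    best = sorted(range(len(cand)), key=lambda i: rank[i], reverse=True)[:top_k]
    # Return the selected sentences in their original document order.
    return [sentences[i] for i in sorted(order[b] for b in best)]
```

Calling `select(sentences, coarse_scores)` with a list of answer sentences and one positive score per sentence returns the `top_k` sentences in their original order. Using the coarse score as the restart distribution is one simple way to let the first stage inform the second; the paper may combine the two stages differently.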

Anthology ID: 2021.bionlp-1.29
Volume: Proceedings of the 20th Workshop on Biomedical Language Processing
Month: June
Year: 2021
Address: Online
Editors: Dina Demner-Fushman, Kevin Bretonnel Cohen, Sophia Ananiadou, Junichi Tsujii
Venue: BioNLP
SIG: SIGBIOMED
Publisher: Association for Computational Linguistics
Pages: 263–267
URL: https://aclanthology.org/2021.bionlp-1.29/
DOI: 10.18653/v1/2021.bionlp-1.29
Bibkey: xu-etal-2021-chichealth
Cite (ACL): Liwen Xu, Yan Zhang, Lei Hong, Yi Cai, and Szui Sung. 2021. ChicHealth @ MEDIQA 2021: Exploring the limits of pre-trained seq2seq models for medical summarization. In Proceedings of the 20th Workshop on Biomedical Language Processing, pages 263–267, Online. Association for Computational Linguistics.
Cite (Informal): ChicHealth @ MEDIQA 2021: Exploring the limits of pre-trained seq2seq models for medical summarization (Xu et al., BioNLP 2021)
PDF: https://aclanthology.org/2021.bionlp-1.29.pdf

BibTeX

@inproceedings{xu-etal-2021-chichealth,
    title = "{C}hic{H}ealth @ {MEDIQA} 2021: Exploring the limits of pre-trained seq2seq models for medical summarization",
    author = "Xu, Liwen  and
      Zhang, Yan  and
      Hong, Lei  and
      Cai, Yi  and
      Sung, Szui",
    editor = "Demner-Fushman, Dina  and
      Cohen, Kevin Bretonnel  and
      Ananiadou, Sophia  and
      Tsujii, Junichi",
    booktitle = "Proceedings of the 20th Workshop on Biomedical Language Processing",
    month = jun,
    year = "2021",
    address = "Online",
    publisher = "Association for Computational Linguistics",
    url = "https://aclanthology.org/2021.bionlp-1.29/",
    doi = "10.18653/v1/2021.bionlp-1.29",
    pages = "263--267",
    abstract = "In this article, we describe our system for the MEDIQA 2021 shared tasks. We first describe our method for the second task, multi-answer summarization (MAS). For extractive summarization, we follow the rules of (CITATION): candidate sentences are first roughly scored using the RoBERTa model, and a Markov chain model is then used to evaluate the sentences in a fine-grained manner. Our team won first place in overall performance, with fourth place in the MAS task, seventh place in the RRS task, and eleventh place in the QS task. For the QS and RRS tasks, we investigate the performance of end-to-end pre-trained seq2seq models. Experiments show that adversarial training and reverse translation improve fine-tuning performance."
}

MODS XML

<?xml version="1.0" encoding="UTF-8"?>
<modsCollection xmlns="http://www.loc.gov/mods/v3">
<mods ID="xu-etal-2021-chichealth">
    <titleInfo>
        <title>ChicHealth @ MEDIQA 2021: Exploring the limits of pre-trained seq2seq models for medical summarization</title>
    </titleInfo>
    <name type="personal">
        <namePart type="given">Liwen</namePart>
        <namePart type="family">Xu</namePart>
        <role>
            <roleTerm authority="marcrelator" type="text">author</roleTerm>
        </role>
    </name>
    <name type="personal">
        <namePart type="given">Yan</namePart>
        <namePart type="family">Zhang</namePart>
        <role>
            <roleTerm authority="marcrelator" type="text">author</roleTerm>
        </role>
    </name>
    <name type="personal">
        <namePart type="given">Lei</namePart>
        <namePart type="family">Hong</namePart>
        <role>
            <roleTerm authority="marcrelator" type="text">author</roleTerm>
        </role>
    </name>
    <name type="personal">
        <namePart type="given">Yi</namePart>
        <namePart type="family">Cai</namePart>
        <role>
            <roleTerm authority="marcrelator" type="text">author</roleTerm>
        </role>
    </name>
    <name type="personal">
        <namePart type="given">Szui</namePart>
        <namePart type="family">Sung</namePart>
        <role>
            <roleTerm authority="marcrelator" type="text">author</roleTerm>
        </role>
    </name>
    <originInfo>
        <dateIssued>2021-06</dateIssued>
    </originInfo>
    <typeOfResource>text</typeOfResource>
    <relatedItem type="host">
        <titleInfo>
            <title>Proceedings of the 20th Workshop on Biomedical Language Processing</title>
        </titleInfo>
        <name type="personal">
            <namePart type="given">Dina</namePart>
            <namePart type="family">Demner-Fushman</namePart>
            <role>
                <roleTerm authority="marcrelator" type="text">editor</roleTerm>
            </role>
        </name>
        <name type="personal">
            <namePart type="given">Kevin</namePart>
            <namePart type="given">Bretonnel</namePart>
            <namePart type="family">Cohen</namePart>
            <role>
                <roleTerm authority="marcrelator" type="text">editor</roleTerm>
            </role>
        </name>
        <name type="personal">
            <namePart type="given">Sophia</namePart>
            <namePart type="family">Ananiadou</namePart>
            <role>
                <roleTerm authority="marcrelator" type="text">editor</roleTerm>
            </role>
        </name>
        <name type="personal">
            <namePart type="given">Junichi</namePart>
            <namePart type="family">Tsujii</namePart>
            <role>
                <roleTerm authority="marcrelator" type="text">editor</roleTerm>
            </role>
        </name>
        <originInfo>
            <publisher>Association for Computational Linguistics</publisher>
            <place>
                <placeTerm type="text">Online</placeTerm>
            </place>
        </originInfo>
        <genre authority="marcgt">conference publication</genre>
    </relatedItem>
    <abstract>In this article, we describe our system for the MEDIQA 2021 shared tasks. We first describe our method for the second task, multi-answer summarization (MAS). For extractive summarization, we follow the rules of (CITATION): candidate sentences are first roughly scored using the RoBERTa model, and a Markov chain model is then used to evaluate the sentences in a fine-grained manner. Our team won first place in overall performance, with fourth place in the MAS task, seventh place in the RRS task, and eleventh place in the QS task. For the QS and RRS tasks, we investigate the performance of end-to-end pre-trained seq2seq models. Experiments show that adversarial training and reverse translation improve fine-tuning performance.</abstract>
    <identifier type="citekey">xu-etal-2021-chichealth</identifier>
    <identifier type="doi">10.18653/v1/2021.bionlp-1.29</identifier>
    <location>
        <url>https://aclanthology.org/2021.bionlp-1.29/</url>
    </location>
    <part>
        <date>2021-06</date>
        <extent unit="page">
            <start>263</start>
            <end>267</end>
        </extent>
    </part>
</mods>
</modsCollection>

Endnote

%0 Conference Proceedings
%T ChicHealth @ MEDIQA 2021: Exploring the limits of pre-trained seq2seq models for medical summarization
%A Xu, Liwen
%A Zhang, Yan
%A Hong, Lei
%A Cai, Yi
%A Sung, Szui
%Y Demner-Fushman, Dina
%Y Cohen, Kevin Bretonnel
%Y Ananiadou, Sophia
%Y Tsujii, Junichi
%S Proceedings of the 20th Workshop on Biomedical Language Processing
%D 2021
%8 June
%I Association for Computational Linguistics
%C Online
%F xu-etal-2021-chichealth
%X In this article, we describe our system for the MEDIQA 2021 shared tasks. We first describe our method for the second task, multi-answer summarization (MAS). For extractive summarization, we follow the rules of (CITATION): candidate sentences are first roughly scored using the RoBERTa model, and a Markov chain model is then used to evaluate the sentences in a fine-grained manner. Our team won first place in overall performance, with fourth place in the MAS task, seventh place in the RRS task, and eleventh place in the QS task. For the QS and RRS tasks, we investigate the performance of end-to-end pre-trained seq2seq models. Experiments show that adversarial training and reverse translation improve fine-tuning performance.
%R 10.18653/v1/2021.bionlp-1.29
%U https://aclanthology.org/2021.bionlp-1.29/
%U https://doi.org/10.18653/v1/2021.bionlp-1.29
%P 263-267

Markdown (Informal)

[ChicHealth @ MEDIQA 2021: Exploring the limits of pre-trained seq2seq models for medical summarization](https://aclanthology.org/2021.bionlp-1.29/) (Xu et al., BioNLP 2021)
