Evaluating the Robustness of Adverse Drug Event Classification Models using Templates

Dorothea MacPhail; David Harbecke; Lisa Raithel; Sebastian Möller

doi:10.18653/v1/2024.bionlp-1.3

Abstract

An adverse drug effect (ADE) is any harmful event resulting from medical drug treatment. Despite their importance, ADEs are often under-reported in official channels. Some research has therefore turned to detecting discussions of ADEs in social media. Impressive results have been achieved in various attempts to detect ADEs. In a high-stakes domain such as medicine, however, an in-depth evaluation of a model’s abilities is crucial. We address the issue of thorough performance evaluation in detecting ADEs with hand-crafted templates for four capabilities, temporal order, negation, sentiment and beneficial effect. We find that models with similar performance on held-out test sets have varying results on these capabilities.
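The template-based evaluation described in the abstract can be illustrated with a minimal sketch. Everything below is an assumption made for illustration: the template strings, the drug and effect fillers, the label convention (1 = ADE, 0 = no ADE), and the toy `keyword_baseline` classifier are hypothetical and are not taken from the paper. The sketch only shows the general idea of expanding hand-crafted templates for one capability (negation) and scoring a classifier on the resulting cases.

```python
from itertools import product

# Hypothetical negation templates with gold labels (1 = ADE, 0 = no ADE).
# These are illustrative only, not the templates used in the paper.
NEGATION_TEMPLATES = [
    ("I took {drug} and did not get {effect}.", 0),
    ("I took {drug} and got {effect}.", 1),
]
# Hypothetical fillers for the template slots.
DRUGS = ["ibuprofen", "metformin"]
EFFECTS = ["a headache", "nausea"]


def fill_templates(templates, drugs, effects):
    """Expand each (template, label) pair over all drug/effect combinations."""
    return [
        (template.format(drug=drug, effect=effect), label)
        for (template, label), drug, effect in product(templates, drugs, effects)
    ]


def keyword_baseline(text):
    """Toy stand-in for a trained model: predicts ADE if an effect term appears."""
    return int(any(effect in text for effect in EFFECTS))


def capability_accuracy(predict, cases):
    """Fraction of template cases on which the classifier matches the gold label."""
    correct = sum(predict(text) == label for text, label in cases)
    return correct / len(cases)


cases = fill_templates(NEGATION_TEMPLATES, DRUGS, EFFECTS)
accuracy = capability_accuracy(keyword_baseline, cases)
print(f"negation capability accuracy: {accuracy:.2f}")
```

Because the toy baseline ignores negation, it labels every case as an ADE and gets only the non-negated half right. This is the kind of capability-specific weakness that a held-out test score alone might not reveal.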

Anthology ID: 2024.bionlp-1.3
Volume: Proceedings of the 23rd Workshop on Biomedical Natural Language Processing
Month: August
Year: 2024
Address: Bangkok, Thailand
Editors: Dina Demner-Fushman, Sophia Ananiadou, Makoto Miwa, Kirk Roberts, Junichi Tsujii
Venues: BioNLP | WS
SIG: SIGBIOMED
Publisher: Association for Computational Linguistics
Pages: 25–38
URL: https://aclanthology.org/2024.bionlp-1.3/
DOI: 10.18653/v1/2024.bionlp-1.3
Bibkey: macphail-etal-2024-evaluating
Cite (ACL): Dorothea MacPhail, David Harbecke, Lisa Raithel, and Sebastian Möller. 2024. Evaluating the Robustness of Adverse Drug Event Classification Models using Templates. In Proceedings of the 23rd Workshop on Biomedical Natural Language Processing, pages 25–38, Bangkok, Thailand. Association for Computational Linguistics.
Cite (Informal): Evaluating the Robustness of Adverse Drug Event Classification Models using Templates (MacPhail et al., BioNLP 2024)
PDF: https://aclanthology.org/2024.bionlp-1.3.pdf

Export citation

BibTeX

@inproceedings{macphail-etal-2024-evaluating,
    title = "Evaluating the Robustness of Adverse Drug Event Classification Models using Templates",
    author = {MacPhail, Dorothea  and
      Harbecke, David  and
      Raithel, Lisa  and
      M{\"o}ller, Sebastian},
    editor = "Demner-Fushman, Dina  and
      Ananiadou, Sophia  and
      Miwa, Makoto  and
      Roberts, Kirk  and
      Tsujii, Junichi",
    booktitle = "Proceedings of the 23rd Workshop on Biomedical Natural Language Processing",
    month = aug,
    year = "2024",
    address = "Bangkok, Thailand",
    publisher = "Association for Computational Linguistics",
    url = "https://aclanthology.org/2024.bionlp-1.3/",
    doi = "10.18653/v1/2024.bionlp-1.3",
    pages = "25--38",
    abstract = "An adverse drug effect (ADE) is any harmful event resulting from medical drug treatment. Despite their importance, ADEs are often under-reported in official channels. Some research has therefore turned to detecting discussions of ADEs in social media. Impressive results have been achieved in various attempts to detect ADEs. In a high-stakes domain such as medicine, however, an in-depth evaluation of a model{'}s abilities is crucial. We address the issue of thorough performance evaluation in detecting ADEs with hand-crafted templates for four capabilities, temporal order, negation, sentiment and beneficial effect. We find that models with similar performance on held-out test sets have varying results on these capabilities."
}

MODS XML

<?xml version="1.0" encoding="UTF-8"?>
<modsCollection xmlns="http://www.loc.gov/mods/v3">
<mods ID="macphail-etal-2024-evaluating">
    <titleInfo>
        <title>Evaluating the Robustness of Adverse Drug Event Classification Models using Templates</title>
    </titleInfo>
    <name type="personal">
        <namePart type="given">Dorothea</namePart>
        <namePart type="family">MacPhail</namePart>
        <role>
            <roleTerm authority="marcrelator" type="text">author</roleTerm>
        </role>
    </name>
    <name type="personal">
        <namePart type="given">David</namePart>
        <namePart type="family">Harbecke</namePart>
        <role>
            <roleTerm authority="marcrelator" type="text">author</roleTerm>
        </role>
    </name>
    <name type="personal">
        <namePart type="given">Lisa</namePart>
        <namePart type="family">Raithel</namePart>
        <role>
            <roleTerm authority="marcrelator" type="text">author</roleTerm>
        </role>
    </name>
    <name type="personal">
        <namePart type="given">Sebastian</namePart>
        <namePart type="family">Möller</namePart>
        <role>
            <roleTerm authority="marcrelator" type="text">author</roleTerm>
        </role>
    </name>
    <originInfo>
        <dateIssued>2024-08</dateIssued>
    </originInfo>
    <typeOfResource>text</typeOfResource>
    <relatedItem type="host">
        <titleInfo>
            <title>Proceedings of the 23rd Workshop on Biomedical Natural Language Processing</title>
        </titleInfo>
        <name type="personal">
            <namePart type="given">Dina</namePart>
            <namePart type="family">Demner-Fushman</namePart>
            <role>
                <roleTerm authority="marcrelator" type="text">editor</roleTerm>
            </role>
        </name>
        <name type="personal">
            <namePart type="given">Sophia</namePart>
            <namePart type="family">Ananiadou</namePart>
            <role>
                <roleTerm authority="marcrelator" type="text">editor</roleTerm>
            </role>
        </name>
        <name type="personal">
            <namePart type="given">Makoto</namePart>
            <namePart type="family">Miwa</namePart>
            <role>
                <roleTerm authority="marcrelator" type="text">editor</roleTerm>
            </role>
        </name>
        <name type="personal">
            <namePart type="given">Kirk</namePart>
            <namePart type="family">Roberts</namePart>
            <role>
                <roleTerm authority="marcrelator" type="text">editor</roleTerm>
            </role>
        </name>
        <name type="personal">
            <namePart type="given">Junichi</namePart>
            <namePart type="family">Tsujii</namePart>
            <role>
                <roleTerm authority="marcrelator" type="text">editor</roleTerm>
            </role>
        </name>
        <originInfo>
            <publisher>Association for Computational Linguistics</publisher>
            <place>
                <placeTerm type="text">Bangkok, Thailand</placeTerm>
            </place>
        </originInfo>
        <genre authority="marcgt">conference publication</genre>
    </relatedItem>
    <abstract>An adverse drug effect (ADE) is any harmful event resulting from medical drug treatment. Despite their importance, ADEs are often under-reported in official channels. Some research has therefore turned to detecting discussions of ADEs in social media. Impressive results have been achieved in various attempts to detect ADEs. In a high-stakes domain such as medicine, however, an in-depth evaluation of a model’s abilities is crucial. We address the issue of thorough performance evaluation in detecting ADEs with hand-crafted templates for four capabilities, temporal order, negation, sentiment and beneficial effect. We find that models with similar performance on held-out test sets have varying results on these capabilities.</abstract>
    <identifier type="citekey">macphail-etal-2024-evaluating</identifier>
    <identifier type="doi">10.18653/v1/2024.bionlp-1.3</identifier>
    <location>
        <url>https://aclanthology.org/2024.bionlp-1.3/</url>
    </location>
    <part>
        <date>2024-08</date>
        <extent unit="page">
            <start>25</start>
            <end>38</end>
        </extent>
    </part>
</mods>
</modsCollection>

Endnote

%0 Conference Proceedings
%T Evaluating the Robustness of Adverse Drug Event Classification Models using Templates
%A MacPhail, Dorothea
%A Harbecke, David
%A Raithel, Lisa
%A Möller, Sebastian
%Y Demner-Fushman, Dina
%Y Ananiadou, Sophia
%Y Miwa, Makoto
%Y Roberts, Kirk
%Y Tsujii, Junichi
%S Proceedings of the 23rd Workshop on Biomedical Natural Language Processing
%D 2024
%8 August
%I Association for Computational Linguistics
%C Bangkok, Thailand
%F macphail-etal-2024-evaluating
%X An adverse drug effect (ADE) is any harmful event resulting from medical drug treatment. Despite their importance, ADEs are often under-reported in official channels. Some research has therefore turned to detecting discussions of ADEs in social media. Impressive results have been achieved in various attempts to detect ADEs. In a high-stakes domain such as medicine, however, an in-depth evaluation of a model’s abilities is crucial. We address the issue of thorough performance evaluation in detecting ADEs with hand-crafted templates for four capabilities, temporal order, negation, sentiment and beneficial effect. We find that models with similar performance on held-out test sets have varying results on these capabilities.
%R 10.18653/v1/2024.bionlp-1.3
%U https://aclanthology.org/2024.bionlp-1.3/
%U https://doi.org/10.18653/v1/2024.bionlp-1.3
%P 25-38

Markdown (Informal)

[Evaluating the Robustness of Adverse Drug Event Classification Models using Templates](https://aclanthology.org/2024.bionlp-1.3/) (MacPhail et al., BioNLP 2024)