Development of a Benchmark Corpus for Medical Device Adverse Event Detection

Susmitha Wunnava, David A. Harris, Florence T. Bourgeois, Timothy A. Miller


Abstract
The U.S. Food and Drug Administration (FDA) collects real-world adverse events, including device-associated deaths, injuries, and malfunctions, through passive reporting to the agency’s Manufacturer and User Facility Device Experience (MAUDE) database. However, this system’s full potential remains untapped given the extensive use of unstructured text in medical device adverse event reports and lack of FDA resources and expertise to properly analyze all available data. In this work, we focus on addressing this limitation through the development of an annotated benchmark corpus to support the design and development of state-of-the-art NLP approaches towards automatic extraction of device-related adverse event information from FDA Medical Device Adverse Event Reports. We develop a dataset of labeled medical device reports from a diverse set of high-risk device types, that can be used for supervised machine learning. We develop annotation guidelines and manually annotate for nine entity types. The resulting dataset contains 935 annotated adverse event reports, containing 12252 annotated spans across the nine entity types. The dataset developed in this work will be made publicly available upon publication.
Anthology ID:
2024.cl4health-1.29
Volume:
Proceedings of the First Workshop on Patient-Oriented Language Processing (CL4Health) @ LREC-COLING 2024
Month:
May
Year:
2024
Address:
Torino, Italia
Editors:
Dina Demner-Fushman, Sophia Ananiadou, Paul Thompson, Brian Ondov
Venues:
CL4Health | WS
SIG:
Publisher:
ELRA and ICCL
Note:
Pages:
240–245
Language:
URL:
https://aclanthology.org/2024.cl4health-1.29
DOI:
Bibkey:
Cite (ACL):
Susmitha Wunnava, David A. Harris, Florence T. Bourgeois, and Timothy A. Miller. 2024. Development of a Benchmark Corpus for Medical Device Adverse Event Detection. In Proceedings of the First Workshop on Patient-Oriented Language Processing (CL4Health) @ LREC-COLING 2024, pages 240–245, Torino, Italia. ELRA and ICCL.
Cite (Informal):
Development of a Benchmark Corpus for Medical Device Adverse Event Detection (Wunnava et al., CL4Health-WS 2024)
Copy Citation:
PDF:
https://aclanthology.org/2024.cl4health-1.29.pdf