Extracting Factual Min/Max Age Information from Clinical Trial Studies

Yufang Hou, Debasis Ganguly, Léa Deleris, Francesca Bonin


Abstract
Population age information is an essential characteristic of clinical trials. In this paper, we focus on extracting minimum and maximum (min/max) age values for the study samples from clinical research articles. Specifically, we investigate the use of a neural network model for question answering (QA) to address this information extraction (IE) task. The min/max age QA model is trained on the large collection of structured clinical study records from ClinicalTrials.gov. For each article, based on the multiple min and max age values extracted by the QA model, we predict the actual min/max age values for the study samples and filter out non-factual age expressions. Our system improves the results over (i) a passage-retrieval-based IE system and (ii) a CRF-based system by a large margin when evaluated on an annotated dataset consisting of 50 research papers on smoking cessation.
Anthology ID:
W19-1914
Volume:
Proceedings of the 2nd Clinical Natural Language Processing Workshop
Month:
June
Year:
2019
Address:
Minneapolis, Minnesota, USA
Editors:
Anna Rumshisky, Kirk Roberts, Steven Bethard, Tristan Naumann
Venue:
ClinicalNLP
Publisher:
Association for Computational Linguistics
Pages:
107–116
URL:
https://aclanthology.org/W19-1914
DOI:
10.18653/v1/W19-1914
Cite (ACL):
Yufang Hou, Debasis Ganguly, Léa Deleris, and Francesca Bonin. 2019. Extracting Factual Min/Max Age Information from Clinical Trial Studies. In Proceedings of the 2nd Clinical Natural Language Processing Workshop, pages 107–116, Minneapolis, Minnesota, USA. Association for Computational Linguistics.
Cite (Informal):
Extracting Factual Min/Max Age Information from Clinical Trial Studies (Hou et al., ClinicalNLP 2019)
PDF:
https://aclanthology.org/W19-1914.pdf
Data
CliCR