Lessons from Natural Language Inference in the Clinical Domain

Alexey Romanov, Chaitanya Shivade


Abstract
State of the art models using deep neural networks have become very good in learning an accurate mapping from inputs to outputs. However, they still lack generalization capabilities in conditions that differ from the ones encountered during training. This is even more challenging in specialized, and knowledge intensive domains, where training data is limited. To address this gap, we introduce MedNLI - a dataset annotated by doctors, performing a natural language inference task (NLI), grounded in the medical history of patients. We present strategies to: 1) leverage transfer learning using datasets from the open domain, (e.g. SNLI) and 2) incorporate domain knowledge from external data and lexical sources (e.g. medical terminologies). Our results demonstrate performance gains using both strategies.
Anthology ID:
D18-1187
Volume:
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing
Month:
October-November
Year:
2018
Address:
Brussels, Belgium
Editors:
Ellen Riloff, David Chiang, Julia Hockenmaier, Jun’ichi Tsujii
Venue:
EMNLP
SIG:
SIGDAT
Publisher:
Association for Computational Linguistics
Note:
Pages:
1586–1596
Language:
URL:
https://aclanthology.org/D18-1187
DOI:
10.18653/v1/D18-1187
Bibkey:
Cite (ACL):
Alexey Romanov and Chaitanya Shivade. 2018. Lessons from Natural Language Inference in the Clinical Domain. In Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, pages 1586–1596, Brussels, Belgium. Association for Computational Linguistics.
Cite (Informal):
Lessons from Natural Language Inference in the Clinical Domain (Romanov & Shivade, EMNLP 2018)
Copy Citation:
PDF:
https://aclanthology.org/D18-1187.pdf
Code
 additional community code
Data
MIMIC-IIIMultiNLISNLI