BibTeX
@inproceedings{karn-etal-2023-shs,
title = "shs-nlp at {R}ad{S}um23: Domain-Adaptive Pre-training of Instruction-tuned {LLM}s for Radiology Report Impression Generation",
author = "Karn, Sanjeev Kumar and
Ghosh, Rikhiya and
P, Kusuma and
Farri, Oladimeji",
editor = "Demner-fushman, Dina and
Ananiadou, Sophia and
Cohen, Kevin",
booktitle = "The 22nd Workshop on Biomedical Natural Language Processing and BioNLP Shared Tasks",
month = jul,
year = "2023",
address = "Toronto, Canada",
publisher = "Association for Computational Linguistics",
url = "https://aclanthology.org/2023.bionlp-1.57",
doi = "10.18653/v1/2023.bionlp-1.57",
pages = "550--556",
abstract = "Instruction-tuned generative large language models (LLMs), such as ChatGPT and Bloomz, possess excellent generalization abilities. However, they face limitations in understanding radiology reports, particularly when generating the IMPRESSIONS section from the FINDINGS section. These models tend to produce either verbose or incomplete IMPRESSIONS, mainly due to insufficient exposure to medical text data during training. We present a system that leverages large-scale medical text data for domain-adaptive pre-training of instruction-tuned LLMs, enhancing their medical knowledge and performance on specific medical tasks. We demonstrate that this system performs better in a zero-shot setting compared to several pretrain-and-finetune adaptation methods on the IMPRESSIONS generation task. Furthermore, it ranks 1st among participating systems in Task 1B: Radiology Report Summarization.",
}

MODS XML
<?xml version="1.0" encoding="UTF-8"?>
<modsCollection xmlns="http://www.loc.gov/mods/v3">
  <mods ID="karn-etal-2023-shs">
    <titleInfo>
      <title>shs-nlp at RadSum23: Domain-Adaptive Pre-training of Instruction-tuned LLMs for Radiology Report Impression Generation</title>
    </titleInfo>
    <name type="personal">
      <namePart type="given">Sanjeev</namePart>
      <namePart type="given">Kumar</namePart>
      <namePart type="family">Karn</namePart>
      <role>
        <roleTerm authority="marcrelator" type="text">author</roleTerm>
      </role>
    </name>
    <name type="personal">
      <namePart type="given">Rikhiya</namePart>
      <namePart type="family">Ghosh</namePart>
      <role>
        <roleTerm authority="marcrelator" type="text">author</roleTerm>
      </role>
    </name>
    <name type="personal">
      <namePart type="given">Kusuma</namePart>
      <namePart type="family">P</namePart>
      <role>
        <roleTerm authority="marcrelator" type="text">author</roleTerm>
      </role>
    </name>
    <name type="personal">
      <namePart type="given">Oladimeji</namePart>
      <namePart type="family">Farri</namePart>
      <role>
        <roleTerm authority="marcrelator" type="text">author</roleTerm>
      </role>
    </name>
    <originInfo>
      <dateIssued>2023-07</dateIssued>
    </originInfo>
    <typeOfResource>text</typeOfResource>
    <relatedItem type="host">
      <titleInfo>
        <title>The 22nd Workshop on Biomedical Natural Language Processing and BioNLP Shared Tasks</title>
      </titleInfo>
      <name type="personal">
        <namePart type="given">Dina</namePart>
        <namePart type="family">Demner-Fushman</namePart>
        <role>
          <roleTerm authority="marcrelator" type="text">editor</roleTerm>
        </role>
      </name>
      <name type="personal">
        <namePart type="given">Sophia</namePart>
        <namePart type="family">Ananiadou</namePart>
        <role>
          <roleTerm authority="marcrelator" type="text">editor</roleTerm>
        </role>
      </name>
      <name type="personal">
        <namePart type="given">Kevin</namePart>
        <namePart type="family">Cohen</namePart>
        <role>
          <roleTerm authority="marcrelator" type="text">editor</roleTerm>
        </role>
      </name>
      <originInfo>
        <publisher>Association for Computational Linguistics</publisher>
        <place>
          <placeTerm type="text">Toronto, Canada</placeTerm>
        </place>
      </originInfo>
      <genre authority="marcgt">conference publication</genre>
    </relatedItem>
    <abstract>Instruction-tuned generative large language models (LLMs), such as ChatGPT and Bloomz, possess excellent generalization abilities. However, they face limitations in understanding radiology reports, particularly when generating the IMPRESSIONS section from the FINDINGS section. These models tend to produce either verbose or incomplete IMPRESSIONS, mainly due to insufficient exposure to medical text data during training. We present a system that leverages large-scale medical text data for domain-adaptive pre-training of instruction-tuned LLMs, enhancing their medical knowledge and performance on specific medical tasks. We demonstrate that this system performs better in a zero-shot setting compared to several pretrain-and-finetune adaptation methods on the IMPRESSIONS generation task. Furthermore, it ranks 1st among participating systems in Task 1B: Radiology Report Summarization.</abstract>
    <identifier type="citekey">karn-etal-2023-shs</identifier>
    <identifier type="doi">10.18653/v1/2023.bionlp-1.57</identifier>
    <location>
      <url>https://aclanthology.org/2023.bionlp-1.57</url>
    </location>
    <part>
      <date>2023-07</date>
      <extent unit="page">
        <start>550</start>
        <end>556</end>
      </extent>
    </part>
  </mods>
</modsCollection>

Endnote
%0 Conference Proceedings
%T shs-nlp at RadSum23: Domain-Adaptive Pre-training of Instruction-tuned LLMs for Radiology Report Impression Generation
%A Karn, Sanjeev Kumar
%A Ghosh, Rikhiya
%A P, Kusuma
%A Farri, Oladimeji
%Y Demner-Fushman, Dina
%Y Ananiadou, Sophia
%Y Cohen, Kevin
%S The 22nd Workshop on Biomedical Natural Language Processing and BioNLP Shared Tasks
%D 2023
%8 July
%I Association for Computational Linguistics
%C Toronto, Canada
%F karn-etal-2023-shs
%X Instruction-tuned generative large language models (LLMs), such as ChatGPT and Bloomz, possess excellent generalization abilities. However, they face limitations in understanding radiology reports, particularly when generating the IMPRESSIONS section from the FINDINGS section. These models tend to produce either verbose or incomplete IMPRESSIONS, mainly due to insufficient exposure to medical text data during training. We present a system that leverages large-scale medical text data for domain-adaptive pre-training of instruction-tuned LLMs, enhancing their medical knowledge and performance on specific medical tasks. We demonstrate that this system performs better in a zero-shot setting compared to several pretrain-and-finetune adaptation methods on the IMPRESSIONS generation task. Furthermore, it ranks 1st among participating systems in Task 1B: Radiology Report Summarization.
%R 10.18653/v1/2023.bionlp-1.57
%U https://aclanthology.org/2023.bionlp-1.57
%U https://doi.org/10.18653/v1/2023.bionlp-1.57
%P 550-556

Markdown (Informal)
[shs-nlp at RadSum23: Domain-Adaptive Pre-training of Instruction-tuned LLMs for Radiology Report Impression Generation](https://aclanthology.org/2023.bionlp-1.57) (Karn et al., BioNLP 2023)

ACL
Sanjeev Kumar Karn, Rikhiya Ghosh, Kusuma P, and Oladimeji Farri. 2023. shs-nlp at RadSum23: Domain-Adaptive Pre-training of Instruction-tuned LLMs for Radiology Report Impression Generation. In The 22nd Workshop on Biomedical Natural Language Processing and BioNLP Shared Tasks, pages 550–556, Toronto, Canada. Association for Computational Linguistics.