Augmenting Black-box LLMs with Medical Textbooks for Biomedical Question Answering

Yubo Wang; Xueguang Ma; Wenhu Chen

doi:10.18653/v1/2024.findings-emnlp.95

Abstract

Large-scale language models (LLMs) like ChatGPT have demonstrated impressive abilities in generating responses based on human instructions. However, their use in the medical field can be challenging due to their lack of specific, in-depth knowledge. In this study, we present a system called LLMs Augmented with Medical Textbooks (LLM-AMT), designed to enhance the proficiency of LLMs in specialized domains. LLM-AMT integrates authoritative medical textbooks into the LLMs’ framework using plug-and-play modules. These modules include a Query Augmenter, a Hybrid Textbook Retriever, and a Knowledge Self-Refiner. Together, they incorporate authoritative medical knowledge. Additionally, an LLM Reader aids in contextual understanding. Our experimental results on three medical QA tasks demonstrate that LLM-AMT significantly improves response quality, with accuracy gains ranging from 11.6% to 16.6%. Notably, with GPT-4-Turbo as the base model, LLM-AMT outperforms the specialized Med-PaLM 2 model, which was pre-trained on a massive medical corpus, by 2-3%. We found that, despite being 100× smaller in size, medical textbooks as a retrieval corpus are a more effective knowledge database than Wikipedia in the medical domain, boosting performance by 7.8%-13.7%.
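To make the module order named in the abstract concrete, the following is a minimal sketch of how a query augmenter, a hybrid retriever, a self-refiner, and a reader might be chained around a black-box LLM. It is not the authors' implementation: the `LLM` callable interface, every function name, the prompts, and the toy scoring (word-level and character-trigram Jaccard overlap standing in for the two retrieval signals) are assumptions made purely for illustration.

```python
from typing import Callable, List, Set

# Hypothetical black-box LLM interface: prompt in, text out.
LLM = Callable[[str], str]


def augment_query(question: str, llm: LLM) -> str:
    """Query-augmenter stand-in: append LLM-suggested related terms."""
    expansion = llm(f"List medical terms related to: {question}")
    return f"{question} {expansion}"


def _tokens(text: str) -> Set[str]:
    return set(text.lower().split())


def _trigrams(text: str) -> Set[str]:
    s = text.lower()
    return {s[i:i + 3] for i in range(len(s) - 2)}


def _jaccard(a: Set[str], b: Set[str]) -> float:
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def hybrid_retrieve(query: str, passages: List[str], k: int = 2) -> List[str]:
    """Hybrid-retriever stand-in: average two toy similarity signals.

    A real system would combine, for example, a sparse and a dense retriever;
    the two Jaccard scores here are placeholders (assumption).
    """
    def score(passage: str) -> float:
        word_level = _jaccard(_tokens(query), _tokens(passage))
        char_level = _jaccard(_trigrams(query), _trigrams(passage))
        return 0.5 * word_level + 0.5 * char_level

    return sorted(passages, key=score, reverse=True)[:k]


def self_refine(question: str, passages: List[str], llm: LLM) -> List[str]:
    """Self-refiner stand-in: keep passages the LLM judges relevant."""
    kept = []
    for passage in passages:
        verdict = llm(
            f"Is this passage relevant to '{question}'? Answer yes or no.\n{passage}"
        )
        if verdict.strip().lower().startswith("yes"):
            kept.append(passage)
    return kept


def read_and_answer(question: str, passages: List[str], llm: LLM) -> str:
    """Reader stand-in: answer the question given the refined evidence."""
    context = "\n".join(passages)
    return llm(f"Context:\n{context}\n\nQuestion: {question}\nAnswer:")


def llm_amt_sketch(question: str, corpus: List[str], llm: LLM, k: int = 2) -> str:
    """Chain the four stages in the order the abstract lists them."""
    query = augment_query(question, llm)
    candidates = hybrid_retrieve(query, corpus, k)
    evidence = self_refine(question, candidates, llm)
    return read_and_answer(question, evidence, llm)
```

Because the LLM is passed in as a plain callable, the same sketch can be exercised with a canned stub for testing or wrapped around any hosted model; none of the stages needs access to model weights, which is the sense in which the modules are plug-and-play.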

Anthology ID: 2024.findings-emnlp.95
Volume: Findings of the Association for Computational Linguistics: EMNLP 2024
Month: November
Year: 2024
Address: Miami, Florida, USA
Editors: Yaser Al-Onaizan, Mohit Bansal, Yun-Nung Chen
Venue: Findings
Publisher: Association for Computational Linguistics
Pages: 1754–1770
URL: https://aclanthology.org/2024.findings-emnlp.95/
DOI: 10.18653/v1/2024.findings-emnlp.95
Cite (ACL): Yubo Wang, Xueguang Ma, and Wenhu Chen. 2024. Augmenting Black-box LLMs with Medical Textbooks for Biomedical Question Answering. In Findings of the Association for Computational Linguistics: EMNLP 2024, pages 1754–1770, Miami, Florida, USA. Association for Computational Linguistics.
Cite (Informal): Augmenting Black-box LLMs with Medical Textbooks for Biomedical Question Answering (Wang et al., Findings 2024)
PDF: https://aclanthology.org/2024.findings-emnlp.95.pdf
Software: 2024.findings-emnlp.95.software.zip
