Building Certified Medical Chatbots: Overcoming Unstructured Data Limitations with Modular RAG

Leonardo Sanna; Patrizio Bellan; Simone Magnolini; Marina Segala; Saba Ghanbari Haez; Monica Consolandi; Mauro Dragoni

Building Certified Medical Chatbots: Overcoming Unstructured Data Limitations with Modular RAG

Leonardo Sanna, Patrizio Bellan, Simone Magnolini, Marina Segala, Saba Ghanbari Haez, Monica Consolandi, Mauro Dragoni

Abstract

Creating a certified conversational agent poses several issues. The need to manage fine-grained information delivery and the necessity to provide reliable medical information requires a notable effort, especially in dataset preparation. In this paper, we investigate the challenges of building a certified medical chatbot in Italian that provides information about pregnancy and early childhood. We show some negative initial results regarding the possibility of creating a certified conversational agent within the RASA framework starting from unstructured data. Finally, we propose a modular RAG model to implement a Large Language Model in a certified context, overcoming data limitations and enabling data collection on actual conversations.

Anthology ID:: 2024.cl4health-1.15
Volume:: Proceedings of the First Workshop on Patient-Oriented Language Processing (CL4Health) @ LREC-COLING 2024
Month:: May
Year:: 2024
Address:: Torino, Italia
Editors:: Dina Demner-Fushman, Sophia Ananiadou, Paul Thompson, Brian Ondov
Venues:: CL4Health | WS
SIG:
Publisher:: ELRA and ICCL
Note:
Pages:: 124–130
Language:
URL:: https://aclanthology.org/2024.cl4health-1.15/
DOI:
Bibkey:
Cite (ACL):: Leonardo Sanna, Patrizio Bellan, Simone Magnolini, Marina Segala, Saba Ghanbari Haez, Monica Consolandi, and Mauro Dragoni. 2024. Building Certified Medical Chatbots: Overcoming Unstructured Data Limitations with Modular RAG. In Proceedings of the First Workshop on Patient-Oriented Language Processing (CL4Health) @ LREC-COLING 2024, pages 124–130, Torino, Italia. ELRA and ICCL.
Cite (Informal):: Building Certified Medical Chatbots: Overcoming Unstructured Data Limitations with Modular RAG (Sanna et al., CL4Health 2024)
Copy Citation:
PDF:: https://aclanthology.org/2024.cl4health-1.15.pdf

PDF Cite Search Fix data