SAHA: Samvad AI for Healthcare Assistance

Aditya Kumar; Rakesh Kumar Nayak; Janhavi Naik; Ritesh Kumar; Dhiraj Bhatia; Shreya Agarwal

doi:10.18653/v1/2025.nlpai4health-main.8

SAHA: Samvad AI for Healthcare Assistance

Aditya Kumar, Rakesh Kumar Nayak, Janhavi Naik, Ritesh Kumar, Dhiraj Bhatia, Shreya Agarwal

Abstract

This paper deals with the dual task of developing a medical question answering (QA) system and generating concise summaries of medical dialogue data across nine languages (English and eight Indian languages). The medical dialogue data focuses on two critical health issues: Head and Neck Cancer (HNC) and Cystic Fibrosis (NLP AI4health shared task). The proposed framework utilises a dual approach: a fine-tuned small Multilingual Text-to-Text Transfer Transformer (mT5) model for the conversational summarisation component and a fine-tuned Retrieval Augmented Generation (RAG) system integrating the dense intfloat/e5-large language model for the language-independent QA component. The efficacy of the proposed approaches is demonstrated by achieving promising precision in the QA task. Our framework achieved the highest F1 scores in QA for the three Indian languages, with F1 score of 0.3995 in Marathi, 0.7803 in Bangla, and 0.74759 in Hindi, respectively. We achieved the highest cometscore of 0.5626 on the Gujarati QA test set. For the dialogue summarisation task, our model registered the highest ROUGE-2 and ROUGE-L precision across all eight Indian languages, with English being the sole exception. These results confirm our approach potential to improve e-health in dialogue data for low-resource Indian languages.

Anthology ID:: 2025.nlpai4health-main.8
Volume:: NLP-AI4Health
Month:: December
Year:: 2025
Address:: Mumbai, India
Editors:: Arun Zechariah, Balu Krishna S, Dipti Misra Sharma, Hannah Mary Thomas, Joy Mammen, Parameswari Krishnamurthy, Vandan Mujadia
Venues:: NLP-AI4Health | WS
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 80–85
Language:
URL:: https://aclanthology.org/2025.nlpai4health-main.8/
DOI:: 10.18653/v1/2025.nlpai4health-main.8
Bibkey:
Cite (ACL):: Aditya Kumar, Rakesh Kumar Nayak, Janhavi Naik, Ritesh Kumar, Dhiraj Bhatia, and Shreya Agarwal. 2025. SAHA: Samvad AI for Healthcare Assistance. In NLP-AI4Health, pages 80–85, Mumbai, India. Association for Computational Linguistics.
Cite (Informal):: SAHA: Samvad AI for Healthcare Assistance (Kumar et al., NLP-AI4Health 2025)
Copy Citation:
PDF:: https://aclanthology.org/2025.nlpai4health-main.8.pdf

PDF Cite Search Fix data