Chatbot To Help Patients Understand Their Health

Won Seok Jang; Hieu Tran; Manav Shaileshkumar Mistry; Sai Kiran Gandluri; Yifan Zhang; Sharmin Sultana; Sunjae Kwon; Yuan Zhang; Zonghai Yao; Hong Yu

doi:10.18653/v1/2025.findings-emnlp.351

Chatbot To Help Patients Understand Their Health

Won Seok Jang, Hieu Tran, Manav Shaileshkumar Mistry, Sai Kiran Gandluri, Yifan Zhang, Sharmin Sultana, Sunjae Kwon, Yuan Zhang, Zonghai Yao, Hong Yu

Abstract

Patients must possess the knowledge necessary to actively participate in their care. To this end, we developed NoteAid-Chatbot, a conversational AI designed to help patients better understand their health through a novel framework of learning as conversation. We introduce a new learning paradigm that leverages a multi-agent large language model (LLM) and reinforcement learning (RL) framework—without relying on costly human-generated training data. Specifically, NoteAid-Chatbot was built on a lightweight 3-billion-parameter LLaMA 3.2 model using a two-stage training approach: initial supervised fine-tuning on conversational data synthetically generated using medical conversation strategies, followed by RL with rewards derived from patient understanding assessments in simulated hospital discharge scenarios. Our evaluation, which includes comprehensive human-aligned assessments and case studies, demonstrates that NoteAid-Chatbot exhibits key emergent behaviors critical for patient education—such as clarity, relevance, and structured dialogue—even though it received no explicit supervision for these attributes. Our results show that even simple Proximal Policy Optimization (PPO)-based reward modeling can successfully train lightweight, domain-specific chatbots to handle multi-turn interactions, incorporate diverse educational strategies, and meet nuanced communication objectives. Our Turing test demonstrates that NoteAid-Chatbot surpasses non-expert human. Although our current focus is on healthcare, the framework we present illustrates the feasibility and promise of applying low-cost, PPO-based RL to realistic, open-ended conversational domains—broadening the applicability of RL-based alignment methods.

Anthology ID:: 2025.findings-emnlp.351
Volume:: Findings of the Association for Computational Linguistics: EMNLP 2025
Month:: November
Year:: 2025
Address:: Suzhou, China
Editors:: Christos Christodoulopoulos, Tanmoy Chakraborty, Carolyn Rose, Violet Peng
Venue:: Findings
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 6598–6627
Language:
URL:: https://aclanthology.org/2025.findings-emnlp.351/
DOI:: 10.18653/v1/2025.findings-emnlp.351
Bibkey:
Cite (ACL):: Won Seok Jang, Hieu Tran, Manav Shaileshkumar Mistry, Sai Kiran Gandluri, Yifan Zhang, Sharmin Sultana, Sunjae Kwon, Yuan Zhang, Zonghai Yao, and Hong Yu. 2025. Chatbot To Help Patients Understand Their Health. In Findings of the Association for Computational Linguistics: EMNLP 2025, pages 6598–6627, Suzhou, China. Association for Computational Linguistics.
Cite (Informal):: Chatbot To Help Patients Understand Their Health (Jang et al., Findings 2025)
Copy Citation:
PDF:: https://aclanthology.org/2025.findings-emnlp.351.pdf
Checklist:: 2025.findings-emnlp.351.checklist.pdf

PDF Cite Search Checklist Fix data