Do Mixed-Vendor Multi-Agent LLMs Improve Clinical Diagnosis?

Grace Chang Yuan; Xiaoman Zhang; Sung Eun Kim; Pranav Rajpurkar

Do Mixed-Vendor Multi-Agent LLMs Improve Clinical Diagnosis?

Grace Chang Yuan, Xiaoman Zhang, Sung Eun Kim, Pranav Rajpurkar

Abstract

Multi-agent large language model (LLM) systems have emerged as a promising approach for clinical diagnosis, leveraging collaboration among agents to refine medical reasoning. However, most existing frameworks rely on single-vendor teams (e.g., multiple agents from the same model family), which risk correlated failure modes that reinforce shared biases rather than correcting them. We investigate the impact of vendor diversity by comparing Single-LLM, Single-Vendor, and Mixed-Vendor Multi-Agent Conversation (MAC) frameworks. Using three doctor agents instantiated with o4-mini, Gemini-2.5-Pro, and Claude-4.5-Sonnet, we evaluate performance on RareBench and DiagnosisArena. Mixed-vendor configurations consistently outperform single-vendor counterparts, achieving state-of-the-art recall and accuracy. Overlap analysis reveals the underlying mechanism: mixed-vendor teams pool complementary inductive biases, surfacing correct diagnoses that individual models or homogeneous teams collectively miss. These results highlight vendor diversity as a key design principle for robust clinical diagnostic systems.

Anthology ID:: 2026.healing-1.1
Volume:: Proceedings of the 1st Workshop on Linguistic Analysis for Health (HeaLing 2026)
Month:: March
Year:: 2026
Address:: Rabat, Morocco
Editors:: Vera Danilova, Murathan Kurfalı, Ylva Söderfeldt, Julia Reed, Andrew Burchell
Venues:: HeaLing | WS
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 1–18
Language:
URL:: https://aclanthology.org/2026.healing-1.1/
DOI:
Bibkey:
Cite (ACL):: Grace Chang Yuan, Xiaoman Zhang, Sung Eun Kim, and Pranav Rajpurkar. 2026. Do Mixed-Vendor Multi-Agent LLMs Improve Clinical Diagnosis?. In Proceedings of the 1st Workshop on Linguistic Analysis for Health (HeaLing 2026), pages 1–18, Rabat, Morocco. Association for Computational Linguistics.
Cite (Informal):: Do Mixed-Vendor Multi-Agent LLMs Improve Clinical Diagnosis? (Yuan et al., HeaLing 2026)
Copy Citation:
PDF:: https://aclanthology.org/2026.healing-1.1.pdf

PDF Cite Search Fix data