MeNTi: Bridging Medical Calculator and LLM Agent with Nested Tool Calling

Yakun Zhu; Shaohang Wei; Xu Wang; Kui Xue; Shaoting Zhang; Xiaofan Zhang

doi:10.18653/v1/2025.naacl-long.263

MeNTi: Bridging Medical Calculator and LLM Agent with Nested Tool Calling

Yakun Zhu, Shaohang Wei, Xu Wang, Kui Xue, Shaoting Zhang, Xiaofan Zhang

Abstract

Integrating tools into Large Language Models (LLMs) has facilitated the widespread application. Despite this, in specialized downstream task contexts, reliance solely on tools is insufficient to fully address the complexities of the real world. This particularly restricts the effective deployment of LLMs in fields such as medicine. In this paper, we focus on the downstream tasks of medical calculators, which use standardized tests to assess an individual’s health status. We introduce MeNTi, a universal agent architecture for LLMs. MeNTi integrates a specialized medical toolkit and employs meta-tool and nested calling mechanisms to enhance LLM tool utilization. Specifically, it achieves flexible tool selection and nested tool calling to address practical issues faced in intricate medical scenarios, including calculator selection, slot filling, and unit conversion. To assess the capabilities of LLMs for quantitative assessment throughout the clinical process of calculator scenarios, we introduce CalcQA. This benchmark requires LLMs to use medical calculators to perform calculations and assess patient health status. CalcQA is constructed by professional physicians and includes 100 case-calculator pairs, complemented by a toolkit of 281 medical tools. The experimental results demonstrate significant performance improvements with our framework. This research paves new directions for applying LLMs in demanding scenarios of medicine.

Anthology ID:: 2025.naacl-long.263
Volume:: Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers)
Month:: April
Year:: 2025
Address:: Albuquerque, New Mexico
Editors:: Luis Chiruzzo, Alan Ritter, Lu Wang
Venue:: NAACL
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 5097–5116
Language:
URL:: https://aclanthology.org/2025.naacl-long.263/
DOI:: 10.18653/v1/2025.naacl-long.263
Bibkey:
Cite (ACL):: Yakun Zhu, Shaohang Wei, Xu Wang, Kui Xue, Shaoting Zhang, and Xiaofan Zhang. 2025. MeNTi: Bridging Medical Calculator and LLM Agent with Nested Tool Calling. In Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), pages 5097–5116, Albuquerque, New Mexico. Association for Computational Linguistics.
Cite (Informal):: MeNTi: Bridging Medical Calculator and LLM Agent with Nested Tool Calling (Zhu et al., NAACL 2025)
Copy Citation:
PDF:: https://aclanthology.org/2025.naacl-long.263.pdf

PDF Cite Search Fix data