The Evaluation of Medical Terms Complexity Using Lexical Features and Large Language Models

Liliya Makhmutova, Giancarlo Dondoni Salton, Fernando Perez-Tellez, Robert J. Ross


Abstract
Understanding medical terminology is critical for effective patient-doctor communication, yet many patients struggle with complex jargon. This study compares Machine Learning (ML) models and Large Language Models (LLMs) in predicting medical term complexity as a means of improving doctor-patient communication. Using survey data from 252 participants rating 1,000 words along with various lexical features, we measured the accuracy of both model types. The results show that LLMs outperform traditional lexical-feature-based models, suggesting their potential to identify complex medical terms and lay the groundwork for personalised patient-doctor communication.
Anthology ID:
2025.ranlp-1.79
Volume:
Proceedings of the 15th International Conference on Recent Advances in Natural Language Processing - Natural Language Processing in the Generative AI Era
Month:
September
Year:
2025
Address:
Varna, Bulgaria
Editors:
Galia Angelova, Maria Kunilovskaya, Marie Escribe, Ruslan Mitkov
Venue:
RANLP
SIG:
Publisher:
INCOMA Ltd., Shoumen, Bulgaria
Note:
Pages:
682–693
Language:
URL:
https://aclanthology.org/2025.ranlp-1.79/
DOI:
Bibkey:
Cite (ACL):
Liliya Makhmutova, Giancarlo Dondoni Salton, Fernando Perez-Tellez, and Robert J. Ross. 2025. The Evaluation of Medical Terms Complexity Using Lexical Features and Large Language Models. In Proceedings of the 15th International Conference on Recent Advances in Natural Language Processing - Natural Language Processing in the Generative AI Era, pages 682–693, Varna, Bulgaria. INCOMA Ltd., Shoumen, Bulgaria.
Cite (Informal):
The Evaluation of Medical Terms Complexity Using Lexical Features and Large Language Models (Makhmutova et al., RANLP 2025)
Copy Citation:
PDF:
https://aclanthology.org/2025.ranlp-1.79.pdf