@inproceedings{karim-uzuner-2025-masonnlp,
title = "{M}ason{NLP} at {MEDIQA}-{OE} 2025: Assessing Large Language Models for Structured Medical Order Extraction",
author = "Karim, A H M Rezaul and
Uzuner, Ozlem",
editor = "Ben Abacha, Asma and
Bethard, Steven and
Bitterman, Danielle and
Naumann, Tristan and
Roberts, Kirk",
booktitle = "Proceedings of the 7th Clinical Natural Language Processing Workshop",
month = oct,
year = "2025",
address = "Virtual",
publisher = "Association for Computational Linguistics",
url = "https://aclanthology.org/2025.clinicalnlp-1.7/",
pages = "57--67",
abstract = "Medical order extraction is essential for structuring actionable clinical information, supporting decision-making, and enabling downstream applications such as documentation and workflow automation. Orders may be embedded in diverse sources, including electronic health records, discharge summaries, and multi-turn doctor{--}patient dialogues, and can span categories such as medications, laboratory tests, imaging studies, and follow-up actions. The MEDIQA-OE 2025 shared task focuses on extracting structured medical orders from extended conversational transcripts, requiring the identification of order type, description, reason, and provenance. We present the MasonNLP submission, which ranked 5th among 17 participating teams with 105 total submissions. Our approach uses a general-purpose, instruction-tuned LLaMA-4 17B model without domain-specific fine-tuning, guided by a single in-context example. This few-shot configuration achieved an average F1 score of 37.76, with notable improvements in reason and provenance accuracy. These results demonstrate that large, non-domain-specific LLMs, when paired with effective prompt engineering, can serve as strong, scalable baselines for specialized clinical NLP tasks."
}

<?xml version="1.0" encoding="UTF-8"?>
<modsCollection xmlns="http://www.loc.gov/mods/v3">
<mods ID="karim-uzuner-2025-masonnlp">
<titleInfo>
<title>MasonNLP at MEDIQA-OE 2025: Assessing Large Language Models for Structured Medical Order Extraction</title>
</titleInfo>
<name type="personal">
<namePart type="given">A</namePart>
<namePart type="given">H</namePart>
<namePart type="given">M</namePart>
<namePart type="given">Rezaul</namePart>
<namePart type="family">Karim</namePart>
<role>
<roleTerm authority="marcrelator" type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Ozlem</namePart>
<namePart type="family">Uzuner</namePart>
<role>
<roleTerm authority="marcrelator" type="text">author</roleTerm>
</role>
</name>
<originInfo>
<dateIssued>2025-10</dateIssued>
</originInfo>
<typeOfResource>text</typeOfResource>
<relatedItem type="host">
<titleInfo>
<title>Proceedings of the 7th Clinical Natural Language Processing Workshop</title>
</titleInfo>
<name type="personal">
<namePart type="given">Asma</namePart>
<namePart type="family">Ben Abacha</namePart>
<role>
<roleTerm authority="marcrelator" type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Steven</namePart>
<namePart type="family">Bethard</namePart>
<role>
<roleTerm authority="marcrelator" type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Danielle</namePart>
<namePart type="family">Bitterman</namePart>
<role>
<roleTerm authority="marcrelator" type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Tristan</namePart>
<namePart type="family">Naumann</namePart>
<role>
<roleTerm authority="marcrelator" type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Kirk</namePart>
<namePart type="family">Roberts</namePart>
<role>
<roleTerm authority="marcrelator" type="text">editor</roleTerm>
</role>
</name>
<originInfo>
<publisher>Association for Computational Linguistics</publisher>
<place>
<placeTerm type="text">Virtual</placeTerm>
</place>
</originInfo>
<genre authority="marcgt">conference publication</genre>
</relatedItem>
<abstract>Medical order extraction is essential for structuring actionable clinical information, supporting decision-making, and enabling downstream applications such as documentation and workflow automation. Orders may be embedded in diverse sources, including electronic health records, discharge summaries, and multi-turn doctor–patient dialogues, and can span categories such as medications, laboratory tests, imaging studies, and follow-up actions. The MEDIQA-OE 2025 shared task focuses on extracting structured medical orders from extended conversational transcripts, requiring the identification of order type, description, reason, and provenance. We present the MasonNLP submission, which ranked 5th among 17 participating teams with 105 total submissions. Our approach uses a general-purpose, instruction-tuned LLaMA-4 17B model without domain-specific fine-tuning, guided by a single in-context example. This few-shot configuration achieved an average F1 score of 37.76, with notable improvements in reason and provenance accuracy. These results demonstrate that large, non-domain-specific LLMs, when paired with effective prompt engineering, can serve as strong, scalable baselines for specialized clinical NLP tasks.</abstract>
<identifier type="citekey">karim-uzuner-2025-masonnlp</identifier>
<location>
<url>https://aclanthology.org/2025.clinicalnlp-1.7/</url>
</location>
<part>
<date>2025-10</date>
<extent unit="page">
<start>57</start>
<end>67</end>
</extent>
</part>
</mods>
</modsCollection>

%0 Conference Proceedings
%T MasonNLP at MEDIQA-OE 2025: Assessing Large Language Models for Structured Medical Order Extraction
%A Karim, A. H. M. Rezaul
%A Uzuner, Ozlem
%Y Ben Abacha, Asma
%Y Bethard, Steven
%Y Bitterman, Danielle
%Y Naumann, Tristan
%Y Roberts, Kirk
%S Proceedings of the 7th Clinical Natural Language Processing Workshop
%D 2025
%8 October
%I Association for Computational Linguistics
%C Virtual
%F karim-uzuner-2025-masonnlp
%X Medical order extraction is essential for structuring actionable clinical information, supporting decision-making, and enabling downstream applications such as documentation and workflow automation. Orders may be embedded in diverse sources, including electronic health records, discharge summaries, and multi-turn doctor–patient dialogues, and can span categories such as medications, laboratory tests, imaging studies, and follow-up actions. The MEDIQA-OE 2025 shared task focuses on extracting structured medical orders from extended conversational transcripts, requiring the identification of order type, description, reason, and provenance. We present the MasonNLP submission, which ranked 5th among 17 participating teams with 105 total submissions. Our approach uses a general-purpose, instruction-tuned LLaMA-4 17B model without domain-specific fine-tuning, guided by a single in-context example. This few-shot configuration achieved an average F1 score of 37.76, with notable improvements in reason and provenance accuracy. These results demonstrate that large, non-domain-specific LLMs, when paired with effective prompt engineering, can serve as strong, scalable baselines for specialized clinical NLP tasks.
%U https://aclanthology.org/2025.clinicalnlp-1.7/
%P 57-67

Markdown (Informal)
[MasonNLP at MEDIQA-OE 2025: Assessing Large Language Models for Structured Medical Order Extraction](https://aclanthology.org/2025.clinicalnlp-1.7/) (Karim & Uzuner, ClinicalNLP 2025)