Generating subject-matter expertise assessment questions with GPT-4: a medical translation use-case

Diana Silveira, Marina Torrón, Helena Moniz


Abstract
This paper examines the suitability of a large language model (LLM), GPT-4, for generating multiple choice questions (MCQs) aimed at assessing subject matter expertise (SME) in the domain of medical translation. The main objective of these questions is to model the skills of potential subject matter experts in a human-in-the-loop machine translation (MT) flow, to ensure that tasks are matched to the individuals with the right skill profile. The investigation was conducted at Unbabel, an artificial intelligence-powered human translation platform. Two medical translation experts evaluated the GPT-4-generated questions and answers, one focusing on English–European Portuguese, and the other on English–German. We present a methodology for creating prompts to elicit high-quality GPT-4 outputs for this use case, as well as for designing evaluation scorecards for human review of such output. Our findings suggest that GPT-4 has the potential to generate suitable items for subject matter expertise tests, providing a more efficient approach compared to relying solely on humans. Furthermore, we propose recommendations for future research to build on our approach and refine the quality of the outputs generated by LLMs.
Anthology ID:
2024.eamt-1.53
Volume:
Proceedings of the 25th Annual Conference of the European Association for Machine Translation (Volume 1)
Month:
June
Year:
2024
Address:
Sheffield, UK
Editors:
Carolina Scarton, Charlotte Prescott, Chris Bayliss, Chris Oakley, Joanna Wright, Stuart Wrigley, Xingyi Song, Edward Gow-Smith, Rachel Bawden, Víctor M Sánchez-Cartagena, Patrick Cadwell, Ekaterina Lapshinova-Koltunski, Vera Cabarrão, Konstantinos Chatzitheodorou, Mary Nurminen, Diptesh Kanojia, Helena Moniz
Venue:
EAMT
SIG:
Publisher:
European Association for Machine Translation (EAMT)
Note:
Pages:
628–635
Language:
URL:
https://aclanthology.org/2024.eamt-1.53
DOI:
Bibkey:
Cite (ACL):
Diana Silveira, Marina Torrón, and Helena Moniz. 2024. Generating subject-matter expertise assessment questions with GPT-4: a medical translation use-case. In Proceedings of the 25th Annual Conference of the European Association for Machine Translation (Volume 1), pages 628–635, Sheffield, UK. European Association for Machine Translation (EAMT).
Cite (Informal):
Generating subject-matter expertise assessment questions with GPT-4: a medical translation use-case (Silveira et al., EAMT 2024)
Copy Citation:
PDF:
https://aclanthology.org/2024.eamt-1.53.pdf