Learning to Combine AI Annotations for Improved Biomedical Relevance Labeling

Won Gyu Kim; Lana Yeganova; Shubo Tian; Donald C. Comeau; W. John Wilbur; Zhiyong Lu

Learning to Combine AI Annotations for Improved Biomedical Relevance Labeling

Won Gyu Kim, Lana Yeganova, Shubo Tian, Donald Comeau, W John Wilbur, Zhiyong Lu

Abstract

Accurate labeling of relevance between biomedical abstracts is essential for improving information retrieval, semantic similarity modeling, training of ranking systems and other Natural Language Processing tasks. However, manual annotations are time-consuming, labor intensive and costly. Studies show that large language models (LLMs) can facilitate automated annotation, but their performance still falls short of human expert-level accuracy, especially in domain-specific tasks. It has been shown that combining annotations from multiple non-expert annotators can achieve performance comparable to, or even exceeding, that of trained experts. Based on this evidence, we treat AI-generated annotations as contributions from non-expert annotators and combine them using Learning to Rank framework. Our results show significant improvement in overall annotation quality. The proposed method looks promising to reduce reliance on human annotation while maintaining reliable performance for large-scale biomedical applications.

Anthology ID:: 2026.bionlp-1.40
Volume:: BioNLP 2026
Month:: July
Year:: 2026
Address:: San Diego, California
Editors:: Dina Demner-Fushman, Sophia Ananiadou, Kirk Roberts, Junichi Tsujii
Venues:: BioNLP | WS
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 502–507
Language:
URL:: https://aclanthology.org/2026.bionlp-1.40/
DOI:
Bibkey:
Cite (ACL):: Won Gyu Kim, Lana Yeganova, Shubo Tian, Donald Comeau, W John Wilbur, and Zhiyong Lu. 2026. Learning to Combine AI Annotations for Improved Biomedical Relevance Labeling. In BioNLP 2026, pages 502–507, San Diego, California. Association for Computational Linguistics.
Cite (Informal):: Learning to Combine AI Annotations for Improved Biomedical Relevance Labeling (Kim et al., BioNLP 2026)
Copy Citation:
PDF:: https://aclanthology.org/2026.bionlp-1.40.pdf

PDF Cite Search Fix data