Pride-Boiler at MedGenVidQA 2026: LLM-Augmented BM25 Retrieval with Corrective Self-Verification for Biomedical Evidence Retrieval

Basil Ebinesar; Keyuan Jiang; Charansai Maddineni; Ashok Raja

Pride-Boiler at MedGenVidQA 2026: LLM-Augmented BM25 Retrieval with Corrective Self-Verification for Biomedical Evidence Retrieval

Basil Ebinesar, Keyuan Jiang, Charansai Maddineni, Ashok Raja

Abstract

This paper describes the Pride-Boiler system submitted to MedGenVidQA 2026 Shared Task A, which asks for retrieving relevant PubMed articles and medical instructional videos in response to consumer health queries. Our approach pairs Pyserini BM25 retrieval with LLM-driven query rewriting and a corrective self-verification loop inspired by the Corrective Retrieval-Augmented Generation (CRAG) paradigm. Given a consumer query, the pipeline first asks Google Gemini to generate clinically optimized search text, one targeting PubMed abstracts with MeSH terms and clinical synonyms, and another targeting video subtitles with procedural action language. BM25 retrieves a broad candidate pool, and Gemini then scores each candidate against the original query, blending its relevance judgment with the normalized lexical signal. A quality grader assesses the top results: if they are judged insufficient, the pipeline triggers a corrective cycle with reformulated terminology and retries up to three attempts. The entire workflow is orchestrated as a LangGraph state machine. In the official shared task evaluation, Pride-Boiler ranked first among all participating systems on PubMed article retrieval, achieving an nDCG of 0.6532 and MAP of 0.5550, both exceeding the organizer-provided Text-RR baseline. Our performance on video (text) retrieval achieves 0.5304 in MAP and 0.5927 in nDCG, outperforming other systems but falling below that of baseline, indicating the structural limitations of lexical matching over noisy subtitle text. We release the pipeline code to support reproducibility on GitHub at https://github.com/basilll007/BioNLP.

Anthology ID:: 2026.bionlp-2.33
Volume:: Proceedings of the BioNLP 2026 (Shared Tasks)
Month:: July
Year:: 2026
Address:: San Diego, California, USA
Editors:: Deepak Gupta, Dina Demner-Fushman
Venues:: BioNLP | WS
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 248–256
Language:
URL:: https://aclanthology.org/2026.bionlp-2.33/
DOI:
Bibkey:
Cite (ACL):: Basil Ebinesar, Keyuan Jiang, Charansai Maddineni, and Ashok Raja. 2026. Pride-Boiler at MedGenVidQA 2026: LLM-Augmented BM25 Retrieval with Corrective Self-Verification for Biomedical Evidence Retrieval. In Proceedings of the BioNLP 2026 (Shared Tasks), pages 248–256, San Diego, California, USA. Association for Computational Linguistics.
Cite (Informal):: Pride-Boiler at MedGenVidQA 2026: LLM-Augmented BM25 Retrieval with Corrective Self-Verification for Biomedical Evidence Retrieval (Ebinesar et al., BioNLP 2026)
Copy Citation:
PDF:: https://aclanthology.org/2026.bionlp-2.33.pdf
Supplementarymaterial:: 2026.bionlp-2.33.SupplementaryMaterial.txt

PDF Cite Search Supplementarymaterial Fix data