Mitigating Data Shift of Biomedical Research Articles for Information Retrieval and Semantic Indexing

Nima Ebadi, Anthony Rios, Peyman Najafirad


Abstract
Researchers have explored novel methods for both semantic indexing and information retrieval of biomedical research articles. Moreover, most solutions treat each task independently. However, both tasks are related. For instance, semantic indexes are generally used to filter results from an information retrieval system. Hence, one task can potentially improve the performance of models trained for the other task. Thus, this study proposes a unified retriever-ranker-based model to tackle the tasks of information retrieval (IR) and semantic indexing (SI). Particularly, our proposed model can adapt to rapid shifts in scientific research. Our results show that the model effectively leverages task similarity to improve the robustness to dataset shift. For SI, the Micro f1 score increases by 8% and the LCA-F score improves by 5%. For IR, the MAP increases by 5% on average.
Anthology ID:
2022.sdp-1.15
Volume:
Proceedings of the Third Workshop on Scholarly Document Processing
Month:
October
Year:
2022
Address:
Gyeongju, Republic of Korea
Editors:
Arman Cohan, Guy Feigenblat, Dayne Freitag, Tirthankar Ghosal, Drahomira Herrmannova, Petr Knoth, Kyle Lo, Philipp Mayr, Michal Shmueli-Scheuer, Anita de Waard, Lucy Lu Wang
Venue:
sdp
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
138–151
Language:
URL:
https://aclanthology.org/2022.sdp-1.15
DOI:
Bibkey:
Cite (ACL):
Nima Ebadi, Anthony Rios, and Peyman Najafirad. 2022. Mitigating Data Shift of Biomedical Research Articles for Information Retrieval and Semantic Indexing. In Proceedings of the Third Workshop on Scholarly Document Processing, pages 138–151, Gyeongju, Republic of Korea. Association for Computational Linguistics.
Cite (Informal):
Mitigating Data Shift of Biomedical Research Articles for Information Retrieval and Semantic Indexing (Ebadi et al., sdp 2022)
Copy Citation:
PDF:
https://aclanthology.org/2022.sdp-1.15.pdf
Data
BioASQTREC-COVID