Kushagra Agarwal


2020

pdf bib
Information Retrieval and Extraction on COVID-19 Clinical Articles Using Graph Community Detection and Bio-BERT Embeddings
Debasmita Das | Yatin Katyal | Janu Verma | Shashank Dubey | AakashDeep Singh | Kushagra Agarwal | Sourojit Bhaduri | RajeshKumar Ranjan
Proceedings of the 1st Workshop on NLP for COVID-19 at ACL 2020

In this paper, we present an information retrieval system on a corpus of scientific articles related to COVID-19. We build a similarity network on the articles where similarity is determined via shared citations and biological domain-specific sentence embeddings. Ego-splitting community detection on the article network is employed to cluster the articles and then the queries are matched with the clusters. Extractive summarization using BERT and PageRank methods is used to provide responses to the query. We also provide a Question-Answer bot on a small set of intents to demonstrate the efficacy of our model for an information extraction module.