Phrase2VecGLM: Neural generalized language model–based semantic tagging for complex query reformulation in medical IR

Manirupa Das, Eric Fosler-Lussier, Simon Lin, Soheil Moosavinasab, David Chen, Steve Rust, Yungui Huang, Rajiv Ramnath


Abstract
In this work, we develop a novel, completely unsupervised, neural language model-based document ranking approach to semantic tagging of documents. The document to be tagged is used as a query into a generalized language model (GLM) to retrieve candidate phrases from top-ranked related documents, thus associating every document with novel related concepts extracted from the text. For this we extend the word embedding-based generalized language model of Ganguly et al. (2015) to employ phrasal embeddings, and use the semantic tags thus obtained for downstream query expansion, both directly and in feedback loop settings. Our method, evaluated on the TREC 2016 clinical decision support challenge dataset, shows statistically significant improvement not only over various baselines that use standard MeSH terms and UMLS concepts for query expansion, but also over baselines using human expert–assigned concept tags for the queries, run on top of a standard Okapi BM25–based document retrieval system.
Anthology ID:
W18-2313
Volume:
Proceedings of the BioNLP 2018 workshop
Month:
July
Year:
2018
Address:
Melbourne, Australia
Editors:
Dina Demner-Fushman, Kevin Bretonnel Cohen, Sophia Ananiadou, Junichi Tsujii
Venue:
BioNLP
Publisher:
Association for Computational Linguistics
Pages:
118–128
URL:
https://aclanthology.org/W18-2313
DOI:
10.18653/v1/W18-2313
Cite (ACL):
Manirupa Das, Eric Fosler-Lussier, Simon Lin, Soheil Moosavinasab, David Chen, Steve Rust, Yungui Huang, and Rajiv Ramnath. 2018. Phrase2VecGLM: Neural generalized language model–based semantic tagging for complex query reformulation in medical IR. In Proceedings of the BioNLP 2018 workshop, pages 118–128, Melbourne, Australia. Association for Computational Linguistics.
Cite (Informal):
Phrase2VecGLM: Neural generalized language model–based semantic tagging for complex query reformulation in medical IR (Das et al., BioNLP 2018)
PDF:
https://aclanthology.org/W18-2313.pdf