Concept Grounding to Multiple Knowledge Bases via Indirect Supervision

Chen-Tse Tsai, Dan Roth


Abstract
We consider the problem of disambiguating concept mentions appearing in documents and grounding them in multiple knowledge bases, where each knowledge base addresses some aspects of the domain. This problem poses a few additional challenges beyond those addressed in the popular Wikification problem. Key among them is that most knowledge bases do not contain the rich textual and structural information Wikipedia does; consequently, the main supervision signal used to train Wikification rankers does not exist anymore. In this work we develop an algorithmic approach that, by carefully examining the relations between various related knowledge bases, generates an indirect supervision signal it uses to train a ranking model that accurately chooses knowledge base entries for a given mention; moreover, it also induces prior knowledge that can be used to support a global coherent mapping of all the concepts in a given document to the knowledge bases. Using the biomedical domain as our application, we show that our indirectly supervised ranking model outperforms other unsupervised baselines and that the quality of this indirect supervision scheme is very close to a supervised model. We also show that considering multiple knowledge bases together has an advantage over grounding concepts to each knowledge base individually.
Anthology ID:
Q16-1011
Volume:
Transactions of the Association for Computational Linguistics, Volume 4
Month:
Year:
2016
Address:
Cambridge, MA
Editors:
Lillian Lee, Mark Johnson, Kristina Toutanova
Venue:
TACL
SIG:
Publisher:
MIT Press
Note:
Pages:
141–154
Language:
URL:
https://aclanthology.org/Q16-1011
DOI:
10.1162/tacl_a_00089
Bibkey:
Cite (ACL):
Chen-Tse Tsai and Dan Roth. 2016. Concept Grounding to Multiple Knowledge Bases via Indirect Supervision. Transactions of the Association for Computational Linguistics, 4:141–154.
Cite (Informal):
Concept Grounding to Multiple Knowledge Bases via Indirect Supervision (Tsai & Roth, TACL 2016)
Copy Citation:
PDF:
https://aclanthology.org/Q16-1011.pdf
Data
FrameNet