Entity Disambiguation with Web Links

Andrew Chisholm, Ben Hachey


Abstract
Entity disambiguation with Wikipedia relies on structured information from redirect pages, article text, inter-article links, and categories. We explore whether web links can replace a curated encyclopaedia, obtaining entity prior, name, context, and coherence models from a corpus of web pages with links to Wikipedia. Experiments compare web link models to Wikipedia models on well-known conll and tac data sets. Results show that using 34 million web links approaches Wikipedia performance. Combining web link and Wikipedia models produces the best-known disambiguation accuracy of 88.7 on standard newswire test data.
Anthology ID:
Q15-1011
Volume:
Transactions of the Association for Computational Linguistics, Volume 3
Month:
Year:
2015
Address:
Cambridge, MA
Editors:
Michael Collins, Lillian Lee
Venue:
TACL
SIG:
Publisher:
MIT Press
Note:
Pages:
145–156
Language:
URL:
https://aclanthology.org/Q15-1011
DOI:
10.1162/tacl_a_00129
Bibkey:
Cite (ACL):
Andrew Chisholm and Ben Hachey. 2015. Entity Disambiguation with Web Links. Transactions of the Association for Computational Linguistics, 3:145–156.
Cite (Informal):
Entity Disambiguation with Web Links (Chisholm & Hachey, TACL 2015)
Copy Citation:
PDF:
https://aclanthology.org/Q15-1011.pdf
Code
 wikilinks/nel