Generating a Large-Scale Entity Linking Dictionary from Wikipedia Link Structure and Article Text

Ravindra Harige, Paul Buitelaar


Abstract
Wikipedia has been increasingly used as a knowledge base for open-domain Named Entity Linking and Disambiguation. In this task, a dictionary with entity surface forms plays an important role in finding a set of candidate entities for the mentions in text. Existing dictionaries mostly rely on the Wikipedia link structure, like anchor texts, redirect links and disambiguation links. In this paper, we introduce a dictionary for Entity Linking that includes name variations extracted from Wikipedia article text, in addition to name variations derived from the Wikipedia link structure. With this approach, we show an increase in the coverage of entities and their mentions in the dictionary in comparison to other Wikipedia based dictionaries.
Anthology ID:
L16-1385
Volume:
Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16)
Month:
May
Year:
2016
Address:
Portorož, Slovenia
Editors:
Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Sara Goggi, Marko Grobelnik, Bente Maegaard, Joseph Mariani, Helene Mazo, Asuncion Moreno, Jan Odijk, Stelios Piperidis
Venue:
LREC
SIG:
Publisher:
European Language Resources Association (ELRA)
Note:
Pages:
2431–2434
Language:
URL:
https://aclanthology.org/L16-1385
DOI:
Bibkey:
Cite (ACL):
Ravindra Harige and Paul Buitelaar. 2016. Generating a Large-Scale Entity Linking Dictionary from Wikipedia Link Structure and Article Text. In Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16), pages 2431–2434, Portorož, Slovenia. European Language Resources Association (ELRA).
Cite (Informal):
Generating a Large-Scale Entity Linking Dictionary from Wikipedia Link Structure and Article Text (Harige & Buitelaar, LREC 2016)
Copy Citation:
PDF:
https://aclanthology.org/L16-1385.pdf