Towards a Linking between WordNet and Wikidata

John P. McCrae, David Cillessen


Abstract
WordNet is the most widely used lexical resource for English, while Wikidata is one of the largest knowledge graphs of entity and concepts available. While, there is a clear difference in the focus of these two resources, there is also a significant overlap and as such a complete linking of these resources would have many uses. We propose the development of such a linking, first by means of the hapax legomenon links and secondly by the use of natural language processing techniques. We show that these can be done with high accuracy but that human validation is still necessary. This has resulted in over 9,000 links being added between these two resources.
Anthology ID:
2021.gwc-1.29
Volume:
Proceedings of the 11th Global Wordnet Conference
Month:
January
Year:
2021
Address:
University of South Africa (UNISA)
Editors:
Piek Vossen, Christiane Fellbaum
Venue:
GWC
SIG:
SIGLEX
Publisher:
Global Wordnet Association
Note:
Pages:
252–257
Language:
URL:
https://aclanthology.org/2021.gwc-1.29
DOI:
Bibkey:
Cite (ACL):
John P. McCrae and David Cillessen. 2021. Towards a Linking between WordNet and Wikidata. In Proceedings of the 11th Global Wordnet Conference, pages 252–257, University of South Africa (UNISA). Global Wordnet Association.
Cite (Informal):
Towards a Linking between WordNet and Wikidata (McCrae & Cillessen, GWC 2021)
Copy Citation:
PDF:
https://aclanthology.org/2021.gwc-1.29.pdf