Merging DanNet with Princeton Wordnet

Bolette Sandford Pedersen, Sanni Nimb, Ida Rørmann Olsen, Sussi Olsen


Abstract
In this paper we describe the merge of the Danish wordnet, DanNet, with Princeton Wordnet applying a two-step approach. We first link from the English Princeton core to Danish (5,000 base concepts) and then proceed to linking the rest of the Danish vocabulary to English, thus going from Danish to English. Since the Danish wordnet is built bottom-up from Danish lexica and corpora, all taxonomies are monolingually based and thus not necessarily directly compatible with the coverage and structure of the Princeton WordNet. This fact proves to pose some challenges to the linking procedure since a considerable number of the links cannot be realised via the preferred cross-language synonym link which implies a more or less precise correlation between the two concepts. Instead, a subpart of the links are realised through near synonym or hyponymy links to compensate for the fact that no precise translation can be found in the target resource. The tool WordnetLoom is currently used for manual linking but procedures for a more automatic procedure in future is discussed. We conclude that the two resources actually differ from each other quite more than expected, both vocabulary and structure-wise.
Anthology ID:
2019.gwc-1.16
Volume:
Proceedings of the 10th Global Wordnet Conference
Month:
July
Year:
2019
Address:
Wroclaw, Poland
Editors:
Piek Vossen, Christiane Fellbaum
Venue:
GWC
SIG:
SIGLEX
Publisher:
Global Wordnet Association
Note:
Pages:
125–134
Language:
URL:
https://aclanthology.org/2019.gwc-1.16
DOI:
Bibkey:
Cite (ACL):
Bolette Sandford Pedersen, Sanni Nimb, Ida Rørmann Olsen, and Sussi Olsen. 2019. Merging DanNet with Princeton Wordnet. In Proceedings of the 10th Global Wordnet Conference, pages 125–134, Wroclaw, Poland. Global Wordnet Association.
Cite (Informal):
Merging DanNet with Princeton Wordnet (Pedersen et al., GWC 2019)
Copy Citation:
PDF:
https://aclanthology.org/2019.gwc-1.16.pdf