Finn Nielsen


2023

Alignment of Wikidata lexemes and Det Centrale Ordregister
Finn Nielsen
Proceedings of the 24th Nordic Conference on Computational Linguistics (NoDaLiDa)

Two Danish open-access lexicographic resources have appeared in recent years: lexemes in Wikidata and Det Centrale Ordregister (COR). The lexeme part of Wikidata describes words in different languages, and COR associates an identifier with each form of a Danish lexeme. Here I describe the current state of linking Wikidata lexemes with COR and some of the problems encountered.

2020

Lexemes in Wikidata: 2020 status
Finn Nielsen
Proceedings of the 7th Workshop on Linked Data in Linguistics (LDL-2020)

Wikidata now records data about lexemes, senses and lexical forms and exposes them as Linguistic Linked Open Data. Since lexemes in Wikidata were first established in 2018, this data has grown considerably in size. Links between lexemes in different languages can be made, e.g., through a derivation property or senses. We present some descriptive statistics about the lexemes of Wikidata, focusing on the multilingual aspects, and show that there are still relatively few multilingual links.