The Lacunae of Danish Natural Language Processing

Andreas Kirkedal, Barbara Plank, Leon Derczynski, Natalie Schluter


Abstract
Danish is a North Germanic language spoken principally in Denmark, a country with a long tradition of technological and scientific innovation. However, the language has received relatively little attention from a technological perspective. In this paper, we review Natural Language Processing (NLP) research, digital resources and tools which have been developed for Danish. We find that availability of models and tools is limited, which calls for work that lifts Danish NLP a step closer to the privileged languages. Dansk abstrakt: Dansk er et nordgermansk sprog, talt primært i kongeriget Danmark, et land med stærk tradition for teknologisk og videnskabelig innovation. Det danske sprog har imidlertid været genstand for relativt begrænset opmærksomhed, teknologisk set. I denne artikel gennemgår vi sprogteknologi-forskning, -ressourcer og -værktøjer udviklet for dansk. Vi konkluderer at der eksisterer et fåtal af modeller og værktøjer, hvilket indbyder til forskning som løfter dansk sprogteknologi i niveau med mere priviligerede sprog.
Anthology ID:
W19-6141
Volume:
Proceedings of the 22nd Nordic Conference on Computational Linguistics
Month:
September–October
Year:
2019
Address:
Turku, Finland
Editors:
Mareike Hartmann, Barbara Plank
Venue:
NoDaLiDa
SIG:
Publisher:
Linköping University Electronic Press
Note:
Pages:
356–362
Language:
URL:
https://aclanthology.org/W19-6141
DOI:
Bibkey:
Cite (ACL):
Andreas Kirkedal, Barbara Plank, Leon Derczynski, and Natalie Schluter. 2019. The Lacunae of Danish Natural Language Processing. In Proceedings of the 22nd Nordic Conference on Computational Linguistics, pages 356–362, Turku, Finland. Linköping University Electronic Press.
Cite (Informal):
The Lacunae of Danish Natural Language Processing (Kirkedal et al., NoDaLiDa 2019)
Copy Citation:
PDF:
https://aclanthology.org/W19-6141.pdf
Data
Universal Dependencies