The Seemingly (Un)systematic Linking Element in Danish

Sidsel Boldsen, Manex Agirrezabal


Abstract
The use of a linking element between compound members is a common phenomenon in Germanic languages. Still, the exact use and conditioning of such elements is a disputed topic in linguistics. In this paper we address the issue of predicting the use of linking elements in Danish. Following previous research that shows how the choice of linking element might be conditioned by phonology, we frame the problem as a language modeling task: Considering the linking elements -s/-∅ the problem becomes predicting what is most probable to encounter next, a syllable boundary or the joining element, ‘s’. We show that training a language model on this task reaches an accuracy of 94 %, and in the case of an unsupervised model, the accuracy reaches 80%.
Anthology ID:
W19-6144
Volume:
Proceedings of the 22nd Nordic Conference on Computational Linguistics
Month:
September–October
Year:
2019
Address:
Turku, Finland
Editors:
Mareike Hartmann, Barbara Plank
Venue:
NoDaLiDa
SIG:
Publisher:
Linköping University Electronic Press
Note:
Pages:
376–380
Language:
URL:
https://aclanthology.org/W19-6144
DOI:
Bibkey:
Cite (ACL):
Sidsel Boldsen and Manex Agirrezabal. 2019. The Seemingly (Un)systematic Linking Element in Danish. In Proceedings of the 22nd Nordic Conference on Computational Linguistics, pages 376–380, Turku, Finland. Linköping University Electronic Press.
Cite (Informal):
The Seemingly (Un)systematic Linking Element in Danish (Boldsen & Agirrezabal, NoDaLiDa 2019)
Copy Citation:
PDF:
https://aclanthology.org/W19-6144.pdf