Beat Kunz
2014
Zmorge: A German Morphological Lexicon Extracted from Wiktionary
Rico Sennrich
|
Beat Kunz
Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14)
We describe a method to automatically extract a German lexicon from Wiktionary that is compatible with the finite-state morphological grammar SMOR. The main advantage of the resulting lexicon over existing lexica for SMOR is that it is open and permissively licensed. A recall-oriented evaluation shows that a morphological analyser built with our lexicon has comparable coverage compared to existing lexica, and continues to improve as Wiktionary grows. We also describe modifications to the SMOR grammar that result in a more conventional lemmatisation of words.