Simple Compound Splitting for German

Marion Weller-Di Marco


Abstract
This paper presents a simple method for German compound splitting that combines a basic frequency-based approach with a form-to-lemma mapping to approximate morphological operations. With the exception of a small set of hand-crafted rules for modeling transitional elements, this approach is resource-poor. In our evaluation, the simple splitter outperforms a splitter relying on rich morphological resources.
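As a rough illustration of the idea summarised in the abstract, the following minimal sketch combines a frequency-based split search with a form-to-lemma lookup and a short list of transitional elements. It is not the paper's implementation: the frequency table, the mapping, the list of linking elements, the geometric-mean score and all function names are assumptions chosen for the example.

```python
from math import prod

# Toy resources, invented for illustration only (not taken from the paper).
FREQ = {"arbeit": 500, "zimmer": 300, "arbeitszimmer": 20,
        "haus": 800, "tür": 400, "kind": 600, "garten": 350}
FORM_TO_LEMMA = {"kinder": "kind", "häuser": "haus"}
# Assumed transitional elements; "" means the modifier is used unchanged.
LINKERS = ("", "s", "es", "n", "en", "er", "e")


def modifier_candidates(form):
    """Yield known lemmas that the modifier form could stand for."""
    seen = set()
    for linker in LINKERS:
        if not form.endswith(linker):
            continue
        stem = form[: len(form) - len(linker)]
        # Try the stripped stem itself, then the form-to-lemma mapping.
        for cand in (stem, FORM_TO_LEMMA.get(stem), FORM_TO_LEMMA.get(form)):
            if cand and cand in FREQ and cand not in seen:
                seen.add(cand)
                yield cand


def score(parts):
    """Geometric mean of the corpus frequencies of the parts."""
    return prod(FREQ[p] for p in parts) ** (1 / len(parts))


def split_compound(word, min_len=3):
    """Return the best binary split, or [word] if no split scores higher."""
    word = word.lower()
    best, best_score = [word], FREQ.get(word, 0)
    for i in range(min_len, len(word) - min_len + 1):
        head = word[i:]
        if head not in FREQ:
            continue
        for modifier in modifier_candidates(word[:i]):
            s = score([modifier, head])
            if s > best_score:
                best, best_score = [modifier, head], s
    return best


print(split_compound("Arbeitszimmer"))
print(split_compound("Kindergarten"))
```

In this sketch a word is left unsplit when its own frequency exceeds the score of every candidate split, and only binary splits are considered; a fuller splitter would presumably recurse on the parts.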
Anthology ID:
W17-1722
Volume:
Proceedings of the 13th Workshop on Multiword Expressions (MWE 2017)
Month:
April
Year:
2017
Address:
Valencia, Spain
Venue:
MWE
SIG:
SIGLEX
Publisher:
Association for Computational Linguistics
Pages:
161–166
URL:
https://aclanthology.org/W17-1722
DOI:
10.18653/v1/W17-1722
Cite (ACL):
Marion Weller-Di Marco. 2017. Simple Compound Splitting for German. In Proceedings of the 13th Workshop on Multiword Expressions (MWE 2017), pages 161–166, Valencia, Spain. Association for Computational Linguistics.
Cite (Informal):
Simple Compound Splitting for German (Weller-Di Marco, MWE 2017)
PDF:
https://aclanthology.org/W17-1722.pdf