Nana Khvtisavrishvili
2016
GhoSt-NN: A Representative Gold Standard of German Noun-Noun Compounds
Sabine Schulte im Walde
|
Anna Hätty
|
Stefan Bott
|
Nana Khvtisavrishvili
Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16)
This paper presents a novel gold standard of German noun-noun compounds (Ghost-NN) including 868 compounds annotated with corpus frequencies of the compounds and their constituents, productivity and ambiguity of the constituents, semantic relations between the constituents, and compositionality ratings of compound-constituent pairs. Moreover, a subset of the compounds containing 180 compounds is balanced for the productivity of the modifiers (distinguishing low/mid/high productivity) and the ambiguity of the heads (distinguishing between heads with 1, 2 and >2 senses
GhoSt-PV: A Representative Gold Standard of German Particle Verbs
Stefan Bott
|
Nana Khvtisavrishvili
|
Max Kisselew
|
Sabine Schulte im Walde
Proceedings of the 5th Workshop on Cognitive Aspects of the Lexicon (CogALex - V)
German particle verbs represent a frequent type of multi-word-expression that forms a highly productive paradigm in the lexicon. Similarly to other multi-word expressions, particle verbs exhibit various levels of compositionality. One of the major obstacles for the study of compositionality is the lack of representative gold standards of human ratings. In order to address this bottleneck, this paper presents such a gold standard data set containing 400 randomly selected German particle verbs. It is balanced across several particle types and three frequency bands, and accomplished by human ratings on the degree of semantic compositionality.