Christian Wellner
2016
A Corpus of Literal and Idiomatic Uses of German Infinitive-Verb Compounds
Andrea Horbach, Andrea Hensler, Sabine Krome, Jakob Prange, Werner Scholze-Stubenrecht, Diana Steffen, Stefan Thater, Christian Wellner, Manfred Pinkal
Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16)
We present an annotation study on a representative dataset of literal and idiomatic uses of German infinitive-verb compounds in newspaper and journal texts. Infinitive-verb compounds pose a challenge for writers of German because the spelling rules differ between literal and idiomatic uses. With the participation of expert lexicographers, we obtained a high-quality corpus resource that can serve as a testbed for automatic idiomaticity detection and coarse-grained word-sense disambiguation. A classifier trained on the corpus distinguished literal from idiomatic uses with an accuracy of 85%.