Description and acquisition of multiword lexemes

Angelika Storrer, Ulrike Schwall


Abstract
This paper deals with multiword lexemes (MWLs), focussing on two types of verbal MWLs: verbal idioms and support verb constructions. We discuss the characteristic properties of MWLs, namely non-standard compositionality, restricted substitutability of components, and restricted morpho-syntactic flexibility, and we show how these properties may cause serious problems during the analysis, generation, and transfer steps of machine translation systems. In order to cope with these problems, MT lexicons need to provide detailed descriptions of MWL properties. We list the types of information which we consider the necessary minimum for a successful processing of MWLs, and report on some feasibility studies aimed at the automatic extraction of German verbal multiword lexemes from text corpora and machine-readable dictionaries.
Anthology ID:
1993.eamt-1.3
Volume:
Third International EAMT Workshop: Machine Translation and the Lexicon
Month:
April 26–28
Year:
1993
Address:
Heidelberg, Germany
Editors:
Robert E. Frederking, Kathryn B. Taylor
Venue:
EAMT
SIG:
Publisher:
Springer Berlin Heidelberg
Note:
Pages:
35–50
Language:
URL:
https://aclanthology.org/1993.eamt-1.3
DOI:
Bibkey:
Cite (ACL):
Angelika Storrer and Ulrike Schwall. 1993. Description and acquisition of multiword lexemes. In Third International EAMT Workshop: Machine Translation and the Lexicon, pages 35–50, Heidelberg, Germany. Springer Berlin Heidelberg.
Cite (Informal):
Description and acquisition of multiword lexemes (Storrer & Schwall, EAMT 1993)
Copy Citation: