Criteria for Identifying and Annotating Caused Motion Constructions in Corpus Data

Jena D. Hwang, Annie Zaenen, Martha Palmer


Abstract
While natural language processing performance has been improved through the recognition that there is a relationship between the semantics of the verb and the syntactic context in which the verb is realized, sentences where the verb does not conform to the expected syntax-semantic patterning behavior remain problematic. For example, in the sentence “The crowed laughed the clown off the stage”, a verb of non-verbal communication laugh is used in a caused motion construction and gains a motion entailment that is atypical given its inherent lexical semantics. This paper focuses on our efforts at defining the semantic types and varieties of caused motion constructions (CMCs) through an iterative annotation process and establishing annotation guidelines based on these criteria to aid in the production of a consistent and reliable annotation. The annotation will serve as training and test data for classifiers for CMCs, and the CMC definitions developed throughout this study will be used in extending VerbNet to handle representations of sentences in which a verb is used in a syntactic context that is atypical for its lexical semantics.
Anthology ID:
L14-1499
Volume:
Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14)
Month:
May
Year:
2014
Address:
Reykjavik, Iceland
Editors:
Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Hrafn Loftsson, Bente Maegaard, Joseph Mariani, Asuncion Moreno, Jan Odijk, Stelios Piperidis
Venue:
LREC
SIG:
Publisher:
European Language Resources Association (ELRA)
Note:
Pages:
1297–1304
Language:
URL:
http://www.lrec-conf.org/proceedings/lrec2014/pdf/624_Paper.pdf
DOI:
Bibkey:
Cite (ACL):
Jena D. Hwang, Annie Zaenen, and Martha Palmer. 2014. Criteria for Identifying and Annotating Caused Motion Constructions in Corpus Data. In Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14), pages 1297–1304, Reykjavik, Iceland. European Language Resources Association (ELRA).
Cite (Informal):
Criteria for Identifying and Annotating Caused Motion Constructions in Corpus Data (Hwang et al., LREC 2014)
Copy Citation:
PDF:
http://www.lrec-conf.org/proceedings/lrec2014/pdf/624_Paper.pdf