2015
pdf
bib
Corpus Patterns for Semantic Processing
Octavian Popescu
|
Patrick Hanks
|
Elisabetta Jezek
|
Daisuke Kawahara
Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing: Tutorial Abstracts
2014
pdf
bib
abs
Disambiguating Verbs by Collocation: Corpus Lexicography meets Natural Language Processing
Ismail El Maarouf
|
Jane Bradbury
|
Vít Baisa
|
Patrick Hanks
Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14)
This paper reports the results of Natural Language Processing (NLP) experiments in semantic parsing, based on a new semantic resource, the Pattern Dictionary of English Verbs (PDEV) (Hanks, 2013). This work is set in the DVC (Disambiguating Verbs by Collocation) project , a project in Corpus Lexicography aimed at expanding PDEV to a large scale. This project springs from a long-term collaboration of lexicographers with computer scientists which has given rise to the design and maintenance of specific, adapted, and user-friendly editing and exploration tools. Particular attention is drawn on the use of NLP deep semantic methods to help in data processing. Possible contributions of NLP include pattern disambiguation, the focus of this article. The present article explains how PDEV differs from other lexical resources and describes its structure in detail. It also presents new classification experiments on a subset of 25 verbs. The SVM model obtained a micro-average F1 score of 0.81.
pdf
bib
abs
Mapping CPA Patterns onto OntoNotes Senses
Octavian Popescu
|
Martha Palmer
|
Patrick Hanks
Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14)
In this paper we present an alignment experiment between patterns of verb use discovered by Corpus Pattern Analysis (CPA; Hanks 2004, 2008, 2012) and verb senses in OntoNotes (ON; Hovy et al. 2006, Weischedel et al. 2011). We present a probabilistic approach for mapping one resource into the other. Firstly we introduce a basic model, based on conditional probabilities, which determines for any given sentence the best CPA pattern match. On the basis of this model, we propose a joint source channel model (JSCM) that computes the probability of compatibility of semantic types between a verb phrase and a pattern, irrespective of whether the verb phrase is a norm or an exploitation. We evaluate the accuracy of the proposed mapping using cluster similarity metrics based on entropy.
2013
pdf
bib
Corpus-driven Lexical Analysis: Norms and Exploitations in Word Use
Patrick Hanks
Proceedings of the Joint Symposium on Semantic Processing. Textual Inference and Structures in Corpora
2004
pdf
bib
Automated Induction of Sense in Context
James Pustejovsky
|
Patrick Hanks
|
Anna Rumshisky
Proceedings of the 5th International Workshop on Linguistically Interpreted Corpora
pdf
bib
Automated Induction of Sense in Context
James Pustejovsky
|
Patrick Hanks
|
Anna Rumshisky
COLING 2004: Proceedings of the 20th International Conference on Computational Linguistics
1990
pdf
bib
Word Association Norms, Mutual Information, and Lexicography
Kenneth Ward Church
|
Patrick Hanks
Computational Linguistics, Volume 16, Number 1, March 1990
1989
pdf
bib
abs
Parsing, Word Associations and Typical Predicate-Argument Relations
Kenneth Church
|
William Gale
|
Patrick Hanks
|
Donald Hindle
Proceedings of the First International Workshop on Parsing Technologies
There are a number of collocational constraints in natural languages that ought to play a more important role in natural language parsers. Thus, for example, it is hard for most parsers to take advantage of the fact that wine is typically drunk, produced, and sold, but (probably) not pruned. So too, it is hard for a parser to know which verbs go with which prepositions (e.g., set up) and which nouns fit together to form compound noun phrases (e.g., computer programmer). This paper will attempt to show that many of these types of concerns can be addressed with syntactic methods (symbol pushing), and need not require explicit semantic interpretation. We have found that it is possible to identify many of these interesting co-occurrence relations by computing simple summary statistics over millions of words of text. This paper will summarize a number of experiments carried out by various subsets of the authors over the last few years. The term collocation will be used quite broadly to include constraints on SVO (subject verb object) triples, phrasal verbs, compound noun phrases, and psychoiinguistic notions of word association (e.g., doctor/nurse).
pdf
bib
Word Association Norms, Mutual Information, and Lexicography
Kenneth Ward Church
|
Patrick Hanks
27th Annual Meeting of the Association for Computational Linguistics
pdf
bib
Parsing, Word Associations and Typical Predicate-Argument Relations
Kenneth Church
|
William Gale
|
Patrick Hanks
|
Donald Hindle
Speech and Natural Language: Proceedings of a Workshop Held at Cape Cod, Massachusetts, October 15-18, 1989