Pattern-Based Extraction of Negative Polarity Items from Dependency-Parsed Text

Fabienne Fritzinger; Frank Richter; Marion Weller

Abstract

We describe a new method for extracting Negative Polarity Item candidates (NPI candidates) from dependency-parsed German text corpora. Semi-automatic extraction of NPIs is a challenging task because NPIs have no uniform categorical or other syntactic properties that could be used to detect them; they occur as single words or as multi-word expressions of almost any syntactic category. Their defining property is semantic: they may only occur in the scope of negation and related semantic operators. In contrast to an earlier approach to NPI extraction from corpora, we specifically target multi-word expressions. Besides applying statistical methods to measure the co-occurrence of our candidate expressions with negative contexts, we also apply linguistic criteria in an attempt to determine to what degree they are idiomatic. Our method is evaluated by comparing the set of NPIs we found with the most comprehensive electronic list of German NPIs, which currently contains 165 entries. Our method retrieved 142 NPIs, 114 of which are new.

Anthology ID: L10-1182
Volume: Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC'10)
Month: May
Year: 2010
Address: Valletta, Malta
Editors: Nicoletta Calzolari, Khalid Choukri, Bente Maegaard, Joseph Mariani, Jan Odijk, Stelios Piperidis, Mike Rosner, Daniel Tapias
Venue: LREC
Publisher: European Language Resources Association (ELRA)
External URL: http://www.lrec-conf.org/proceedings/lrec2010/pdf/267_Paper.pdf
Cite (ACL): Fabienne Fritzinger, Frank Richter, and Marion Weller. 2010. Pattern-Based Extraction of Negative Polarity Items from Dependency-Parsed Text. In Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC'10), Valletta, Malta. European Language Resources Association (ELRA).
Cite (Informal): Pattern-Based Extraction of Negative Polarity Items from Dependency-Parsed Text (Fritzinger et al., LREC 2010)