Buildind a Resource of Patterns Using Semantic Types

Octavian Popescu


Abstract
While a word in isolation has a high potential of expressing various senses, in certain phrases this potential is restricted up to the point that one and only one sense is possible. A phrase is called sense stable if the senses of all the words compounding it do not change their sense irrespective of the context which could be added to its left or to its right. By comparing sense stable phrases we can extract corpus patterns. These patterns have slots which are filled by semantic types that capture the relevant information for disambiguation. The relationship between slots is such that a chain like disambiguation process is possible. Annotating a corpus with these kinds of patterns is beneficial for NLP, because problems such as data sparseness, noise, learning complexity are alleviated. We evaluate the inter agreement of annotators on examples coming from BNC.
Anthology ID:
L12-1628
Volume:
Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12)
Month:
May
Year:
2012
Address:
Istanbul, Turkey
Editors:
Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Mehmet Uğur Doğan, Bente Maegaard, Joseph Mariani, Asuncion Moreno, Jan Odijk, Stelios Piperidis
Venue:
LREC
SIG:
Publisher:
European Language Resources Association (ELRA)
Note:
Pages:
2999–3006
Language:
URL:
http://www.lrec-conf.org/proceedings/lrec2012/pdf/1055_Paper.pdf
DOI:
Bibkey:
Cite (ACL):
Octavian Popescu. 2012. Buildind a Resource of Patterns Using Semantic Types. In Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12), pages 2999–3006, Istanbul, Turkey. European Language Resources Association (ELRA).
Cite (Informal):
Buildind a Resource of Patterns Using Semantic Types (Popescu, LREC 2012)
Copy Citation:
PDF:
http://www.lrec-conf.org/proceedings/lrec2012/pdf/1055_Paper.pdf