A voting scheme to detect semantic underspecification

Héctor Martínez Alonso, Núria Bel, Bolette Sandford Pedersen


Abstract
The following work describes a voting system to automatically classify the sense selection of the complex types Location/Organization and Container/Content, which depend on regular polysemy, as described by the Generative Lexicon (Pustejovsky, 1995) . This kind of sense alternations very often presents semantic underspecificacion between its two possible selected senses. This kind of underspecification is not traditionally contemplated in word sense disambiguation systems, as disambiguation systems are still coping with the need of a representation and recognition of underspecification (Pustejovsky, 2009) The data are characterized by the morphosyntactic and lexical enviroment of the headwords and provided as input for a classifier. The baseline decision tree classifier is compared against an eight-member voting scheme obtained from variants of the training data generated by modifications on the class representation and from two different classification algorithms, namely decision trees and k-nearest neighbors. The voting system improves the accuracy for the non-underspecified senses, but the underspecified sense remains difficult to identify
Anthology ID:
L12-1080
Volume:
Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12)
Month:
May
Year:
2012
Address:
Istanbul, Turkey
Editors:
Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Mehmet Uğur Doğan, Bente Maegaard, Joseph Mariani, Asuncion Moreno, Jan Odijk, Stelios Piperidis
Venue:
LREC
SIG:
Publisher:
European Language Resources Association (ELRA)
Note:
Pages:
569–575
Language:
URL:
http://www.lrec-conf.org/proceedings/lrec2012/pdf/225_Paper.pdf
DOI:
Bibkey:
Cite (ACL):
Héctor Martínez Alonso, Núria Bel, and Bolette Sandford Pedersen. 2012. A voting scheme to detect semantic underspecification. In Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12), pages 569–575, Istanbul, Turkey. European Language Resources Association (ELRA).
Cite (Informal):
A voting scheme to detect semantic underspecification (Alonso et al., LREC 2012)
Copy Citation:
PDF:
http://www.lrec-conf.org/proceedings/lrec2012/pdf/225_Paper.pdf