Hian-Beng Lee
Also published as: Hian Beng Lee
1999
Article selection using probabilistic sense disambiguation
Hian-Beng Lee
Proceedings of Machine Translation Summit VII
A probabilistic method is used for word sense disambiguation, with the six surrounding words taken as features. Because only their surface forms are used, no syntactic or semantic analysis is required. Despite its simplicity, this method disambiguates the noun "interest" accurately. Using the common data set of (Bruce & Wiebe 94), we obtained an average accuracy of 86.6%, compared with their reported figure of 78%. This portable technique can also be applied to the task of English article selection, a problem that arises in machine translation into English from any source language that has no articles. Using texts from the Wall Street Journal, we achieved an overall accuracy of 83.1% for the 1,500 most commonly used head nouns.
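The abstract does not specify the probabilistic model, so the following is only a minimal sketch of the general idea, not the paper's method. It assumes a naive Bayes classifier with add-one smoothing, and it assumes the six surrounding words are split as three on each side of the target. The names `context_features` and `NaiveBayesSenseTagger`, the sense labels, and the toy training sentences are all hypothetical and exist purely for illustration.

```python
import math
from collections import Counter, defaultdict


def context_features(tokens, index, width=3):
    """Surface forms of up to `width` words on each side of tokens[index].

    Assumption: the six surrounding words are three to the left and three
    to the right; the abstract does not state how the window is split.
    """
    left = tokens[max(0, index - width):index]
    right = tokens[index + 1:index + 1 + width]
    return [w.lower() for w in left + right]


class NaiveBayesSenseTagger:
    """Hypothetical naive Bayes tagger over surrounding surface forms."""

    def __init__(self):
        self.sense_counts = Counter()            # how often each sense was seen
        self.word_counts = defaultdict(Counter)  # context word counts per sense
        self.vocab = set()                       # all context words seen in training

    def train(self, examples):
        # examples: iterable of (tokens, target_index, sense_label)
        for tokens, index, sense in examples:
            self.sense_counts[sense] += 1
            for word in context_features(tokens, index):
                self.word_counts[sense][word] += 1
                self.vocab.add(word)

    def predict(self, tokens, index):
        total = sum(self.sense_counts.values())
        best_sense, best_score = None, -math.inf
        for sense, count in self.sense_counts.items():
            # Log prior plus add-one smoothed log likelihood of each context word.
            score = math.log(count / total)
            denom = sum(self.word_counts[sense].values()) + len(self.vocab) + 1
            for word in context_features(tokens, index):
                score += math.log((self.word_counts[sense][word] + 1) / denom)
            if score > best_score:
                best_sense, best_score = sense, score
        return best_sense


# Toy, made-up training data: (tokens, index of "interest", sense label).
train = [
    ("the bank raised the interest rate on loans".split(), 4, "money"),
    ("she pays high interest rate on her loan".split(), 3, "money"),
    ("he showed great interest in modern art".split(), 3, "attention"),
    ("they lost interest in the old game".split(), 2, "attention"),
]

tagger = NaiveBayesSenseTagger()
tagger.train(train)
print(tagger.predict("the interest rate on loans rose".split(), 1))
print(tagger.predict("his interest in art grew".split(), 1))
```

Under the same assumptions, article selection could be framed the same way: the class label would be the article to generate for a head noun rather than a word sense, with the surrounding surface forms again serving as features.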
1996
Integrating Multiple Knowledge Sources to Disambiguate Word Sense: An Exemplar-Based Approach
Hwee Tou Ng | Hian Beng Lee
34th Annual Meeting of the Association for Computational Linguistics