Mapping WordNet synsets to Wikipedia articles

Samuel Fernando, Mark Stevenson


Abstract
Lexical knowledge bases (LKBs), such as WordNet, have been shown to be useful for a range of language processing tasks. Extending these resources is an expensive and time-consuming process. This paper describes an approach that addresses this problem by automatically generating a mapping from WordNet synsets to Wikipedia articles. A sample of synsets has been manually annotated with article matches for evaluation purposes. The automatic methods are shown to create mappings with a precision of 87.8% and a recall of 46.9%. These mappings can then be used as a basis for enriching WordNet with new relations based on Wikipedia links. The manually and automatically created data are available online.
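As a rough illustration of how precision and recall figures of this kind can be computed against a manually annotated sample, here is a minimal sketch. The function name `precision_recall`, the dictionary representation, and the toy synset identifiers and article titles are assumptions made for illustration and are not taken from the paper. The paper's exact evaluation protocol (for example, how synsets with no matching article are counted) may differ.

```python
def precision_recall(predicted, gold):
    """Score an automatic synset-to-article mapping against a gold sample.

    predicted: dict of synset id -> article title proposed automatically
    gold:      dict of synset id -> article title chosen by an annotator
    Both structures are hypothetical; only synsets with a gold article
    are assumed to appear in `gold`.
    """
    # A prediction counts as correct only if it matches the gold article.
    correct = sum(1 for synset, article in predicted.items()
                  if gold.get(synset) == article)
    precision = correct / len(predicted) if predicted else 0.0
    recall = correct / len(gold) if gold else 0.0
    return precision, recall


# Toy data, invented purely for illustration.
gold = {"s1": "Article A", "s2": "Article B", "s3": "Article C", "s4": "Article D"}
predicted = {"s1": "Article A", "s2": "Article B", "s3": "Article X"}

precision, recall = precision_recall(predicted, gold)
print(f"precision={precision:.3f} recall={recall:.3f}")
```

In this toy case two of the three proposed mappings are correct and two of the four annotated synsets are recovered. It shows the same pattern as the abstract's figures: a method that only proposes mappings it is fairly sure of can score high precision while leaving many synsets unmapped.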
Anthology ID:
L12-1086
Volume:
Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12)
Month:
May
Year:
2012
Address:
Istanbul, Turkey
Editors:
Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Mehmet Uğur Doğan, Bente Maegaard, Joseph Mariani, Asuncion Moreno, Jan Odijk, Stelios Piperidis
Venue:
LREC
Publisher:
European Language Resources Association (ELRA)
Pages:
590–596
URL:
http://www.lrec-conf.org/proceedings/lrec2012/pdf/232_Paper.pdf
Cite (ACL):
Samuel Fernando and Mark Stevenson. 2012. Mapping WordNet synsets to Wikipedia articles. In Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12), pages 590–596, Istanbul, Turkey. European Language Resources Association (ELRA).
Cite (Informal):
Mapping WordNet synsets to Wikipedia articles (Fernando & Stevenson, LREC 2012)