Towards a principled approach to sense clustering – a case study of wordnet and dictionary senses in Danish

Bolette Pedersen, Manex Agirrezabal, Sanni Nimb, Ida Olsen, Sussi Olsen


Abstract
Our aim is to develop principled methods for sense clustering which can make existing lexical resources practically useful in NLP – not too fine-grained to be operational and yet finegrained enough to be worth the trouble. Where traditional dictionaries have a highly structured sense inventory typically describing the vocabulary by means of mainand subsenses, wordnets are generally fine-grained and unstructured. We present a series of clustering and annotation experiments with 10 of the most polysemous nouns in Danish. We combine the structured information of a traditional Danish dictionary with the ontological types found in the Danish wordnet, DanNet. This constellation enables us to automatically cluster senses in a principled way and improve inter-annotator agreement and wsd performance.
Anthology ID:
2018.gwc-1.21
Volume:
Proceedings of the 9th Global Wordnet Conference
Month:
January
Year:
2018
Address:
Nanyang Technological University (NTU), Singapore
Venue:
GWC
SIG:
Publisher:
Global Wordnet Association
Note:
Pages:
182–189
Language:
URL:
https://aclanthology.org/2018.gwc-1.21
DOI:
Bibkey:
Cite (ACL):
Bolette Pedersen, Manex Agirrezabal, Sanni Nimb, Ida Olsen, and Sussi Olsen. 2018. Towards a principled approach to sense clustering – a case study of wordnet and dictionary senses in Danish. In Proceedings of the 9th Global Wordnet Conference, pages 182–189, Nanyang Technological University (NTU), Singapore. Global Wordnet Association.
Cite (Informal):
Towards a principled approach to sense clustering – a case study of wordnet and dictionary senses in Danish (Pedersen et al., GWC 2018)
Copy Citation:
PDF:
https://aclanthology.org/2018.gwc-1.21.pdf