Towards a methodology for filtering out gaps and mismatches across wordnets: the case of plWordNet and Princeton WordNet

Ewa Rudnicka, Wojciech Witkowski, Łukasz Grabowski


Abstract
This paper presents the results of large-scale noun synset mapping between plWordNet, the wordnet of Polish, and Princeton WordNet, the wordnet of English, which have shown high predominance of inter-lingual hyponymy relation over inter-synonymy relation. Two main sources of such effect are identified in the paper: differences in the methodologies of construction of plWN and PWN and cross-linguistic differences in lexicalization of concepts and grammatical categories between English and Polish. Next, we propose a typology of specific gaps and mismatches across wordnets and a rule-based system of filters developed specifically to scan all I(inter-lingual)-hyponymy links between plWN and PWN. The proposed system, it should be stressed, also enables one to pinpoint the frequencies of the identified gaps and mismatches.
Anthology ID:
2016.gwc-1.49
Volume:
Proceedings of the 8th Global WordNet Conference (GWC)
Month:
27--30 January
Year:
2016
Address:
Bucharest, Romania
Editors:
Christiane Fellbaum, Piek Vossen, Verginica Barbu Mititelu, Corina Forascu
Venue:
GWC
SIG:
SIGLEX
Publisher:
Global Wordnet Association
Note:
Pages:
347–354
Language:
URL:
https://aclanthology.org/2016.gwc-1.49
DOI:
Bibkey:
Cite (ACL):
Ewa Rudnicka, Wojciech Witkowski, and Łukasz Grabowski. 2016. Towards a methodology for filtering out gaps and mismatches across wordnets: the case of plWordNet and Princeton WordNet. In Proceedings of the 8th Global WordNet Conference (GWC), pages 347–354, Bucharest, Romania. Global Wordnet Association.
Cite (Informal):
Towards a methodology for filtering out gaps and mismatches across wordnets: the case of plWordNet and Princeton WordNet (Rudnicka et al., GWC 2016)
Copy Citation:
PDF:
https://aclanthology.org/2016.gwc-1.49.pdf