Konrad Wojtasik


2023

Wordnet for Definition Augmentation with Encoder-Decoder Architecture
Konrad Wojtasik | Arkadiusz Janz | Maciej Piasecki
Proceedings of the 12th Global Wordnet Conference

Data augmentation is a difficult task in Natural Language Processing. Simple methods that can be applied relatively easily in other domains, such as insertion, deletion, or substitution, mostly change the sentence meaning significantly and yield incorrect examples. Wordnets are potentially a perfect source of rich, high-quality data that, when integrated with the powerful capacity of generative models, can help to solve this complex task. In this work, we use plWordNet, a wordnet of the Polish language, to explore the capability of encoder-decoder architectures in data augmentation of sense glosses. We discuss the limitations of generative methods and perform a qualitative review of generated data samples.