Sallam Abualhaija
2017
Metaheuristic Approaches to Lexical Substitution and Simplification
Sallam Abualhaija
|
Tristan Miller
|
Judith Eckle-Kohler
|
Iryna Gurevych
|
Karl-Heinz Zimmermann
Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics: Volume 1, Long Papers
In this paper, we propose using metaheuristics—in particular, simulated annealing and the new D-Bees algorithm—to solve word sense disambiguation as an optimization problem within a knowledge-based lexical substitution system. We are the first to perform such an extrinsic evaluation of metaheuristics, for which we use two standard lexical substitution datasets, one English and one German. We find that D-Bees has robust performance for both languages, and performs better than simulated annealing, though both achieve good results. Moreover, the D-Bees–based lexical substitution system outperforms state-of-the-art systems on several evaluation metrics. We also show that D-Bees achieves competitive performance in lexical simplification, a variant of lexical substitution.
Parameter Transfer across Domains for Word Sense Disambiguation
Sallam Abualhaija
|
Nina Tahmasebi
|
Diane Forin
|
Karl-Heinz Zimmermann
Proceedings of the International Conference Recent Advances in Natural Language Processing, RANLP 2017
Word sense disambiguation is defined as finding the corresponding sense for a target word in a given context, which comprises a major step in text applications. Recently, it has been addressed as an optimization problem. The idea behind is to find a sequence of senses that corresponds to the words in a given context with a maximum semantic similarity. Metaheuristics like simulated annealing and D-Bees provide approximate good-enough solutions, but are usually influenced by the starting parameters. In this paper, we study the parameter tuning for both algorithms within the word sense disambiguation problem. The experiments are conducted on different datasets to cover different disambiguation scenarios. We show that D-Bees is robust and less sensitive towards the initial parameters compared to simulated annealing, hence, it is sufficient to tune the parameters once and reuse them for different datasets, domains or languages.
Search
Fix data
Co-authors
- Karl-Heinz Zimmermann 2
- Judith Eckle-Kohler 1
- Diane Forin 1
- Iryna Gurevych 1
- Tristan Miller 1
- show all...