How Well Can We Predict Hypernyms from Word Embeddings? A Dataset-Centric Analysis

Ivan Sanchez, Sebastian Riedel


Abstract
One key property of word embeddings currently under study is their capacity to encode hypernymy. Previous works have used supervised models to recover hypernymy structures from embeddings. However, the overall results do not clearly show how well we can recover such structures. We conduct the first dataset-centric analysis that shows how only the Baroni dataset provides consistent results. We empirically show that a possible reason for its good performance is its alignment to dimensions specific of hypernymy: generality and similarity
Anthology ID:
E17-2064
Volume:
Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics: Volume 2, Short Papers
Month:
April
Year:
2017
Address:
Valencia, Spain
Editors:
Mirella Lapata, Phil Blunsom, Alexander Koller
Venue:
EACL
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
401–407
Language:
URL:
https://aclanthology.org/E17-2064
DOI:
Bibkey:
Cite (ACL):
Ivan Sanchez and Sebastian Riedel. 2017. How Well Can We Predict Hypernyms from Word Embeddings? A Dataset-Centric Analysis. In Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics: Volume 2, Short Papers, pages 401–407, Valencia, Spain. Association for Computational Linguistics.
Cite (Informal):
How Well Can We Predict Hypernyms from Word Embeddings? A Dataset-Centric Analysis (Sanchez & Riedel, EACL 2017)
Copy Citation:
PDF:
https://aclanthology.org/E17-2064.pdf