Querying Word Embeddings for Similarity and Relatedness

Fatemeh Torabi Asr, Robert Zinkov, Michael Jones


Abstract
Word embeddings obtained from neural network models such as Word2Vec Skipgram have become popular representations of word meaning and have been evaluated on a variety of word similarity and relatedness norming data. Skipgram generates a set of word and context embeddings, the latter typically discarded after training. We demonstrate the usefulness of context embeddings in predicting asymmetric association between words from a recently published dataset of production norms (Jouravlev & McRae, 2016). Our findings suggest that humans respond with words closer to the cue within the context embedding space (rather than the word embedding space), when asked to generate thematically related words.
Anthology ID:
N18-1062
Volume:
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long Papers)
Month:
June
Year:
2018
Address:
New Orleans, Louisiana
Editors:
Marilyn Walker, Heng Ji, Amanda Stent
Venue:
NAACL
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
675–684
Language:
URL:
https://aclanthology.org/N18-1062
DOI:
10.18653/v1/N18-1062
Bibkey:
Cite (ACL):
Fatemeh Torabi Asr, Robert Zinkov, and Michael Jones. 2018. Querying Word Embeddings for Similarity and Relatedness. In Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long Papers), pages 675–684, New Orleans, Louisiana. Association for Computational Linguistics.
Cite (Informal):
Querying Word Embeddings for Similarity and Relatedness (Torabi Asr et al., NAACL 2018)
Copy Citation:
PDF:
https://aclanthology.org/N18-1062.pdf