A Comparison of Word Embeddings for English and Cross-Lingual Chinese Word Sense Disambiguation

Hong Jin Kang; Tao Chen; Muthu Kumar Chandrasekaran; Min-Yen Kan

A Comparison of Word Embeddings for English and Cross-Lingual Chinese Word Sense Disambiguation

Hong Jin Kang, Tao Chen, Muthu Kumar Chandrasekaran, Min-Yen Kan

Abstract

Word embeddings are now ubiquitous forms of word representation in natural language processing. There have been applications of word embeddings for monolingual word sense disambiguation (WSD) in English, but few comparisons have been done. This paper attempts to bridge that gap by examining popular embeddings for the task of monolingual English WSD. Our simplified method leads to comparable state-of-the-art performance without expensive retraining. Cross-Lingual WSD – where the word senses of a word in a source language come from a separate target translation language – can also assist in language learning; for example, when providing translations of target vocabulary for learners. Thus we have also applied word embeddings to the novel task of cross-lingual WSD for Chinese and provide a public dataset for further benchmarking. We have also experimented with using word embeddings for LSTM networks and found surprisingly that a basic LSTM network does not work well. We discuss the ramifications of this outcome.

Anthology ID:: W16-4905
Volume:: Proceedings of the 3rd Workshop on Natural Language Processing Techniques for Educational Applications (NLPTEA2016)
Month:: December
Year:: 2016
Address:: Osaka, Japan
Editors:: Hsin-Hsi Chen, Yuen-Hsien Tseng, Vincent Ng, Xiaofei Lu
Venue:: NLP-TEA
SIG:
Publisher:: The COLING 2016 Organizing Committee
Note:
Pages:: 30–39
Language:
URL:: https://aclanthology.org/W16-4905/
DOI:
Bibkey:
Cite (ACL):: Hong Jin Kang, Tao Chen, Muthu Kumar Chandrasekaran, and Min-Yen Kan. 2016. A Comparison of Word Embeddings for English and Cross-Lingual Chinese Word Sense Disambiguation. In Proceedings of the 3rd Workshop on Natural Language Processing Techniques for Educational Applications (NLPTEA2016), pages 30–39, Osaka, Japan. The COLING 2016 Organizing Committee.
Cite (Informal):: A Comparison of Word Embeddings for English and Cross-Lingual Chinese Word Sense Disambiguation (Kang et al., NLP-TEA 2016)
Copy Citation:
PDF:: https://aclanthology.org/W16-4905.pdf

PDF Cite Search Fix data