Taghreed Alqaisi
2019
En-Ar Bilingual Word Embeddings without Word Alignment: Factors Effects
Taghreed Alqaisi
|
Simon O’Keefe
Proceedings of the Fourth Arabic Natural Language Processing Workshop
This paper introduces the first attempt to investigate morphological segmentation on En-Ar bilingual word embeddings using bilingual word embeddings model without word alignment (BilBOWA). We investigate the effect of sentence length and embedding size on the learning process. Our experiment shows that using the D3 segmentation scheme improves the accuracy of learning bilingual word embeddings up to 10 percentage points compared to the ATB and D0 schemes in all different training settings.
Search