Cross-Lingual Word Embeddings and the Structure of the Human Bilingual Lexicon

Paola Merlo, Maria Andueza Rodriguez


Abstract
Research on the bilingual lexicon has uncovered fascinating interactions between the lexicons of the native language and of the second language in bilingual speakers. In particular, it has been found that the lexicon of the underlying native language affects the organisation of the second language. In the spirit of interpreting current distributed representations, this paper investigates two models of cross-lingual word embeddings, comparing them to the shared-translation effect and the cross-lingual coactivation effects of false and true friends (cognates) found in humans. We find that the similarity structure of the cross-lingual word embeddings space yields the same effects as the human bilingual lexicon.
Anthology ID:
K19-1011
Volume:
Proceedings of the 23rd Conference on Computational Natural Language Learning (CoNLL)
Month:
November
Year:
2019
Address:
Hong Kong, China
Editors:
Mohit Bansal, Aline Villavicencio
Venue:
CoNLL
SIG:
SIGNLL
Publisher:
Association for Computational Linguistics
Note:
Pages:
110–120
Language:
URL:
https://aclanthology.org/K19-1011
DOI:
10.18653/v1/K19-1011
Bibkey:
Cite (ACL):
Paola Merlo and Maria Andueza Rodriguez. 2019. Cross-Lingual Word Embeddings and the Structure of the Human Bilingual Lexicon. In Proceedings of the 23rd Conference on Computational Natural Language Learning (CoNLL), pages 110–120, Hong Kong, China. Association for Computational Linguistics.
Cite (Informal):
Cross-Lingual Word Embeddings and the Structure of the Human Bilingual Lexicon (Merlo & Andueza Rodriguez, CoNLL 2019)
Copy Citation:
PDF:
https://aclanthology.org/K19-1011.pdf
Attachment:
 K19-1011.Attachment.zip