GX at SemEval-2021 Task 2: BERT with Lemma Information for MCL-WiC Task

Wanying Xie


Abstract
This paper presents the GX system for the Multilingual and Cross-lingual Word-in-Context Disambiguation (MCL-WiC) task. The purpose of the MCL-WiC task is to tackle the challenge of capturing the polysemous nature of words without relying on a fixed sense inventory in a multilingual and cross-lingual setting. To solve the problems, we use context-specific word embeddings from BERT to eliminate the ambiguity between words in different contexts. For languages without an available training corpus, such as Chinese, we use neuron machine translation model to translate the English data released by the organizers to obtain available pseudo-data. In this paper, we apply our system to the English and Chinese multilingual setting and the experimental results show that our method has certain advantages.
Anthology ID:
2021.semeval-1.92
Volume:
Proceedings of the 15th International Workshop on Semantic Evaluation (SemEval-2021)
Month:
August
Year:
2021
Address:
Online
Editors:
Alexis Palmer, Nathan Schneider, Natalie Schluter, Guy Emerson, Aurelie Herbelot, Xiaodan Zhu
Venue:
SemEval
SIG:
SIGLEX
Publisher:
Association for Computational Linguistics
Note:
Pages:
706–712
Language:
URL:
https://aclanthology.org/2021.semeval-1.92
DOI:
10.18653/v1/2021.semeval-1.92
Bibkey:
Cite (ACL):
Wanying Xie. 2021. GX at SemEval-2021 Task 2: BERT with Lemma Information for MCL-WiC Task. In Proceedings of the 15th International Workshop on Semantic Evaluation (SemEval-2021), pages 706–712, Online. Association for Computational Linguistics.
Cite (Informal):
GX at SemEval-2021 Task 2: BERT with Lemma Information for MCL-WiC Task (Xie, SemEval 2021)
Copy Citation:
PDF:
https://aclanthology.org/2021.semeval-1.92.pdf
Code
 yingwaner/bert4wic
Data
WiC